Navigation Links
-------------

Home
-------------

Viewing the
Online Catalog

-------------

Go to Online
Catalog

-------------

Publications
-------------

Newsletters
-------------

Workshops
-------------

Case Studies
-------------

Ralph Reitan
Bibliography

-------------

A to Z Product
Listing

-------------

Related Internet
Links

-------------

PDF Downloads
-------------

 

THE HALSTEAD-REITAN NEUROPSYCHOLOGICAL TEST BATTERY FOR ADULTS:
THEORETICAL, METHODOLOGICAL, AND VALIDATIONAL BASES

Ralph M. Reitan
Deborah Wolfson

Send a comment to the authors about this paper

Reprinted from: Reitan, RM and Wolfson, D. 2004. Comprehensive Handbook of Psychological Assessment. Volume 1. Intellectual and Neuropsychological Assessment. G. Goldstein and S. Beers, Eds. Hoboken, NJ: John Wiley and Sons. All rights reserved.

This paper may be downloaded and printed for personal use only. Duplication by any means for any other purpose is strictly prohibited by copyright law.


CONTENTS

I. Background and Development of the Halstead-Reitan Battery (HRB)

II. Practical Considerations in the Development of a Neuropsychological Test Battery

A. Relation to Specialized Neurological Diagnostic Procedures

B. "Fixed" versus "Flexible" Neuropsychological Testing Procedures

C. Differing Approaches to Neuropsychological Interpretation

D. The Development and Incorporation of Multiple and Complementary Methods of Inferring Cerebral Damage into a Single Battery

III. Content of the Halstead-Reitan Battery and its Conceptual and Theoretical Bases

IV. Validation of the Halstead-Reitan Neuropsychological Test Battery

A. General Neuropsychological Effects of Cerebral Damage

B. Lateralization Effects

C. Clinical Inferences Regarding Individual Subjects

V. The Halstead-Reitan Battery and Current Problems in Neuropsychology

A. A Summary Index of Overall Performance on the HRB

B. Practical and Legal Implications in the Use of the Halstead-Reitan Battery

C. Traumatic Brain Injury and the Halstead-Reitan Battery

D. Mild Head Injury

E. Frontal Lobe Deficits

F. Differing Effects of Age and Education in Brain-Damaged versus Non-Brain-Damaged Groups

G. Development of a Dissimulation Index Using Variables from the Halstead-Reitan Battery and the Wechsler Scale

VI. Recent Research with the Halstead-Reitan Battery
 

VII. REHABIT: A Structured Program for Retraining Neuropsychological Abilities Based on the Halstead-Reitan Battery
 

VIII. Summary
 

References


I.     Background and Development of the Halstead-Reitan Battery (HRB)  Return to Contents

After earning his PhD in physiological and comparative psychology at Northwestern University in 1935, Ward Halstead moved to the University of Chicago, established a working relationship with two neurosurgeons, Percival Bailey and Paul Bucy, and started the first full-time laboratory for the examination and evaluation of brain-behavior relationships in human beings.

Halstead's first step in evaluating patients was to observe them in their everyday living situations and attempt to discern which aspects of their behavior were different from the behavior of normal individuals. Fortunately, Halstead was relatively unencumbered by knowledge of the routine approach that existed in this era concerning the effects of brain damage. In fact, at this time and for quite a number of years previously, the principal approach of psychologists was to develop a single test that would diagnose brain damage.

Halstead's observations of persons with cerebral lesions made it apparent at the very beginning that brain-damaged individuals had a wide range of deficits and that a single test would not be able adequately to identify and evaluate the severity of their deficits. Some patients demonstrated motor and sensory-perceptual disorders, often involving one side of the body more than the other. Some patients had specific language deficits representing dysphasia. Other individuals were confused in a more general yet pervasive way (even though in casual contact they often appeared to be quite intact).

As Halstead studied brain-damaged persons in their routine everyday living situations, he observed that most of them seemed to have difficulties in understanding the essential nature of complex problem-situations, in analyzing the circumstances that they had observed, and in reaching meaningful conclusions about the situations they faced in everyday life.

The initial orientation and approach to research and evaluation may have long-term implications, and Halstead's approach was of significance in the eventual clinical use of his tests. Binet and Simon (1916), for example, began developing intelligence tests using academic competence as the criterion. IQ tests, though used to assess cerebral damage, are probably still best known for their relationship to school success. It was important and fortuitous that Halstead decided to observe the adaptive processes and difficulties that brain-damaged persons demonstrated in everyday life. If he had focused only upon academic achievement or classroom activities, it is probable that he would have developed tests that are quite different from the ones he did produce. The tests that are included in the Halstead-Reitan Battery emanated from a general consideration of neuropsychological impairment, and as a result, have much more relevance than IQ measures to practical aspects of rehabilitation and adaptive abilities.

Halstead devised and experimented with numerous psychological testing procedures. The design of his instruments placed them more in the context of standardized experiments than of conventional psychometric tests. Many of the procedures developed by Halstead contrasted sharply with conventional psychometric testing procedures of that time, inasmuch as they required the subject not only to solve the problem, but to observe the nature of the problem, analyze the essential elements, and after having defined the problem, proceed to solve it.

Halstead developed a series of 10 tests that ultimately formed the principal basis for his concept of biological intelligence. He described these tests and presented his theory in his book, Brain and Intelligence: A Quantitative Study of the Frontal Lobes (Halstead, 1947). Only seven of the original ten tests have withstood the rigors of both clinical and experimental evaluation. These seven tests constitute a substantial component of the Halstead-Reitan Battery, and are the seven tests used to compute the Halstead Impairment Index.

It became apparent that Halstead's seven tests had to be supplemented with an extensive array of additional procedures for the battery to attain clinical usefulness. The General Neuropsychological Deficit Scale (GNDS), a measure that provides an overall characterization of an individual's neuropsychological abilities and has proved to be highly sensitive to the effects of brain damage as well as clinically useful as a method of comparing areas of deficit, is based upon a total of 42 variables derived from measures in the Halstead-Reitan Battery (including the original seven tests developed by Halstead) (Reitan & Wolfson, 1988, 1993).

Another significant feature of the Halstead-Reitan Battery is that it was developed and validated not only by formal research, but also by evaluating thousands of patients with brain lesions and control subjects. This procedure involved administering the tests, evaluating the results, preparing a written report about their significance for brain functions, and only then turning to the independent data produced by complete and independent evaluations based on clinical tests and diagnostic studies by neurologists, neurological surgeons, and neuropathologists.

After the two sets of data had been collected, conferences were held to correlate the neuropsychological findings with the independently obtained history and neurological, neurosurgical, and neuropathological data (concerning location, type, duration, etc. of the lesion). In this way each subject served as an independent experiment. Using this procedure, Reitan studied over 8,000 patients. The Halstead-Reitan Battery (HRB) served as a focus for research in the area of human brain-behavior relationships long before the area of clinical neuropsychology emerged as a discipline, and in this sense the Halstead-Reitan Battery has played a significant role in the development of the field of neuropsychology.

During the 1950's and early 1960's published research from the Reitan Neuropsychology Laboratory stimulated much interest; however, it also generated a considerable degree of skepticism and criticism from psychologists as well as neurologists and neurological surgeons. There was a general reluctance toward accepting findings that proposed that the higher-level aspects of brain functions (which had been elusive for so many years) could actually be measured, quantified, and even used as a basis for predicting "blindly" the location and type of brain lesions (Reitan, 1964). However, the HRB was used with increasing frequency in many hospitals and clinics in the United States and other countries, and the apparent validity of the findings among researchers and clinicians gradually overcame this kind of skepticism.

Hartlage and DeFilippis (1983) reported the results of a survey conducted in 1980 of all neuropsychologists in the National Academy of Neuropsychologists and all members of the Division of Clinical Neuropsychology of the American Psychological Association. The results indicated that 89% of the respondents used the Wechsler scales in their neuropsychological assessments, 56% included portions of the Halstead-Reitan Battery, 49% used the Bender-Gestalt Test, 38% used the entire Halstead-Reitan Battery, 32% employed the Benton Visual Retention Test, and 31% used the Luria-Nebraska Battery. McCaffrey and Isaac (1984) reported that in 1982, 65% of their respondents used the HRB; in the later survey by McCaffrey and Lynch (1996) found that this percentage had risen to 77%.

Dean (1985) reviewed the Halstead-Reitan Neuropsychological Test Battery in the Ninth Mental Measurements Yearbook. In his review, he indicated that, "neuropsychological assessment in North America has focused on the development of test batteries that would predict the presence of brain damage while offering a comprehensive view of a patient's individual functions. Numerous batteries have been offered as wide-band measures of the integrity and functioning of the brain. However, the HRB remains the most researched and widely utilized measure in the United States" (p. 642).

Meier (1985) also reviewed the Halstead-Reitan Neuropsychological Test Battery and had the following introductory comment:

"This comprehensive neuropsychological test battery has a long and illustrious history of clinical research and application in American clinical neuropsychology. Following its inaugural presentation to the psychological community (Halstead, 1947), and the careful nurturance of concept and application by Reitan (1974), the battery has had perhaps the most widespread impact of any approach in clinical neuropsychology. It seems reasonable to state that in the first half of the period since World War II, during which neuropsychology expanded so remarkably, this approach was the primary force in stimulating clinical research and application in this country" (p. 646).

II.     Practical Considerations in the Development of a Neuropsychological Test Battery   Return to Contents

A valid neuropsychological test battery cannot be developed without meeting a number of criteria that relate to relationships with extensively derived indicators of brain damage as well as complementary approaches in assessment.

A. Relation to Specialized Neurological Diagnostic Procedures  Return to Contents

The relationship between neuropsychological testing and the many excellent specialized medical diagnostic procedures (especially computerized tomography (CT) and magnetic resonance imaging (MRI) must be evaluated. Comparisons of test data with the results of brain imaging procedures is important in validating neuropsychological tests. However, it has also been a cause of concern regarding the independent value of neuropsychological testing to those psychologists and physicians who consider the principal purpose of neuropsychological examination to be the diagnosis of brain damage rather than evaluation of brain-behavior relationships.

It is important to recognize that neuropsychological evaluation draws on a different domain of evidence, to a major extent, than the diagnostic methods used by neurologists and neurological surgeons do. The Halstead-Reitan Battery focuses on higher-level aspects of brain functions (and more specifically on central processing), whereas the neurological diagnostic procedures relate principally to lower-level aspects of brain functioning or non-behavioral variables, such as electrophysiological manifestations or brain imaging procedures. Neuropsychological evaluation provides the only rigorous and objective method of assessing higher-level aspects of brain functions.

CT and MRI scans are highly accurate diagnostic techniques. Both procedures identify structural abnormalities in the brain, such as space-occupying lesions and the types of tissue abnormalities characteristic of diseases, such as multiple sclerosis. However, in cases of closed head injury, in which small vascular lesions or shearing of neurons may occur on a widespread basis, CT and MRI scans are frequently within normal limits. Despite their sensitivity to structural lesions and disorders of brain functions, even functional MRI, positron emission tomography, and single photon emission spectroscopy are often not able to identify deficits in higher-level aspects of brain functions. An important need, therefore, has arisen for detailed studies of the correlation between these imaging procedures and neuropsychological test results, because imaging procedures cannot evaluate an individual's general intelligence or other adaptive abilities. (The usefulness of the clinical neurological examination and specialized neurological diagnostic procedures is discussed in detail in Reitan & Wolfson, 1992a).

The fact that results from the Halstead-Reitan Battery have been shown to correlate closely with the findings of the overall neurological examination validates the neuropsychological data as indicators of the biological condition of the brain. However, it is important to recognize that these various procedures complement rather than compete with each other in providing information about the patient. If an individual's higher-level brain functions need to be evaluated, a neuropsychological examination must be performed to obtain the relevant information.

B. "Fixed" versus "Flexible" Neuropsychological Testing Procedures  Return to Contents

The comparative advantages of fixed (or standard) versus flexible (or composed to meet the immediate assessment needs) batteries have received a considerable amount of attention (Incagnoli, Goldstein, & Golden, 1986).

The terms "fixed battery" and "flexible battery" are misleading, insofar as fixed may be taken to mean inflexible and not able to be supplemented with other tests. Flexible, on the other hand, may imply that adaptations are possible to meet the requirements of the individual situation without a loss of validity in evaluation. In practice, fixed batteries could be equated with a standard set of instruments that has been validated through extensive research with thousands of patients, with resulting consensus among thousands of psychologists around the world.

In contrast, the psychologist evaluating the patient, usually according to the history and complaints of each patient, decides upon the composition of flexible batteries. To the degree that they differ from standard batteries, these individualized series of tests have never been evaluated as a battery in research studies or assessed by consensus of other psychologists.

Flexible batteries tend to deny the concept of a battery of tests that has been designed to assess human brain functions. In this sense, fixed batteries might be better labelled as "validated" batteries, and flexible batteries as "casually composed" batteries that have never been rigorously validated or, in most instances, evaluated with regard to accuracy in diagnosing a single pathological condition (such as traumatic brain injury, cerebrovascular disease, and so on). To the extent that they are composed to assess specific referral complaints or the subjective complaints of the patient (Christensen, 1975), flexible batteries may be referred to as "symptom-oriented" batteries, with the validity and range of the battery depending on the accuracy and completeness of complaints offered by each subject examined.

Supposed advantages of the flexible battery include the prerogative to select tests to evaluate specific areas of function that accord with the patient's complaints. Of course, such a procedure might be circular in nature if the end result were only to confirm through psychological testing the patient's own initial self-evaluation (self-diagnosis).

Further, if the patient's subjective complaints are not sufficiently comprehensive, or if the patient is not able to offer an adequate and complete self-diagnosis, the resulting test battery may fail to recognize and evaluate significant areas of cerebral dysfunction. In any case, the series of tests selected by the psychologist will demonstrate a range of scores. Consequently, using this approach, the tests with low scores are usually selected as indicators of cognitive impairment.

In contrast, the fixed battery approach uses a constant group of tests. In this sense, the Halstead-Reitan Battery can be thought of as a comprehensive battery, and validated on thousands of patients to ensure that all relevant areas of brain functions are included in the assessment, thereby providing a balanced representation of the various neuropsychological functions subserved by the brain.

Finally, each test in the Halstead-Reitan Battery has been subjected to the rigorous research that is necessary to validate it as a neuropsychological test. In addition, the tests in the HRB have also been validated for their complementary significance and interpretation for assessment of the individual.

Results obtained on the Halstead-Reitan Battery regularly demonstrate the advantage of using a standard and comprehensive battery of tests. As shown in formal studies (Reitan, 1964), it is often possible, using only the HRB test results, to determine that cerebral damage has been sustained, to identify the area or location of principal involvement, and to assess whether the injury is recent or the brain has had an opportunity to stabilize. Further, from the point of view of rehabilitation, it is imperative to have a balanced representation of the nature and degree of neuropsychological deficit.

Considering the importance of assessing all neuropsychological aspects of brain function in a balanced and comparative manner, and the advantages of determining the complementary nature of results on various tests (thus achieving a battery effect in which the sum is greater than the individual components), and considering the obvious facilitation of validational research, Halstead and Reitan decided at the beginning develop a standardized battery rather than use a variable selection of individual tests.

        C. Differing Approaches to Neuropsychological Interpretation  Return to Contents

Neuropsychologists have developed differing attitudes toward neuropsychological testing and the use of supplemental information to obtain valid results. The problem appears to arise from an attempt to emulate the medical model, or more specifically, the procedures used in neurology. In the field of neurology, it is routinely recommended that the results of the EEG, CT, MRI, and other specialized diagnostic procedures be evaluated only in the context of the patient's clinical history. This approach represents a double-edged sword inasmuch as history information may influence the interpretation of the diagnostic procedure or the diagnostic procedure may be used to controvert the history information.

Some neuropsychologists contend that neuropsychological evaluation is of value only if the test results are interpreted with full knowledge of the patient and his/her usual behavior. In some instances, the neuropsychologist requires personal observation of the patient at considerable length and in various types of settings in order to obtain enough information to be able to interpret the test results. In other instances, the neuropsychologist requires the findings of other professionals, knowledge of the subject's demographic and personal behavior characteristics, detailed academic and occupational histories, and interaction with family members and other persons in the environment as a basis for interpreting the neuropsychological test results. Psychologists who adopt this approach obviously feel that neuropsychological test data does not constitute a significant source of independent information and needs to be interpreted in the framework of complementary sources of information to achieve validity.

Other neuropsychologists believe that neuropsychological testing is of value, but that a score or index cannot adequately reflect a subject's performances. This contention derives historically from the insistence by Kurt Goldstein (1942a, 1942b) that the procedures used by brain-damaged individuals to solve problems differ from the methods utilized by persons with normal brain functions, and that the principal value of the testing has been lost if the procedures used by the patient are not observed and recorded. Goldstein believed that a final score, representing the adequacy of the subject's performances, was entirely inadequate, and that the process by which the individual achieved the end result was of critical importance. This contention has been carried forward by a number of other investigators (most notably Edith Kaplan), who have differentiated performances on the same task in accordance with the types of errors made by the subject, and related such information to neuropsychological interpretations of brain functions.

A final approach to this problem, represented principally by Reitan and others, recommends that the neuropsychological examination evaluate brain functions generally for every subject, and that a "blind" interpretation of the test findings be prepared before referring to any history information, conclusions reached by other professionals, or the results of additional diagnostic procedures. If the psychologist administers the tests personally, it is inescapable that he/she will observe the subject's performances and will formulate opinions based on the process the subject used to perform a task rather than on the test scores alone. The important point in this procedure, however, is related to the sequence the neuropsychologist follows to evaluate the subject. We recommend the sequential steps described below.

First, the neuropsychological test data is evaluated, including notations about any deviations from standard testing procedures, and perhaps even the process the subject uses to reach a solution. This first step should be completed before referring to any additional information, such as the history, findings of other examinations, or conclusions of other professionals. This procedure allows the neuropsychologist to evaluate the relevance of the test data and determine the extent to which it can make an independent contribution to understanding the patient's brain functions and cognitive structure.

After interpreting the neuropsychological test data "blindly," all additional information should be evaluated, including the complete history, the results of examinations by other professionals, the findings of neurological diagnostic procedures, the individual's behavior in everyday living situations, his/her interaction with family members and other significant persons in the environment, and the academic, employment, military, and professional records. Wolfson (1985) has systematized our history-gathering procedures by preparing a formal guide that explores an extensive range of relevant areas. The reader will notice that our method differs from the other approaches mentioned above only in terms of the sequence in which information is reviewed. In our system, the final interpretation of the data depends (as it does in the other approaches) upon an integration and evaluation of the consistency of all available information relating to the patient.

We prefer to avoid the risk of prejudicing the interpretation of the neuropsychological test results with supplemental information. We therefore perform an initial "blind" interpretation of the test data, and then relate these findings to any other available information. It is important to recognize that much of the history information and personal observations of the patient's behavior are obtained under inadequately standardized and uncontrolled conditions, which certainly impair interjudge reliability and the potential for unprejudiced interpretation of the test data.

A number of neuropsychologists have emphasized the importance of supplementing the information provided by the quantitative testing with observations of qualitative behavior. Such qualitative observations can be useful, but it must be recognized that some psychologists are more adept than others in reaching valid conclusions on the basis of their behavioral observations. In fact, if conclusions are based on impressions of qualitative aspects of behavioral manifestations, a considerable degree of interjudge unreliability is likely to result.

Many studies have shown that behavioral ratings among psychologists and/or psychiatrists tend to be quite variable, even when the judges are thoroughly experienced. However, these kinds of problems of reliability do not necessarily detract from the valid observations that may be made by some judges. In our own practice we have observed many subjects experiencing the stress involved with neuropsychological testing. The Tactual Performance Test, for example, is often more stressful for brain-damaged persons than individuals with normal brain functions. Thus, the difficulty-level of the task and the subject's reactions during the testing almost certainly contribute information over and beyond the quantitative score for the test.

Halstead and Reitan, having observed the various approaches to problem-solving utilized by normal and brain-damaged persons, were keenly aware of the potential implicit in assessing a subject's qualitative performances during neuropsychological testing. They were interested, nevertheless, in exploring the potential value of standardized procedures that produced quantitative scores as a basis for evaluating brain-behavior relationships. It is generally accepted that a standardized procedure that produces a quantitative score is more replicable and reproducible - and thus has greater reliability - than judgments about the qualitative aspects of behavior.

In discussing the strategy that they wished to employ in neuropsychological examination, Halstead and Reitan believed that it would be desirable to differentiate and separate the quantitative scores (which more closely meet the hallmark in science of replicability) from the observations of the patient's behavior. This decision was not intended to either deny the potential value of behavioral observations or to rule out the use of behavioral observations to supplement and complement the quantitative data produced by neuropsychological testing. However, if neuropsychology was to be established as a science rather than an art, it was critical to define the standardized testing procedures that would produce quantitative scores separate from the clinician's insightful observations of the subject's behavior.

It is not surprising that many investigators continue to emphasize the importance of qualitative observations. In fact, it appears that neuropsychologists who depend upon these procedures have a clinical orientation, and through their training have developed skills in understanding impaired brain functions through the subject's behavioral manifestations. Proponents of this approach may also include psychologists who have had a lesser degree of success in validating their conclusions based upon interpretation of quantitative scores.

The debate concerning the use of qualitative versus quantitative methods for assessment of the effects of cerebral damage has a long history. Qualitative observations and analyses of behavior necessarily precede translation of such observations into standardized and quantified neuropsychological tests, which is the precise procedure used by Halstead and Reitan in developing the tests included in the HRB. Kurt Goldstein (1942a, 1942b) argued that only quantitative evaluations were valid, and that quantitative assessments might only confuse the issue because a brain-impaired and a normal subject might earn the same quantitative score but use different approaches to do so. Goldstein claimed that brain damage inevitably produced a "concrete" approach to problem-solving, whereas the person with normal brain functions was able to adopt an "abstract" approach. According to Goldstein, it was imperative to observe the performances of brain-damaged subjects in order to determine whether the brain damage had imposed any limitations on the individual's cognitive functioning.

Luria (personal communication, 1967) had a very similar orientation towards this question. In a letter to one of the authors (RMR), he stated quite explicitly that little, if anything, could be gained by translating neuropsychological deficits into quantitative values, and believed that it was necessary to observe the performance style of the brain-damaged individual in order to understand that person's limitations.

As we have noted, the use of a standardized and replicable procedure in neuropsychological examination in no way obviates or even interferes with the use of clinical observation by those neuropsychologists who feel that they are skilled in this respect. At the same time, clinical observation (as contrasted with quantitative measurement of performances) should not be proposed -- as Goldstein and Luria have done -- as the only method that can validly identify the deficits of brain-injured persons.

D. The Development and Incorporation of Multiple and Complementary Methods of Inferring Cerebral Damage into a Single Battery   Return to Contents

A critical incident that led to the development of the HRB as a battery that describes the uniqueness of brain impairment of individual subjects occurred in 1945. One of the authors (RMR) had examined an extremely competent physician who had sustained a depressed skull fracture and complained of various significant deficits following the injury. Despite his complaints, this physician consistently performed well above average on the tests he had been given. Although the examination demonstrated that he had excellent abilities, his complaints seemed clinically valid. If this physician was indeed impaired as compared to his premorbid status, we needed to find a way to assess his deficits. Thus, the challenge was set. The problem required a methodology that described the impairment in the individual subject, regardless of whether the subject's premorbid level of functioning was high or low.

We realized that it was necessary to design the battery so that methods of inference other than level of performance would contribute to the clinical conclusions. Four approaches could be used to evaluate a subject's performances: (1) level of performance (how well the subject performed on a test compared to other subjects), (2) pathognomonic signs (specific deficits that rarely occur without cerebral damage), (3) patterns and relationships among test results (identification of score patterns that reflected localized or lateralized damage as well as relationships which compared premorbid abilities with evidence of acquired impairment), and (4) comparisons of a subject's performances on the same test on each side of the body.

A great deal of formal research and clinical evaluation was necessary to develop tests that met the above requirements and also were valid measurements of the biological integrity of the brain. It was necessary that the tests complemented each other in interpretation, were as economical as possible in the time required for administration, were consistently sensitive to cerebral damage or dysfunction across a broad range of neurological conditions, and reflected an individual's deficits in a balanced and equivalent manner.

III. Content of the Halstead-Reitan Battery and its Conceptual and Theoretical Bases   Return to Contents

Theory-building requires basic facts on which to develop constructs, and the short lives of many of the molar theories of brain-behavior relationships can be directly attributed to their inconsistency with the facts (see Reitan & Wolfson, 1988, 1993 for a review of theories in clinical neuropsychology). Our method of developing a theory differed from those customarily used. We tried at first to generate a fairly extensive body of facts which in turn could lead us to a broader conceptualization or generalization about relationships between the brain and neuropsychological functions. Our approach was therefore "fact-driven" rather than "theory-driven." We tried initially to meet the methodological requirements noted above and to compose a set of tests (the HRB) that was consistently valid, in both research and clinical application, as a basis for describing the ways in which the brain relates to behavior. Our empirical approach, guided by validated research findings and clinical verification in the individual case, led to the development of the Reitan-Wolfson model of neuropsychological functioning. We believe that this procedure may have more objectivity than an approach that postulates a theory and then searches for facts to support it.

The brain-based functions an individual needs in order to be efficient in his or her everyday behavior fall in several categories. The Reitan-Wolfson model of neuropsychological functioning(see Fig. 1) provides a conceptual framework for organizing the behavioral correlates of brain functions and describing the tests that measure these functions.

A neuropsychological response cycle first requires input to the brain from the external environment via one or more of the sensory avenues. Primary sensory areas are located in each cerebral hemisphere, indicating that this level of central processing is widely represented in the cerebral cortex and involves the temporal, parietal, and occipital areas particularly (see Reitan & Wolfson, 1993 for a description of the anatomical structures and systems for each of the elements of the Reitan-Wolfson model).

Once sensory information reaches the brain, the first step in central processing is the "registration phase" and represents alertness, attention, continued concentration, and the ability to screen incoming information in relation to prior experiences (immediate, intermediate, and remote memory). When evaluating this level of functioning, the neuropsychologist is concerned with answering questions such as, How well can this individual pay attention to a specified task? Can he/she utilize past experiences (memory) effectively and efficiently to reach a reasonable solution to a problem? Can the person understand and follow simple instructions?

If an individual's brain is not capable of registering incoming information, relating the new information to past experiences (memory), and establishing the relevance of the information, the subject is almost certainly seriously impaired in everyday behavior. A person who is not able to maintain alertness and a degree of concentration is likely to make very little progress as he\she attempts to solve a problem. Persons with such severe impairment have limited opportunity to effectively utilize any of the other higher level abilities that the brain subserves, and they tend to perform quite poorly on almost any task presented to them.

Because alertness and concentration are necessary for all aspects of problem-solving, a comprehensive neuropsychological test battery should include measures that evaluate the subject's attentiveness. Such tests should not be complicated and difficult, but should require the person to pay close attention over time to specific stimulus material. The Halstead-Reitan Battery evaluates this first level of central processing primarily with two measures: the Speech-sounds Perception Test (SSPT) and the Rhythm Test.

The Speech-sounds Perception Test consists of 60 spoken nonsense words that are variants of the "ee" sound. The stimuli are presented on a tape recording, and the subject responds to each stimulus by underlining one of four alternatives printed on an answer sheet.

The SSPT requires the subject to maintain attention through the 60 items, perceive the spoken stimulus through hearing, and relate the perception through vision to the correct configuration of letters on the test form.

The Rhythm Test requires the subject to differentiate between 30 pairs of rhythmic beats. The stimuli are presented by a standardized tape recording. After listening to a pair of stimuli, the subject writes "S" on the answer sheet if he/she thinks the two stimuli sounded the same, and writes "D" if they sounded different.

The Rhythm Test requires alertness to nonverbal auditory stimuli, sustained attention to the task, and the ability to perceive and compare different rhythmic sequences. Although many psychologists have presumed that the Rhythm Test is dependent upon the integrity of the right hemisphere (because the content is nonverbal), the test is actually an indicator of generalized cerebral functions and has no lateralizing significance (Reitan & Wolfson, 1989).

After an initial registration of incoming material, the brain customarily proceeds to process verbal information in the left cerebral hemisphere and visual-spatial information in the right cerebral hemisphere. At this point the specialized functions of the two hemispheres become operational.

The left cerebral hemisphere is particularly involved in speech and language functions, or the use of language symbols for communication purposes. It is important to remember that deficits may involve quite simple kinds of speech and language skills, or conversely, may involve sophisticated higher-level aspects of verbal communication. It also must be recognized that language functions may be impaired in terms of expressive capabilities, receptive functions, or both (Reitan, 1984). Thus, the neuropsychological examination must assess an individual's ability to express language as a response, to understand language through both the auditory and visual avenues, and to complete the entire response cycle, which consists of perception of language information, central processing and understanding of its content, and the development of an effective response.

The Halstead-Reitan Battery measures both simple and complex verbal functions. The Reitan-Indiana Aphasia Screening Test (AST) is used to evaluate language functions such as naming common objects, spelling simple words, reading, writing, enunciating, identifying individual numbers and letters, and performing simple arithmetic computations.

The AST is organized so that performances are evaluated in terms of the particular sensory modalities through which the stimuli are perceived. Additionally, the receptive and expressive components of the test allow the neuropsychologist to judge whether the limiting deficit for a subject is principally receptive or expressive in character. The verbal subtests of the WAIS are also used to obtain information about verbal intelligence.

Right cerebral hemisphere functions are particularly involved with spatial abilities (mediated principally by vision but also by touch and auditory function) and spatial and manipulatory skills (Reitan, 1955a; Wheeler & Reitan, 1962). It is again important to remember that an individual may be impaired in the expressive aspects or the receptive aspects of visual-spatial functioning, or both. It must also be kept in mind that we live in a world of time and space as well as in a world of verbal communication. Persons with impairment of visual-spatial abilities are often severely handicapped in terms of efficiency of functioning in a practical, everyday sense.

The HRB assesses visual-spatial functions with simple as well as complex tasks. Particularly important are the drawings of the square, cross, and triangle of the Aphasia Screening Test, the WAIS Performance subtests, and to an extent, Parts A and B of the Trail Making Test.

In evaluating the drawings on the AST, the criterion of brain damage relates to specific distortions of the spatial configurations rather than to artistic skill. The square and triangle are relatively simple figures, and do not usually challenge an individual's appreciation and production of spatial configurations. The cross involves many turns and a number of directions, and can provide significant information about a subject's understanding of visual-spatial form.

Comparisons of performances on the two sides of the body, using both motor and sensory-perceptual tasks, provide information about the integrity of each cerebral hemisphere, and more specifically, about areas within each hemisphere. Both finger tapping and grip strength yield information about the posterior frontal (motor) areas of each cerebral hemisphere.

The Tactual Performance Test (TPT) requires complex problem-solving skills and can provide information about the adequacy of each cerebral hemisphere. The subject is blindfolded before the test begins and is not permitted to see the formboard or blocks at any time. The first task is to fit the blocks into their proper spaces on the board using only the preferred hand. After completing this task (and without having been given prior warning), the subject is asked to perform the same task using only the nonpreferred hand. Finally, and again without prior warning, the task is repeated a third time using both hands. The amount of time required to perform each of the three trials provides a comparison of the efficiency of performance of the two hands. The Total Time score of the test reflects the amount of time needed to complete all three trials.

After the subject has completed the third trial, the board and blocks are taken out of the testing area and the subject's blindfold is removed. The subject is then asked to draw a diagram of the board with the blocks in their proper spaces. The Memory score is the number of shapes correctly remembered; the Localization score is the number of blocks correctly identified by both shape and position on the board.

 

 

An important aspect of the Tactual Performance Test relates to the neurological model. The test's design and procedure allows the functional efficiency of the two cerebral hemispheres to be compared and provides information about the general efficiency of brain functions. During the first trial, data is being transmitted from the preferred hand to the contralateral cerebral hemisphere (usually from the right hand to the left cerebral hemisphere). Under normal circumstances, positive practice-effect results in a reduction of time of about one-third from the first trial to the second trial. A similar reduction in time occurs between the second trial and the third trial.

The TPT is undoubtedly a complex task in terms of its motor and sensory requirements, and successful performance appears to be principally dependent on the middle part of the cerebral hemispheres. Ability to correctly place the variously shaped blocks on the board depends upon tactile form discrimination, kinesthesis, coordination of movement of the upper extremities, manual dexterity, and an appreciation of the relationship between the spatial configuration of the shapes and their location on the board. Obviously, the TPT is considerably more complex in its problem-solving requirements than either finger tapping or grip strength.

The tests for bilateral simultaneous sensory stimulation include tactile, auditory, and visual stimuli. Impaired perception of stimulation occurs on the side of the body contralateral to a damaged hemisphere.

The Tactile Form Recognition Test requires the subject to identify shapes through the sense of touch and yields information about the integrity of the contralateral parietal area.

Finger localization and finger-tip number writing perception also provide information about the parietal area of the contralateral cerebral hemisphere. Finger-tip number writing requires considerably more alertness and concentration, or perhaps even more general intelligence, than finger localization (Fitzhugh, Fitzhugh, & Reitan, 1962c).

In the Reitan-Wolfson theory of neuropsychological functioning, the highest level of central processing is represented by abstraction, reasoning, concept formation, and logical analysis skills. Research evidence indicates that these abilities have a general rather than a specific representation throughout the cerebral cortex (Doehring & Reitan, 1962). The generality and importance of abstraction and reasoning skills may be suggested biologically by the fact that these skills are distributed throughout the cerebral cortex rather than being limited as a specialized function of one cerebral hemisphere or a particular area within a hemisphere. Generalized distribution of abstraction abilities throughout the cerebral cortex may also be significant in the interaction of abstraction with more specific abilities (such as language) that are represented more focally.

Impairment at the highest level of central processing has profound implications for the adequacy of neuropsychological functioning. Persons with deficits in abstraction and reasoning functions have lost a great deal of the ability to profit from their experiences in a meaningful, logical, and organized manner. However, since their deficits are general rather than specific in nature, such persons may appear to be relatively intact in casual contact. Because of the close relationship between organized behavior and memory, these subjects often complain of "memory" problems and are grossly inefficient in practical, everyday tasks. They are not able to organize their activities properly, and frequently direct a considerable amount of time and energy to elements of a situation that are not appropriate to the nature of the problem.

This nonappropriate activity, together with an eventual withdrawal from attempting to deal with problem situations, constitutes a major component of what is frequently (and imprecisely) referred to as "personality" change. Upon clinical inquiry, such changes are often found to consist of erratic and poorly planned behavior, deterioration of personal hygiene, a lack of concern and understanding for others, etc. When examined neuropsychologically, it is often discovered that these behaviors are largely represented by cognitive changes at the highest level of central processing rather than emotional deterioration per se.

Finally, in the solution of problems or expression of intelligent behavior, the sequential element from input to output frequently involves an interaction of the various aspects of central processing. Visual-spatial skills, for example, are closely dependent upon registration and continued attention to incoming material of a visual-spatial nature, but analysis and understanding of the problem also involves the highest element of central processing, represented by concept formation, reasoning, and logical analysis. Exactly the same kind of arrangement between areas of functioning in the Reitan-Wolfson model would relate to adequacy in using verbal and language skills. In fact, the speed and facility with which an individual carries out such interactions within the content categories of central processing probably in itself represents a significant aspect of efficiency in brain functioning.

The HRB uses several measures to evaluate abstraction skills, including the Category Test, the Trail Making Test, and the overall efficiency of performance demonstrated on the Tactual Performance Test.

The Category Test has several characteristics that make it unique compared to many other tests. The Category Test is a relatively complex test of concept formation that requires ability (1) to note recurring similarities and differences in stimulus material, (2) to postulate reasonable hypotheses about these similarities and differences, (3) to test these hypotheses by receiving positive or negative information (bell or buzzer), and (4) to adapt hypotheses based on the information received after each response.

The Category Test is not particularly difficult for most normal subjects. Since the subject is required to postulate solutions in a structured (rather than permissive) context, the Category Test appears to require particular competence in abstraction ability. The test in effect presents each subject with a learning experiment in concept formation. This is in contrast to the usual situation in psychological testing, which requires solution of an integral problem situation.

The primary purpose of the Category Test is to determine the subject's ability to use both negative and positive experiences as a basis for altering and adapting his/her performance (i.e., developing different hypotheses to determine the theme of each subtest). The precise pattern and sequence of positive and negative information (the bell or buzzer) in the Category Test is probably never exactly the same for any two subjects (or for the same subject upon repetition of the test). Since it can be presumed that every item in the test affects the subject's response to ensuing items, the usual approaches toward determination of reliability indices may be confounded. Nevertheless, the essential nature of the Category Test, as an experiment in concept formation, is clear.

The Category Test is probably the best measure in the HRB of abstraction, reasoning, and logical analysis abilities, which in turn are essential for organized planning. As noted above, subjects who perform especially poorly on the Category Test often complain of having "memory" problems. In fact, the Category Test requires organized memory (as contrasted with the simple reproduction of stimulus material required of most short-term memory tests), and is probably a more meaningful indication of memory in practical, complex, everyday situations than most so-called "memory" tests, especially considering that memory, in a purposeful behavioral context, necessarily depends on relating the various aspects of a situation to each other (see Reitan & Wolfson, 1988, 1993, for a discussion of this concept).

The Trail Making Test is composed of two Parts, A and B. Part A consists of 25 circles printed on a sheet of paper. Each circle contains a number from 1 to 25. The subject's task is to connect the circles with a pencil line as quickly as possible, beginning with the number 1 and proceeding in numerical sequence. Part B consists of 25 circles numbered from 1 to 13 and lettered from A to L. The task in Part B is to connect the circles, in sequence, alternating between numbers and letters. The scores represent the number of seconds required to complete each Part.

The Trail Making Test requires immediate recognition of the symbolic significance of numbers and letters, ability to scan the page continuously to identify the next number or letter in sequence, flexibility in integrating the numerical and alphabetical series, and completion of these requirements under the pressure of time. It is likely that the ability to deal with the numerical and language symbols (numbers and letters) is sustained by the left cerebral hemisphere, the visual scanning task necessary to perceive the spatial distribution of the stimulus material is represented by the right cerebral hemisphere, and speed and efficiency of performance may reflect the general adequacy of brain functions. It is therefore not surprising that the Trail Making Test, which requires simultaneous integration of these several abilities, is one of the best measures of general brain functions (Reitan, 1955d, 1958).

IV. Validation of the Halstead-Reitan Neuropsychological Test Battery   Return to Contents

As noted previously, Dean (1985) cited the HRB as the "most researched" neuropsychological battery in the United States. Of even greater significance is the fact that the clinical usefulness of the HRB has been demonstrated in thousands of settings around the world, in laboratories ranging from clinical offices to major medical centers and universities.

Reitan's own research efforts began with general issues relating to brain-behavior relationships and proceeded to increasingly specific questions. Investigations were ordered in such a way that serially forthcoming information would arrive in a context of prior data.

First, the general effects of heterogeneous brain lesions were investigated; second, the differential effects of lateralized cerebral lesions without regard to type were studied; third, regional localization effects were identified; fourth, differences in effects of various pathologies and duration of lesion were researched; fifth, the neuropsychological correlates of brain lesions that were relatively chronic and static were compared with lesions that were acute and/or rapidly progressive; and sixth, the interaction among these variables and their relationship to the full range of cerebral damage, disease, and disorders was studied. Because our unit of value continually focused on the individual, the generality and specificity of application of the research results to individual subjects was routinely evaluated. No other test battery, or set of neuropsychological tests, has been evolved or studied in this organized and systematic manner.

A. General Neuropsychological Effects of Cerebral Damage   Return to Contents

Reitan's first study on the HRB compared results obtained with Halstead's ten tests in a group of 50 subjects with documented cerebral damage or dysfunction and a group of 50 control subjects who had no history of cerebral disease or dysfunction (Reitan, 1955c). A heterogeneous and diverse group of subjects with cerebral damage was included in order to ensure that an extensive range of conditions would be represented.

Persons with diverse medical conditions were deliberately included among the control subjects, and persons with brain damage were carefully excluded. Normally functioning individuals comprised 24% of the control group, and the remaining 76% was composed of patients hospitalized for a variety of difficulties not involving impaired brain functions. The control group included a substantial proportion of paraplegic and neurotic patients in order to minimize the probability that any intergroup differences could be attributed to variables such as hospitalization, chronic illness, and affective disturbances.

The two groups were matched in pairs on the basis of race and gender, and as closely as possible for chronological age and years of formal education. The difference in mean age for the two groups was .06 years, and the difference in mean education was .02 years. The two groups were obviously very closely matched for age and education, and the standard deviations for age and education in the two groups were nearly identical.

Although the two groups should have produced essentially comparable results on the basis of the controlled variables alone, the presence of brain damage in one group was responsible for a striking difference in the test results. Seven of the measures devised and described by Halstead showed differences between the mean scores for the two groups, with relation to variability estimates, which achieved striking significance from a statistical point of view. In fact, according to the most detailed tables we had been able to find, the probability estimates not only exceeded the .01 or .001 levels, but exceeded .000000000001. However, even these tables were inadequate to express the appropriate statistical probability level.

As noted above, statistical comparisons of the two groups reached extreme probability levels on seven of the ten measures contributing to the Halstead Impairment Index, and these seven tests have been retained in the Battery. The most striking intergroup differences were shown by the Category Test and the Halstead Impairment Index (even though the Impairment Index included three of the ten measures that were not particularly sensitive to brain damage). Not one brain-damaged subject performed better than his matched control on the Impairment Index (although the Impairment Indices were equal in six of the 50 matched pairs).

The Category Test was the most sensitive of any single measure to the effects of cerebral damage. Three subjects with brain lesions performed better than their matched control, but in the remaining 47 pairs of subjects the control performed better than the brain-damaged subject.

These results indicated that Halstead demonstrated remarkable insight in developing tests that reflected the nature of neuropsychological impairment due to brain damage. On three of his ten tests the results were not statistically impressive, but this may have been due to procedural problems in administration of these tests (Reitan & Wolfson, 1993). The remaining seven tests covered a broad range of adaptive abilities, involving such diverse performances as finger tapping speed, attentional capabilities to verbal and nonverbal stimuli, psychomotor problem-solving capability, memory for stimulus material to which the subject previously had been exposed, and abstraction, reasoning, and logical analysis skills.

These results, demonstrating highly significant differences between groups with and without cerebral damage, were produced at a time when most investigations had shown relatively minimal differences. Hebb (1939, 1941) had recently reported case studies of persons who had obtained IQ values in the superior range despite having had as much as one-third of their cerebral hemispheres surgically removed. A serious question existed, therefore, about why brain-damaged persons should perform so much more poorly than controls, especially considering the findings that had been reported by other researchers in earlier investigations.

At our current stage in the development of clinical neuropsychology, a retrospective review of this apparent conflict provides an explanation. Psychologists had been using measures devised to evaluate general intelligence, presuming that these tests would also be equivalently sensitive to impaired brain functions. However, as research has clearly demonstrated over the years since that time, tests developed to predict academic success are quite different from tests that are specifically sensitive to the biological condition of the brain. In fact, the IQ measures, which relate more closely to academic success, also appear to be heavily influenced by environmental and cultural advantages. On the other hand, tests developed specifically to evaluate impairment of brain functions seem to be fairer (unbiased) in terms of cultural and environmental experiences and advantages.

The basic reason for the striking differences in the results obtained using Halstead's tests and other measures appeared therefore to stem directly from the procedures used to develop the tests. As noted above, general intelligence measures were initially designed to predict academic success. Halstead ignored factors of this type almost entirely, and instead directly observed the daily living activities of persons with cerebral damage (having had these individuals identified for him by neurological surgeons who had operated on their brains). It is not surprising that Halstead's measures appear to relate much more closely to the biological adequacy of brain functions as well as to practical aspects of everyday life.

We continued to conduct additional studies of all of the measures that eventually have been included in the Halstead-Reitan Battery, comparing groups of patients who had documented cerebral damage with groups of subjects who had no history of cerebral disease. In one study the results indicated that the Trail Making Test was extremely sensitive to the biological condition of the brain (Reitan, 1958). Although Part A showed significant results, the results for Part B were even more striking.

Similar results were obtained with the Aphasia Screening Test (Wheeler & Reitan, 1962). Most persons with cerebral damage do not show any significant evidence of dysphasia, and essentially no individuals without past or present evidence of brain damage demonstrate such signs. However, when evidence of impairment is demonstrated on the AST, it almost invariably is associated with independent (medical) evidence of brain disease or injury. Minor deficits are sometimes exhibited by control subjects, but any significant evidence of dysphasia is a valid sign of cerebral damage, particularly involving the left cerebral hemisphere (Reitan, 1984).

The Halstead-Reitan Battery customarily includes the Wechsler Adult Intelligence Scale (WAIS). Certain of the Verbal subtests of the WAIS are usually within the normal range in persons with cerebral damage, provided that no evidence of aphasia has been elicited on the Aphasia Screening Test. In addition, control subjects usually show no evidence of frank dysphasic manifestations. Scores for the Wechsler subtests are usually somewhat lower for brain-damaged subjects than controls, except for Digit Span, which is often poorly performed by controls as well as brain-damaged subjects (Reitan, 1959).

B. Lateralization Effects   Return to Contents

The highly significant results obtained in comparing non-brain-damaged control subjects and groups with heterogeneous cerebral damage laid the foundation for more detailed studies of human brain-behavior relationships using the Halstead-Reitan Battery. One of these studies compared groups of subjects with left and right cerebral lesions in order to obtain information about the differential or specialized functions of the two cerebral hemispheres.

The initial investigation involved the Wechsler-Bellevue Intelligence Scale (Reitan, 1955a), and was one of the first studies to lay the groundwork for the "left brain versus right brain" differentiation. The findings indicated that the Verbal IQ was consistently depressed in persons with destructive lesions of the left cerebral hemisphere, whereas Performance IQ was lowered in persons with right cerebral damage.

These results were confirmed in an extensive series of studies that used various criteria for implicating the left or right cerebral hemisphere. These lateralized criteria included EEG disturbances (Kløve, 1959), homonymous visual field defects (Doehring, Reitan, & Kløve, 1961), and dysphasia versus constructional dyspraxia (Kløve & Reitan, 1958). Thus, a number of studies established the differential impairment of VIQ and PIQ to damage of the left and right cerebral hemispheres.

Throughout these investigations we continued to monitor the extent to which the generalizations applied to individual cases, and found that certain subjects did not fit the expected pattern, even though they had lateralized cerebral lesions. Other studies identified factors in addition to lateralization that influenced scores on neuropsychological tests. These factors included chronicity of the lesion, the developmental period during which the lesion occurred (childhood as compared with adulthood), the age of the adult subject when the lateralized brain damage was sustained, the education level, and even the type of cerebral damage.

For example, extrinsic tumors, which usually do not involve the brain tissue in a direct structural sense, demonstrated no evidence of differential impairment of verbal and performance intelligence, regardless of the side of the brain involved, even though other lateralizing findings were present. Traumatic head injuries showed a statistically significant but rather mild effect, correlating with the diffuse or generalized involvement (as well as lateralized damage) from a blow to the head from the outside environment.

The set of conditions that demonstrated the most profound effect on verbal or performance intelligence as a result of lateralized cerebral damage included: (1) normal prior development into adulthood of brain-behavior relationships, (2) a recent lateralized insult to a previously normal brain, and (3) a lesion that represented definite cerebral tissue damage and also structurally involved only one cerebral hemisphere. Interestingly, these conditions tend to describe persons who were normal before the lateralized cerebral damage was sustained (in contrast to a number of subjects included in split-brain studies and in evaluation of surgical excisions for epilepsy), and therefore have generalization value for normal brain functions (as contrasted with impairment of the brain sustained before neuropsychological maturation) (Reitan & Wolfson, 1993).

In their review of the historical development of clinical neuropsychology, Reitan and Wolfson (1993) have identified two historical trends -- one emanating from the areas of biopsychology and clinical psychology and the other developing from the medical field of neurology. Precedents in psychology have led largely to the development of tests and procedures rated on continuous distributions that generally follow the normal probability curve. Conversely, the rather simple procedures and tests derived from the field of behavioral neurology customarily produce dichotomous distributions, with results being classified as either normal or abnormal.

The Halstead-Reitan Battery was deliberately composed to include procedures from each of these historical areas. The tradition of behavioral neurology is principally represented in the HRB by the Aphasia Screening Test and the Sensory-perceptual Examination. Control subjects almost always earn normal scores on these simple tasks, and even persons with cerebral damage often do well. (See Reitan & Wolfson, 1986, 1988, 1993 for detailed discussions of the limitations of the "sign" approach.)

When abnormal performances are demonstrated on these relatively simple tasks, the results frequently have a fairly specific significance for implicating either the left or right hemisphere, and often identify impaired regional areas within each cerebral hemisphere. Deficits elicited by the sign approach, derived from procedures in behavioral neurology, therefore have special significance. However, considering the relatively simple nature of the tasks, normal performances are expected from non-brain-damaged subjects as well as a substantial number of persons with cerebral damage.

Using these procedures, we have compared the performances of control subjects with groups having left, right, and generalized cerebral damage (Doehring & Reitan, 1961; Heimburger & Reitan, 1961; Wheeler & Reitan, 1962). In each of these studies the subjects met independent neurological criteria for the groups to which they were assigned. Subjects selected for inclusion in the study had had the advantage of normal physical growth and development without being influenced by cerebral damage early in life. The subjects for these groups permitted inferences about the organization of neuropsychological functions subserved by normal development rather than deviant neuropsychological organization influenced by the effects of early cerebral damage.

The results indicated that manifestations of dysphasia were consistently associated with left cerebral damage, whereas the presence of constructional dyspraxia (deficits in ability to deal effectively with simple spatial relationships) characterized subjects with right cerebral damage. Many deficits were nearly exclusively manifested in association with damage to either the left or right hemisphere.

For example, dysnomia occurred in 53% of subjects with left cerebral lesions, but did not appear at all among subjects with right cerebral damage. This type of deficit occurred in 1% of the controls and 17% of the subjects with diffuse cerebral involvement.

Similar results were obtained on measures included in the Sensory-perceptual Examination. For example, finger agnosia involving the right hand occurred in 36% of subjects with left cerebral damage, but in less than 2% of subjects with right cerebral damage. Impairment on a sensory-perceptual task on one side of the body had adverse implications for the contralateral cerebral hemisphere.

Wheeler and Reitan (1962) compared 104 control subjects, 47 subjects with left cerebral lesions, 45 subjects with right cerebral lesions, and 54 subjects with bilateral or diffuse damage. Four simple rules were developed to classify any subject to one of the four groups. Applying these rules gave the following conditional probabilities of correct classifications: controls, 78%; left cerebral damage, 80%; right cerebral damage, 85%; and nonlateralized cerebral damage, 84%. These findings indicate that results from the HRB not only are quite accurate in identifying and differentiating subjects with and without cerebral damage, but also in identifying which hemisphere is involved.

Similar studies have been done using discriminant function analyses to produce a single weighted score for each subject and an optimum, least-squares type of separation. Wheeler, Burke, and Reitan (1963) used a total of 24 scores for each subject (11 from the Wechsler Scales and 13 from the HRB) and analyzed them using groups of 61 control subjects, 25 subjects with left cerebral damage, 31 subjects with right cerebral damage, and 23 subjects with diffuse or bilateral involvement.

The bases for classifying subjects to these groups were derived entirely from independent neurological, neurosurgical, and neuropathological findings. Analysis of the tests results fell in the following categories of correct predictions: controls vs. all categories of cerebral damage, 90.7%; controls vs. left damage, 93.0%; controls vs. right damage, 92.4%; controls vs. diffuse damage, 98.8%; and right vs. left damage, 92.9%.

A number of studies compared persons with acute versus chronic brain lesions. Clinical observations suggested that specific deficits tend to resolve, at least partially, as the patient progresses from an acute to chronic stage. In this context, Fitzhugh, Fitzhugh, and Reitan (1961, 1962a, 1962b, 1963) studied results from the Halstead-Reitan Battery, the Wechsler Scale, and the Trail Making Test. The results generally indicated that persons who had chronic brain lesions (thus implying the possibility of some recovery of function over time) tended to perform somewhat better than subjects with recent, acutely destructive lesions.

Particularly striking were the results concerning the association between lateralized cerebral damage and differential impairment on the VIQ and PIQ. Persons with recent lateralized cerebral lesions showed consistent impairment of either VIQ or PIQ, depending upon the hemisphere involved. However, persons with chronic lateralized cerebral damage demonstrated much less consistent relationships between differential VIQ and PIQ scores and the side of the brain that was injured. These results supported our later finding that neuropsychological recovery occurs in the area of initial deficit (Reitan & Wolfson, 1988).

The Halstead-Reitan Battery has been extensively researched and studied, both in group comparisons and in application to individual persons, across essentially the full range of conditions that comprise the field of clinical neuropsychology. The characteristic results found with the HRB have been described for all of the major categories of brain disease and damage, including neoplasms, cerebral vascular disease, traumatic brain injury, degenerative and demyelinating diseases, and infectious and inflammatory diseases (see Reitan & Wolfson, 1992b , 1993 for a summary of research and clinical findings in both adults and children).

Traumatic brain injury has been an area of special emphasis (Reitan & Wolfson, 1985, 1988, 2000a). In fact, Reitan's first publications were based on brain-impaired soldiers during World War II (Aita, Armitage, Reitan, & Rabinovitz, 1947; Aita, Reitan, & Ruth, 1947; Aita & Reitan, 1948). Books by Reitan and Wolfson have reviewed the neuropathological manifestations of head injury, the neurological and neuroradiological diagnostic methods, and the neuropsychological consequences (1985); the mechanisms of repair of traumatic brain damage, prognosis and outcome, methods of rehabilitation and retraining of neuropsychological deficits, and the natural history of traumatic head injury based on an 18-month follow-up of clinical and neuropsychological examinations (1988); and a review of the pathology of mild head injury, the neuropsychological literature, and research studies that showed the sensitivity of the HRB to deficits in cases of mild head injury that met certain criteria as contrasted with the relatively routine recovery of most cases of mild head injury (2000).

Of course, many additional studies have been published in the area of head injury. Dikmen and Reitan (1976), based on the 18-month follow-up study of traumatic brain injury cases referred to above, found that a combination of neuropsychological test results, obtained at about one month following injury, permitted a 91% accuracy rate in classifying subjects who would have significant deficits 18 months after the injury as contrasted with persons who recovered much better. Rojas and Bennett (1995) used the General Neuropsychological Deficit Scale (GNDS) (based on 42 measures from the HRB) and other measures to compare subjects with mild head injury with volunteer college students. They found that the Stroop Neuropsychological Screening Test did not differentiate the groups, but the GNDS correctly identified 92% of the subjects.

The HRB has also been extended to evaluate young adults with learning disabilities, especially considering the striking sensitivity of the HRB for Older Children in differentiating groups with brain damage, learning disabilities, and no evidence of brain damage (Reitan, 1992b).

Oestreicher and O'Donnell (1995) compared the GNDS scores of three groups: learning-disabled young adults (N=60), traumatically brain-injured young adults (N=30), and non-disabled volunteers (N=30). The groups were equivalent in gender and FSIQ. The results indicated that the GNDS differentiated the three groups at highly significant levels. The non-disabled volunteers all had GNDS scores of less than 27, whereas 29 of the 30 traumatically brain-injured subjects had GNDS scores of 27 or higher. Among the learning-disabled subjects, 29 (49%) had GNDS scores of 27 or higher.

In addition to confirming the validity of the GNDS in differentiating between brain-damaged subjects and non-brain-damaged subjects, as represented by a 98% hit rate, these researchers noted that the results confirmed prior studies (O'Donnell, 1991; O'Donnell, Kurtz, & Ramanaiah, 1983) in demonstrating that learning-disabled young adults show neurobehavioral impairment which, in some instances, is surprisingly severe.

Oestreicher and O'Donnell (1995) also compared the GNDS and the Halstead Impairment Index (HII). These researchers concluded that "the Omega Squared statistic showed that the GNDS was 45% more efficient than the HII in discriminating neuropsychologically normal from neuropsychologically impaired individuals. Thus, a single measure, the GNDS, which summarizes all the data from a neuropsychological examination and incorporates multiple methods of neuropsychological inference, yields an index that is highly sensitive to brain impairment. Therefore, the present study justifies extending the GNDS to both research and clinical assessment with LD (learning-disabled) and HI (head-injured) persons" (p. 189).

C. Clinical Inferences Regarding Individual Subjects   Return to Contents

Finally, we have been interested in applying these various research results to interpretation of test protocols for individual subjects. In order to test the validity and accuracy of the HRB, we conducted a complex study in which both location and type of cerebral damage were carefully controlled (Reitan, 1964).

The first step in the procedure was to identify subjects with criterion-quality frontal, nonfrontal, or diffuse cerebral lesions. In order to provide a rigorous test of the generality of any conclusions relating to these various groups, we designed the study so that each group had the same number of subjects with various types of lesions. Each regional localization group was therefore composed of equal numbers of subjects with intrinsic tumors, extrinsic tumors, cerebral vascular lesions, and focal traumatic lesions.

Through a review of thousands of neurological protocols, we were able to identify four individual subjects with each of these types of lesions in each of the left anterior, left posterior, right anterior, and right posterior cerebral locations. We therefore had a total of 64 subjects. When subdivided into groups of 16 subjects with different locations of damage, each group contained equal representations of four different types of lesions. When subdivided into groups of 16 subjects according to type of lesion, each group had equal representations of the four different locations. Three additional groups were included in the study: 16 subjects with diffuse cerebral damage or dysfunction due to cerebrovascular disease, 16 subjects with closed head injuries, and 16 subjects with multiple sclerosis. The inclusion of these groups resulted in a total of 112 patients.

In the first study, a form was designed to record independent judgments based on the psychological test results for each subject. This form required three general decisions: first, a judgment was made from the neuropsychological test results alone about whether the lesion was focal or diffuse; second, a lesion category was selected; finally, more detailed judgments were made within certain lesion categories. For example, if the initial decision was that the lesion was focal rather than diffuse, the next judgment required selection of the right or left cerebral hemisphere, followed by selection of an anterior or posterior location within the hemisphere.

Next, a judgment was made about whether the lesion represented cerebral vascular disease, tumor, multiple sclerosis, or trauma. If the cerebral vascular disease category was selected, the lesion was further classified as a hemorrhage or vascular insufficiency. Additional forced judgments about the underlying basis for the evidence of cerebral vascular disease were made under each of these two categories.

If the tumor category was selected, the rater had to judge whether the lesion was intrinsic or extrinsic. If the intrinsic tumor category was selected, a further classification was made concerning whether the lesion was metastatic or a primary glioma.

Under the trauma category, the lesions were classified as an open or closed head injury. All of these ratings were based solely on the neuropsychological test results.

The rating form was sufficiently complete to permit classification of all subjects according to neurological criterion information. Since the classification of subjects to their respective groups had initially been based on neurological information, judgments made on the basis of neurological information represented the criterion. The purpose was to determine the degree of concurrence between neurological criterion information and ratings made on the basis of the psychological test results alone.

Each of the 112 subjects was assigned a number before the ratings were begun, and all information other than the psychological data was concealed in order to avoid any identifying clues when the ratings were made. There was no attempt to assign an appropriate number of subjects to any particular group. For example, no running record was kept to limit assignment of only 16 subjects to the multiple sclerosis group, etc. Every effort was made to classify the test results for each subject independently, and ratings were completed without using any type of running tally.

Using the neurological ratings as the criterion, there were 64 subjects with focal cerebral lesions. Of these subjects, 57 were classified correctly on the basis of psychological testing, and 7 were judged to have diffuse damage. In the group of 48 subjects with diffuse cerebral damage, 46 were classified correctly and 2 were judged to have focal lesions.

As mentioned above, there were 16 subjects in each regional localization group. The number of correct classifications on the basis of psychological test results for each of the locations was as follows: left anterior, 9; left posterior, 11; right anterior, 7; and right posterior, 15. Thus, 42 of the 64 patients were placed in their correct groups. Adding to this the correct classification of 46 of the 48 subjects with diffuse cerebral involvement, 88 of the 112 subjects were correctly classified.

With respect to the type of lesion, the 112 subjects fell in the following neurologic diagnoses: intrinsic tumor, 16; extrinsic tumor, 16; cerebral vascular disease, 32 (16 focal, 16 diffuse); head injury, 32 (16 focal, 16 diffuse); and multiple sclerosis, 16.

The number of correct classifications on the basis of psychological inferences was as follows: intrinsic tumor, 13; extrinsic tumor, 8; cerebral vascular disease, 28; head injury, 30; and multiple sclerosis, 15. Thus, 94 of the 112 patients were correctly classified according to type of lesion.

Additionally, 13 of the 16 subjects with focal cerebral vascular disease were classified correctly, and 12 of these 13 were judged to have focal lesions. Of the 15 subjects with diffuse cerebral vascular disease who were correctly placed in this category, 14 were judged to have diffuse cerebral vascular disease. A total of 30 of the 32 head injury subjects had been placed in this category on the basis of their psychological test results, and 27 of these 30 had been correctly classified according to whether the lesion was focal or diffuse. It is particularly noteworthy that the HRB is differentially sensitive to traumatic brain injury, considering the frequency with which subjects in this category are involved in litigation. A review of the literature yields no evidence that any other neuropsychological tests or methods are able to differentiate traumatic brain injury from other conditions of brain damage.

The degree of concurrence between the neurological criteria and the neuropsychological ratings indicated above could scarcely have happened by chance. The results confirmed that neuropsychological test results are differentially influenced by (1) focal and diffuse lesions, (2) which cerebral hemisphere sustained damage, (3) frontal and nonfrontal lesions within the hemisphere involved, (4) type of lesion (intrinsic tumors, extrinsic tumors, cerebral vascular lesions, head injuries, and multiple sclerosis), (5) focal occlusion as compared with generalized insufficiency in cerebral vascular disease, and (6) focal as compared with diffuse damage from head injuries.

It is important to note that a study of this type serves a significant purpose in providing insight into the degree to which psychological test results may be determined by brain lesions. Furthermore, this study indicates the multifaceted nature of the concept of "brain damage," and demonstrates that many neurological variables, which occur in varying combinations for individual subjects, are relevant in determining psychological measurements.

The validational studies of the Halstead-Reitan Battery leave no doubt that an individual's test results are closely dependent on his/her neurological status or diagnosis. The research findings reveal not only statistically significant intergroup differences, but also demonstrate the validity of the test results for specific neuropathological conditions which affect the individual person.

V. The Halstead-Reitan Battery and Current Problems in Neuropsychology   Return to Contents

A number of recent studies have used the HRB to investigate problems of current clinical significance in neuropsychology. While the human brain subserves a broad range of abilities ranging from sensory-perceptual functions, through complex aspects of central processing, to adaptive motor responses (see the Reitan-Wolfson model described previously), a need has existed both for clinical and research purposes for an overall indicator of general neuropsychological status. The Halstead Impairment Index (Halstead, 1947; Reitan, 1955c) and the Average Impairment Rating (Russell, Neuringer, & Goldstein, 1970) have been found to be valid and have served usefully, but these measures do not summarize fully the broad range of tests included in the HRB.

A.  A Summary Index of Overall Performance on the HRB   Return to Contents

Reitan and Wolfson (1988, 1993) addressed this deficiency by developing the General Neuropsychological Deficit Scale (GNDS), an instrument that produces a summary score based on 42 variables from the Halstead-Reitan Neuropsychological Test Battery for Adults. The field of clinical neuropsychology has identified major areas of neuropsychological functioning, and there is general recognition that a comprehensive evaluation requires assessment of these areas for clinical purposes as well as for planning a rehabilitation regimen.

The GNDS was developed for this purpose, and was devised in such a manner that four methods of evaluating data are represented. Of the 42 variables that contribute to the GNDS score, 19 are based on level of performance and reflect how well the subject performed on a broad range of neuropsychological tests. Nine variables representing motor and sensory-perceptual performances evaluate differences between the preferred and the nonpreferred sides of the body. This approach reflects the fact that brain damage often affects one side of the body more than the other side. Using the differential-score approach, two variables evaluate relationships among tests that differentiate brain-damaged subjects from control subjects. Finally, 12 variables representing dysphasia and constructional dyspraxia constitute the pathognomonic sign approach.

The GNDS was not designed to replace competent clinical interpretation of the test results (which involves characterization of the individual's deficits), but only as an overall indication of the degree of neuropsychological impairment. To achieve this purpose, each of the 42 variables contributing to the GNDS score is represented by four score ranges: 0 (a perfectly normal performance), 1 (a normal performance that is mildly deviant, but without clinical significance), 2 (a mildly to moderately impaired performance), and 3 (a severely impaired performance). These score ranges are useful when the clinician wants to assess a subject's degree of impairment on an individual test in the Halstead-Reitan Battery, and the cutoff score between 2 and 3 differentiates normal from brain-damaged subjects. Thus, in a general sense, the classifications represent normative data.

The GNDS score represents the sum of the scores for the 42 variables on which it is based. This procedure produces low GNDS scores for normal controls and high GNDS scores for persons with cerebral dysfunction. The normal range extends from 0-25; mild impairment from 26-40; moderate impairment from 41-67; and severe impairment from 68 to the maximum possible score of 168 (Reitan & Wolfson, 1988, 1993). More detailed information about the GNDS as well as computerized scoring is published in Reitan and Wolfson (1993).

Using control groups and brain-damaged groups, Reitan and Wolfson (1988, 1993) presented the results of several validational studies indicating that the GNDS was adversely affected by (1) left, right, or generalized heterogeneous cerebral damage, (2) left, right, or generalized cerebral vascular damage, (3) left, right, or diffuse cerebral damage due to trauma, and (4) left or right cerebral damage when type of lesion was held comparable. All of the non-brain-damaged control groups had lower (better) mean GNDS scores than any of the brain-damaged groups.

The following control groups were used: (1) a normal group with a mean Full Scale IQ of 129.26 (mean GNDS, 12.48), (2) a young group (mean GNDS, 15.90), (3) a heterogeneous group (mean GNDS, 17.20), and (4) an older group (mean GNDS, 24.82). (See Reitan & Wolfson, 1988 for additional details.) A group of subjects who had sustained a cerebral concussion but had not demonstrated any objective evidence of brain damage, examined at the time of discharge from the hospital, had a mean GNDS score of 25.36, a score that was significantly poorer than the score earned by a heterogeneous control group.

The groups with definite brain damage consistently had mean GNDS scores ranging from 43.73 to 64.33 (subjects with traumatic brain injuries tended to score better than other groups, and subjects with left cerebral strokes tended to do most poorly). No significant differences were found within studies among subjects with left, right, or generalized cerebral damage, suggesting that the GNDS score, as intended, was a valid overall summary indicator. Males and females showed no significant differences on the GNDS when other variables were held constant. In a study comparing a group of subjects who had cerebral concussion with a group of traumatic brain-injured subjects who had documented tissue damage, the group with concussion had a significantly better mean GNDS score. Studies comparing older and younger control subjects demonstrated that the older groups performed more poorly.

Sherer and Adams (1993) published a cross-validation study of the GNDS and indices reported by Reitan and Wolfson (1988) to be differentially sensitive to damage of the left or right cerebral hemisphere, and concluded that their findings provided "limited support for the validity of these new scales with patients commonly seen in clinical practice" (p. 429). These investigators compared 73 brain-damaged subjects with 41 "pseudoneurologic" subjects. The pseudoneurologic group was composed of subjects who reported "neurologic" symptoms but had no biomedical evidence of brain damage and included "30 subjects who had received psychiatric diagnoses" (p. 429).

The results of Sherer and Adams agreed essentially with the findings reported by Reitan and Wolfson (1988), insofar as (1) the brain-damaged group was significantly more impaired on the GNDS than the pseudoneurologic group, (2) the subgroups with left, right, or generalized damage did not differ on the GNDS, (3) males and females showed no statistical difference, and (4) older subjects performed more poorly than younger subjects. The principal differences between results reported by Sherer and Adams (1993) and Reitan and Wolfson (1988) was that 53.6% of the pseudoneurologic group scored in the brain-damaged range as compared to only 10% of Reitan and Wolfson's controls. Sherer and Adams (1993) state that "the poor classification of our pseudoneurologic subjects may be due to a limitation with the use of pseudoneurologic control groups" (p. 434).

Wolfson and Reitan (1995) designed an additional study to further investigate the validity of the GNDS in differentiating between groups of subjects with and without evidence of cerebral damage, comparing 50 brain-damaged subjects with 50 controls that were similar in age and education.

The mean GNDS score for the brain-damaged group (55.02) was very similar to the mean of 53.22 reported by Reitan and Wolfson (1988) for 169 brain-damaged subjects. The mean GNDS score for the control subjects (19.66) was slightly higher than the mean of 17.20 found for 41 controls (Reitan & Wolfson, 1988). In each case, the brain-damaged subjects performed more poorly than the controls at a highly significant level.

In Reitan and Wolfson's 1988 study, the best cutoff score was 25/26, which resulted in a 9% rate of misclassification. The 1995 study identified 28/29 as the best cutoff score, which misclassified 12% of the subjects. An inspection of the two distributions indicated that individual cases of GNDS scores differing by one or two points were responsible for the differences in the two distributions. In the 1995 study, 80% of the controls and 96% of the brain-damaged subjects were classified correctly using the original cutoff score of 25/26, again yielding a 12% rate of misclassification. Obviously, the "best" cutoff score depends upon whether one wishes to avoid false negatives or false positives, but the best clinical conclusion would identify the "grey" area (where the groups overlap) as falling between 26 and 29 points.

Sherer and Adams (1993) stated that the value of using pseudoneurologic controls was that such patients are frequently seen in clinical practice. However, their pseudoneurologic control group had a mean GNDS score of 26.56, a value considerably higher than the means reported by Reitan and Wolfson (17.20, Reitan & Wolfson, 1988; 19.66, Wolfson & Reitan,1995). In order to obtain completely "clean" comparisons, we feel that the initial validation studies should compare groups of subjects who fall unequivocally into either a brain-damaged or non-brain-damaged group.

Following such a determination, other comparison groups, composed according to clinically relevant criteria, may be evaluated, and a pseudoneurologic comparison group would certainly be clinically relevant. The group with cerebral concussion reported by Reitan and Wolfson (1988) would be another such relevant group, and it might also qualify as a pseudoneurologic group since all members of the group had "neurologic" complaints but no biomedical evidence of brain damage. (See Reitan & Wolfson, 1988 for a review of the debate concerning whether the symptoms of concussion should be considered "psychiatric" or "neurologic.") In Reitan and Wolfson's previous study, the group with cerebral concussion had a mean GNDS score of 25.36, a value very similar to the mean of 26.56 reported by Sherer and Adams for their pseudoneurologic group (Reitan & Wolfson, 1988).

In summary, empirical research results have established that the GNDS score is a valid general indicator of the presence or absence of cerebral damage.

B. Practical and Legal Implications in the Use of the Halstead-Reitan Battery   Return to Contents

The United States Supreme Court identified four considerations for trial judges to use in evaluating expert testimony, or in some cases, in deciding whether to admit expert testimony. This directive requires that neuropsychologists carefully consider their testing methodology in accordance with whether or not their procedures meet these four criteria. Briefly, these include the following:

  • Has the method been tested?

  • Has the method been subjected to peer review and publication?

  • What is the error rate in applying the method?

  • To what extent has the method received general acceptance in the relevant scientific community?

In our experience, trial judges and lawyers are gradually giving increased consideration to these guidelines, and giving serious consideration to the admissibility of testimony based on whether these guidelines have been met. In the main, concerns have centered around (3) and (4) above; namely, the error rate and the question of general acceptance.

It would seem perfectly legitimate, and eve