Navigation Links
-------------

Home
-------------

Viewing the
Online Catalog

-------------

Go to Online
Catalog

-------------

Publications
-------------

Newsletters
-------------

Workshops
-------------

Case Studies
-------------

Ralph Reitan
Bibliography

-------------

A to Z Product
Listing

-------------

Related Internet
Links

-------------

PDF Downloads
-------------

 

      


CLINICAL AND FORENSIC ISSUES REGARDING AGE, EDUCATION, AND VALIDITY OF NEUROPSYCHOLOGICAL TEST RESULTS:
A REVIEW AND PRESENTATION OF A NEW STUDY

Ralph M. Reitan
Deborah Wolfson
 
Send a comment to the authors about this paper

Accepted for publication in the Journal of Forensic Neuropsychology

This paper may be downloaded and printed for personal use only.
Click here to download a PDF file from which to print your personal copy.

Duplication by any means for any other purpose is strictly prohibited by copyright law.

Abstract

This paper reviews procedures and reports of adjustments of raw scores from the Halstead-Reitan Neuropsychological Test Battery for Adults using the method proposed by Heaton, Grant, and Matthews (HGM) (1991). Arguments and findings both supporting and criticizing the HGM transformations were reviewed.

From a psychometric viewpoint, the conclusion was reached that the HGM method, which is based only on a neurologically-normal group, transforms raw scores into scaled scores that tend to fall in the normal range, especially for brain-damaged persons. Although this question is not yet adequately researched, the effect may stem principally from adjustments made for persons with less education and older age. The consequent result of the adjustments, in this case, would be to produce normal scores for tests that were designed, developed, and validated as measures of brain impairment - defeating the very purpose of the tests as neuropsychological measures. If the transformation process, based on age and education, is faulty for any reason (including inadequacies of the normative sample), and the transformations skew the raw scores either in the direction of normalcy or brain impairment, a serious problem exists in clinical interpretation or in a forensic setting. Thus, the questions raised in this paper are of theoretical, clinical, and forensic importance, and are urgently in need of resolution through further empirical research.

Heaton, Grant, and Matthews presumed, quite reasonably, that persons with more education would perform better on neuropsychological tests than would persons with lesser education, and that, among adults, older people might show some deterioration of performances in comparisons with younger subjects. A considerable body of published evidence had already supported these postulates. Thus, it was appropriate to explore the prospect of developing adjustments of raw scores in accordance with the age and education of the individual subject to evaluate the subject's performances with relation to the rest of the population.

Although relatively little data were available to determine whether these postulates were also true for brain-damaged subjects, this was not an immediate problem, inasmuch as Heaton, Grant and Matthews intended to develop the age and education adjustments using a sample of neurologically-normal persons. Their research supported the hypotheses underlying the purposes of their investigation, and was published in a volume that presented extensive tables for trans-forming raw scores on a series of tests to T-scores having a mean of 50 and a standard deviation of 10 for each test (Heaton, Grant, & Matthews, 1991). Jarvis and Barth (1994), in their book that was largely devoted to instruction in clinical interpretation of Halstead-Reitan test results, stated that the Heaton, Grant, and Matthews T-score transformations were generally used in the illustrative cases that they presented.

There can be no doubt that these transformations are used widely, and presently often serve as a basis for clinical and forensic interpretations of results for individual tests, for patterns and relationships among test results, and for conclusions about individual patients and clients, even though there has been very little research comparing the clinical significance of the transformations with clinical interpretations based on the raw scores. Obviously, the aim and purpose of the profession is to produce valid conclusions about the effects of brain damage or impairment for the individual examinee. Our integrity as neuropsychologists rests upon the accuracy with which we achieve this aim. From a forensic viewpoint, the validity of settlements and judgments may be determined by the accuracy of neuropsychological conclusions. Whenever raw test scores are adjusted or transformed, it is obvious that the profession must have full and complete information about the effects of such changes.

Many clinical neuropsychologists were well aware that neuropsychological tests, because of their specified purpose, have been developed to meet conditions of sensitivity and specificity - sensitivity to brain damage or impairment and specificity for brain impairment. If a test is useful as a neuropsychological instrument, the test scores should reflect brain impairment and should not overlap significantly with other groups, including non-brain-damaged persons.

What are the practical implications of these conditions? When they are met, brain impairment becomes a determiner of the test results, at least to a large extent, among persons with brain damage, and the remaining variance, determined by attribute variables such as age and education, is correspondingly reduced. It is only reasonable to postulate that age and education, which vary relatively randomly among brain-damaged persons, will lose a major aspect of their significance if brain impairment is a major factor in producing scores on neuropsychological tests (as should be the case if the tests are, in fact, neuropsychological).

While adjustments for age and education might be justified among non-brain-damaged persons, such adjustments among brain-damaged persons should have lessened justification. In fact, adjustments of raw scores of brain-damaged persons, when based on data from non-brain-damaged subjects, might possibly interfere with the validity of raw scores as indicators of brain impairment. Conceivably, such adjustments, based on trends and relationships that exist in non-brain-damaged groups, might actually have their major effect by influencing scores of brain-damaged persons to move in the direction of the scores of non-brain-damaged persons, inasmuch as these trends (correlations) might not exist in the brain-damaged group.

Considerations of this kind led Reitan and Wolfson to conduct a series of studies investigating the correlation between age and education and neuropsychological data using groups without brain damage and groups with brain damage (as well as other conditions which might reflect brain impairment, such as learning disabilities). The General Neuropsychological Deficit Scale (GNDS) was selected as the variable to reflect the adequacy of brain functions since the GNDS score, based on 42 brain-sensitive variables summarizing the results of the Halstead-Reitan Battery, has been found to be sensitive to impairment of brain functions (Reitan & Wolfson, 1988, 1995a), and even more sensitive than the Impairment Index (Rojas & Bennett, 1995; Oestreicher & O'Donnell, 1995).

The first study (Reitan & Wolfson, 1995) used two approaches to examine the effects of age and education on GNDS scores: (1) Were there significant differences in level of performance between older and younger subjects?, and (2) Were coefficients of correlation between age and education versus GNDS scores statistically significant? The research answered these questions positively in both instances for the non-brain-damaged group, but negatively for the group with definite evidence of brain damage or disease. The effects of brain damage had subverted the significant influence of age and education that was apparent in the control group.

The findings definitely implied that brain-damaged and control groups cannot be treated similarly regarding the effects of age and education. In brief, neither youth nor higher education can protect the individual from the impairing effects of brain damage, and the severity, type, and location of brain damage become the principal determinants of the neuropsychological test results. It is obvious that if a neuropsychological test is principally determined by the condition of the brain, a major part of the variance cannot be determined by attribute variables.

These considerations suggest a gradient effect among psychological tests - the more sensitive a test is to brain impairment, the less it will be affected by attribute variables, and the less sensitive a test is to brain damage, the greater the likelihood that other variables will determine the results.

Similar studies were done using data from the Wechsler Adult Intelligence Scale (Reitan & Wolfson, 1996b, 1996d) and the results tended to show that a subtest's sensitivity to brain damage was a significant factor in determining the extent to which it was susceptible to age and education influences. Among brain-damaged groups, subtests that were sensitive to brain damage (such as Digit Symbol and Block Design) were relatively uninfluenced by age and education, whereas subtests that were relatively unaffected by brain damage (such as Information and Vocabulary) showed a significant relationship to age and, especially, to education.

The same research approach was used to investigate the effects of age and education in children. Reitan and Wolfson (1995b) found that both age and education had a strong effect on Neuropsychological Deficit Scale scores of non-brain-damaged control children. Brain-damaged children, however, not only showed no significant correlations between NDS scores and age and education, but, in addition, showed no significant improvement in NDS scores when comparing groups aged 9 through 11 versus 12 through 14 years of age. (This latter finding is of special concern, since it suggests that educational training had a negligible effect on the development of brain-based abilities in brain-damaged children. While non-brain-damaged children showed significant improvement in NDS scores in the two age ranges that were compared, brain-damaged children showed no significant improvement. The three years of educational training apparently were ineffectively applied in consideration of the developmental needs of children with brain damage!)

These findings also raised a question about the mandatory (inbuilt) age adjustment required to score the Wechsler Intelligence Scale for Children-Revised (as well as later versions). If brain-damaged children do not show significant improvement as they grow older, at least over a three-year period, what effect does this have on computation of Intelligence Quotients, since the norms are based on a presumption of improvement, as shown in normal children? The effect would be to produce lower IQ's for brain-damaged children as they grow older, resulting from the mere passage of time.

Considerable research, beginning mainly with Rourke (1975, 1985, 1989), has implicated impairment of brain functions among children with learning disabilities. A logical step, therefore, was to investigate the effects of age and education on neuropsychological test scores of children in this category. The results (Reitan & Wolfson, 1996a, 1996c) were similar to those found among children with definite diagnoses of brain damage or disease. These studies showed no significant relationships between age and education and NDS scores for children with learning disabilities. In addition, mean NDS scores of children aged 9 through 11 years versus 12 through 14 years were not significantly different. As with brain-damaged children, the effects of additional education appeared to have had no beneficial effect on the brain-based abilities of learning-disabled children.

This finding, as with brain-damaged children, raises a question about the computation of IQ's of children with learning disabilities. If the abilities of these children do not progress at the rate of normal children, but IQ computation presumes an age-based rate of development, the effect would be to produce progressively lower IQ's as the child ages. With respect to the age-old formula of intelligence based on the relationship between mental age and chronological age, only chronological age would remain as a variable, and the older the child becomes, the lower his IQ would be until adulthood.

These findings provide a basis for seriously questioning the mandatory scoring procedure of the WISC-R (and probably the WISC-III, as well), because an apparently unjustified adjustment, based solely on the results of normal children, is applied to the raw scores of children in whom age-related increments in performance may be limited. Is it of value to know how well the individual has actually performed, or is it sufficient to know how well he has performed on an adjusted basis compared to a normative sample? How many other tests may have this same problem? These results suggest that many other tests that make arbitrary adjustments of raw scores based on attribute variables should also be investigated for the actual meaning of their scoring procedures and the effect of such adjustments.

Reitan and Wolfson (1997, 1999) investigated the influences of age and education in cases of mild head injury. Two groups with mild head injury were studied. One group was composed of persons who had been routinely recruited for study, regardless of any symptoms or complaints, and a second group of equivalently mild head-injured persons was composed of patients who had eventually developed significant symptoms or complaints which had caused them to return for further evaluation and/or treatment.

Correlations of age and/or education with GNDS scores were not statistically significant in either group. Thus, the findings for persons with a mild head injury were essentially similar to the findings for persons with a more severe head injury, indicating that the neuropsychological consequences of the injury apparently had overridden the influences of age and education seen among non-brain-injured subjects. This finding must be interpreted in light of the sensitivity and specificity of the GNDS to the effects of brain damage and, obviously, may not apply to tests that are not sensitive to and specific for brain damage.

The consistency of the above findings tends to document their validity. In every study the effects of age and education on neuropsychological measurements were significant for non-brain-damaged persons, but were not significant for groups with brain impairment. This finding occurred in comparisons of controls versus brain-damaged adults, controls versus brain-damaged children, controls versus children with learning disabilities, and controls versus a group with mild head injuries. This finding appeared more prominently in tests that had known sensitivity to brain damage than in tests that tend to be resistive to the effects of brain damage.

The results of these studies are more completely reviewed in a paper by Reitan and Wolfson (1999). We should point out, however, that limited sample size and restrictions of the age and education ranges in these studies suggest that further research is needed. A very large sample might produce a statistically significant correlation, even though the magnitude of the coefficient is not increased nor the extent to which variance is explained. (The reader should be aware of this effect. For example, a correlation coefficient of .42 is required for significance at the .05 level with an N of 20, but a coefficient of only 0.19 is required if the N is 100.)

One may wonder how many other conditions there are, beyond impairment of brain functions, that might also diminish the effects of age and education on psychological and neuropsychological test scores, but this question has not yet been studied. It would seem clear, however, that adjustments based on age and education do not appear to be justified among persons or groups with brain impairment. Such adjustments, presently made on a routine and widespread basis, must be questioned, and additional research is needed to identify various categories of patients in whom the adjustments may or may not be justified.

Many studies over the years have shown definite relationships in adults, both in research and clinical contexts, between Wechsler scores and various aspects of brain damage (Reitan & Wolfson, 1993), but the data used in these studies were based on scaled scores that were not adjusted for age or education. Studies of children, using scoring procedures that required age adjustment, have not shown similar findings. For example, there is not a single study in children that shows a significant difference between Verbal IQ and Performance IQ values, in cases with structural lateralization of cerebral damage (although many such studies have been published for adult groups).

Many variables and influences may be of significance in determining this circumstance, but transformation of scores for brain-damaged children on the basis of relationships that may not exist for them, but do exist for normal children, may have been a factor. Considering the "aging penalty" in scoring IQ's for brain-damaged and learning-disabled children (the older they become, the lower their IQ's become), as well as other brain-related variables such as lateralization effects, essentially the entire literature on intelligence and brain damage in children (which is mainly derived from Wechsler scores) may have been tainted.

If we postulate that brain damage has an adverse effect on neuropsychological test results in both adults and children, and that in this respect the categories represent different populations, should we not also entertain the hypothesis that attribute variables, such as age and education, might also differ in their effects on neuropsychological variables? Obviously, the universal and uncritical application of age and education levels, based on normal subjects, to brain-damaged persons and persons in all other diagnostic categories, has not considered this possibility. A helpful approach in dealing with this problem would be for normative data and scoring procedures to separate (rather than require) scores based on actual performances from scores that have been adjusted for age.

The Nature and Urgency of the Problem in Litigation

The following case illustrates one of many currently routine examples of the effects of transforming raw neuropsychological test scores to age- and education-adjusted scores.

The case involved a 51-year-old man with a 10th-grade education who had sustained a head injury four months prior to neuropsychological examination. He was suing for recovery of damages. Scores earned on the Halstead-Reitan tests that were administered are shown in Table 1.

According to the Heaton, Grant, and Matthews (HGM) T-scores, individual test scores fell in the normal range on six of the seven measures. Even though the Impairment Index fell in the impaired range, it was argued that it did so only because it was based on measures that were not adjusted for age and education, and thus was an inaccurate measure.

The NDS scores developed by Reitan and Wolfson (1988, 1993) were based on a review of many published studies and years of clinical application. Rather than depending on only one large sample, we were interested in taking advantage of consistency implicit in the many studies that have used the HRB. NDS scores for individual tests are represented on a four-point scale: 0 = a perfectly normal to superior performance; 1 = a score that is not exceptional, but still in the normal range; 2 = mild to moderate impairment; and 3 = severe impairment. In this case, the NDS scores showed an exactly opposite result from the HGM T-scores. Using NDS scores, six of the seven tests fell in the impaired range, and three of these scores represented severe impairment.

Since this case was involved in litigation, the plaintiff's neuropsychologist argued that the client was significantly impaired according to standards developed both clinically and in an extensive number of peer-reviewed publications; the neuropsychologist for the defense argued that outdated methods of evaluation were used by the plaintiff's neuropsychologist and that application of modern methods, taking age and education in consideration, showed that the client essentially fell in the normal range and had no significant impairment.

Obviously, striking disparities can result when conclusions are based upon conventional interpretation of the test data compared with the interpretation of the same data transformed according to the Heaton, Grant, and Matthews tables. Resolution of this disparity is clearly needed in clinical as well as forensic settings.

Review of Literature Concerned with Age and Education Transformations of HRB Scores

A number of critiques and rejoinders, as well as some empirical data, have been published about applying age and education corrections using the procedure proposed by Heaton et al. (1991), and the considerations raised in this paper cannot be evaluated fully without a review of these publications.

In 1991 Heaton, Grant, and Matthews published their book containing the procedures for transformation of raw scores for the HRB and other tests to T-scores adjusted for age and education. A review of this book by Fasteneau and Adams (1996) had a number of criticisms of the procedures used by Heaton, Grant, and Matthews.

Heaton et al. (1991) had created normative adjustments based upon ten age categories ranging from 20 years to 80 years and six education categories ranging from 6 to 18 years and more. Thus, adjustments were available for a total of 60 age versus education cells. Fasteneau and Adams (1996) noted that "a minimum of 15 subjects per cell is a rule of thumb for theoretical applications; 30 is a good minimum for norms in clinical decisions" (p. 446). They noted that the Heaton, Grant, and Matthews sample, based on "simple arithmetic yields a maximum of four people per subgroup, with as few as one person or even none at all representing at least some cells for some measures."

Heaton, Grant, and Matthews indicated that 244 men and 134 women were used to develop the normative samples. Thus, the total number of subjects used by Heaton, Grant, and Matthews appears to be insufficient. It should be noted, however, that Heaton, Grant, and Matthews pointed out that decisions of either a theoretical or clinical nature were not based upon the number of persons in each cell, but rather upon the overall trend represented for age and education.

Fasteneau and Adams (1996) also contended that distributions of age, education, and gender had not been demonstrated to be of significance for all of the tests used by Heaton, Grant, and Matthews. They said that these three variables were influential in only 14 of 54 measures, and considering the number of cells used for both men and women, the variables of age, education, and gender were not appropriate for the majority of the measures used. They also indicated that at the time of their review no one had presented a single substantive review or critique of the normative transformations proposed by Heaton, Grant, and Matthews.

Finally, Fasteneau and Adams (1996) questioned, "Can data generated on non-patient volunteers provide a useful benchmark to make statements about brain damage or dysfunction?" This comment relates to the fact that the normative data were based only on subjects without evidence of brain damage, whereas the interest of neuropsychologists is generally oriented toward identification or diagnosis of brain damage and its behavioral correlates.

Fasteneau and Adams (1996) offered pointed comments about the limited number of subjects in each cell, against which any individual might be compared on the basis of age and education. They cautioned that ". . . consumers should recognize the very significant limitations of the current volume and, if they use these norms, they should qualify their interpretations with professionally responsible statements such as those described in this review" (p. 448). This statement obviously was intended to emphasize the limited number of subjects in each cell that corresponded with the age and education of the person being evaluated.

Heaton, Matthews, Grant, and Avitable (1996) responded to the Fasteneau and Adams (1996) critique. They referred to the 486 normals, which included the group on which they based their normative data plus the group on which they based their validational data. They reported correlations of age and education with the Average Impairment Rating (AIR) for both the group of normal subjects (N = 486) and a group of brain-damaged subjects (N = 463). The correlation of the Average Impairment Rating with age was .71 for normals and .50 for brain-damaged subjects. The correlation of the AIR with education was .60 for the normals and .29 for the brain-damaged subjects.

A coefficient of .29 would account for only about 9% of the variance in the AIR distribution on the basis of age. It is apparent from these values that age and education, individually, had a significant effect on the AIR among normals, but this effect was substantially reduced in the group with brain damage. In fact, the authors state that age and education combined accounted for 64% of the AIR variance among normals, but for only 33.6% among brain-damaged subjects. The authors state that the diminished variance in test scores accounted for by age and education in the brain-damaged group is "to be expected due to their neurologic disorders, the severity of which may be unrelated to age and education" (p. 453).

However, the HGM procedure of adjusting HRB test results for age and education requires transformation to T-scores using tables based upon age and education. Heaton, Matthews, Grant, and Avitable (1996) transformed raw scores to age- and education-adjusted T-scores and found that differences between normal and brain-damaged groups were statistically significant, even though the T-score conversions had essentially eliminated the effects of age and education. The accuracy of classification of control subjects (specificity) and brain-damaged subjects (sensitivity) was determined for the AIR (which did not use age adjustments) and for the T-scores (which did use age and education adjustments). The T-score values were associated with a greater incidence of correct classifications of both controls and brain-damaged subjects: the controls rose from 82.5% to 85.8%, and the brain-damaged subjects rose from 66.1% to 75.8%.

These results might be considered quite impressive, except that analyses based upon General Neuropsychological Deficit Scale (GNDS) scores (without any corrections for age or education) achieved a 90% level of correct classifications for controls and a 92% level for correct classifications for brain-damaged subjects (Reitan & Wolfson, 1993). We would presume that the improved level of correct classifications using the GNDS resulted from the fact that the GNDS was designed to include all four methods of inference for detecting the presence of brain damage.

We should mention explicitly that Heaton, Matthews, Grant and Avitable (1996) argue that the number of subjects included in each cell, among the 60 cells used for age and education transformations, is not the pertinent point, because the T-scores were based on regression analyses using more than 350 subjects. The specific tables for each age and education category representing the 60 cells were designed for the convenience of transformations to T-scores in the individual case.

Heaton, Matthews, et al. (1996) felt that their age and education "corrections" achieved the following: (1) eliminated age and education effects from the test results, (2) showed similar error rates across age and education distributions, (3) showed similar results in clinical and normative groups, and (4) produced improved clinical accuracy rates compared with the AIR in correct classification of both controls (specificity) and brain-damaged persons (sensitivity). However, they also pointed out that "further research is needed to examine the performance of the T-scores with demographically diverse samples of subjects who have documented brain disorders" (p. 457). They felt that their age and education normative transformations, as compared with alternative technologies for interpreting performances on neuropsychological tests, "indicate that the norms do have significant advantages for neuropsychological clinical work and research" (p. 457).

Heaton, Ryan, Grant, and Matthews (1996) published a summarical position paper about the validity and significance of adjustments of neuropsychological test scores on the basis of age and education. In this paper they provided a detailed statement of the importance of attribute variables such as age, education, and gender as influences on neuropsychological test results in support of their investigations of these variables, and concluded that "the importance of such research cannot be overemphasized . . . and are critical to neuropsychology" (p. 143). As a final statement, these authors concluded that "without these [thorough and comprehensive norms], the interpretation of test scores will continue to rely more on educated guesswork than on science" (p. 160).

Heaton, Ryan et al. (1996) reached these conclusions without consideration of or reference to the many studies which have reported excellent validity findings without the use of any age, education, or gender "corrections." In an initial study using 50 controls matched in pairs with 50 brain-damaged subjects, not a single brain-damaged person earned a better Impairment Index than his matched control (Reitan, 1955). Over and beyond this finding, each of the seven tests on which the Impairment Index is currently based individually showed extreme probability levels in comparison of the two groups.

Wheeler, Burke, and Reitan (1963) analyzed HRB test results for four groups: Controls, Diffuse Cerebral Damage, Left Cerebral Damage, and Right Cerebral Damage using discriminant function analyses of neuropsychological test scores to determine classification of subjects to their correct groups. The results were as follows: Controls versus all categories of brain damage, 90%; Controls versus Left Cerebral Damage, 93.0%; Controls versus Right Cerebral Damage, 92.4%; Controls versus Diffuse Cerebral Damage, 98.8%; and Right Cerebral Damage versus Left Cerebral Damage, 92.9%. These results, based on HRB test scores alone, were the product of statistical analyses of the test scores, and were not adjusted for age or education. It would hardly seem that age and education adjustments were necessary to rescue the test data from a fate of "educated guesswork (rather) than science."

Wheeler et al. (1963) also determined the percentages of correct predictions of brain-damaged versus control subjects for each of 36 tests. In predicting membership in the control group, the 24-variable discriminant function, on which the above percentages were determined was the most accurate, followed by the Impairment Index. Two measures based on critical flicker frequency were the least sensitive measures, ranking 35 and 36 among the 36 variables, and these test scores have been dropped from the HRB. Age and education were the next least accurate, ranking 33 and 34 among the 36 measures. This result provides further evidence of the limited significance of age and education as neuropsychological measures in comparison with the tests developed specifically to assess the integrity of brain functions.

In an earlier publication, Heaton, Grant, Anthony, and Lehman (1981) evaluated the accuracy of clinical conclusions about various aspects of brain damage as compared with the Average Impairment Rating (AIR). Ratings by two clinicians, based on HRB protocols, yielded accuracy rates of 86.7% and 90.7% in differentiating between the presence and absence of brain damage, whereas only 75.3% were correctly predicted by the AIR. The clinicians were also more accurate in identifying lateralization of brain damage (left cerebral damage: clinicians, 71.1% and 80.6% versus 54.1% for the AIR; right cerebral damage: clinicians, 85.7% and 80.0% versus 72.3% for the AIR). Although age and education were included with the test scores in the evaluation process, the HRB scores were not adjusted or "corrected" on the basis of these variables. These results were not much different than the accuracy rates (noted above) that were achieved with T-scores adjusted for age and education, and they support the accuracy of clinical conclusions, as well as inter-judge reliability, in drawing inferences about brain damage (sensitivity) versus no brain damage (specificity) on the basis of raw HRB scores alone, without the use of age and education adjustments.

An noted previously, the General Neuropsychological Deficit Scale (GNDS) was developed to convert the HRB test data into a summary score that could be used directly in clinical evaluation. The GNDS is a direct reflection of raw test scores without any weighting for differential sensitivity among the 42 variables on which it is based or corrections for age and education. Published results (Reitan & Wolfson, 1988, 1993) were based on 169 persons with a variety of brain diseases and injuries and 41 controls who had been carefully examined medically for possible brain impairment or damage and for whom no such evidence was found. Many of the controls, however, were hospitalized and experiencing the stress that accompanies potentially serious illness.

A cutting point for the GNDS distributions was determined for differentiation of the two groups which classified 90% of the controls correctly (showing excellent specificity) and 92% of the brain-damaged persons correctly (showing excellent sensitivity). These values can be compared with the report of Heaton, Ryan, et al. (1996) based on T-scores on which age and education corrections had been made. As noted previously, these authors reported accuracy rates of 85% for controls and 75.8% for brain-damaged subjects. These results, especially for the brain-damaged group, were clearly below the rate achieved with the GNDS. From a clinical viewpoint, it is obviously of greater interest to recognize the effects of brain damage and to classify brain-damaged subjects correctly than to recognize normal scores and classify control subjects correctly.

Golden and van den Brock (1998) performed separate statistical analyses of Heaton, Grant, and Matthews T-scores and raw scores from the Halstead-Reitan Battery obtained in examination of 64 persons with focal cerebral lesions. They concluded that the "results certainly suggest that the patterns of scores in focal injuries generated by these [scoring] systems are indeed different," and that "one approach may not be appropriate for the other" (p. 163). These investigators found that raw scores showed evidence of more impairment than did scores based on the same data adjusted for age and education using the Heaton, Grant, and Matthews procedure.

Statistical analyses of data are entirely objective and impartial and are subject to inferential conclusions about chance probability. However, in practice, they can only serve as guidelines. The ultimate decisions (diagnoses) in the individual case are based on clinical judgments. Such clinical judgments may be viewed as less "scientific" than statistical analyses, but in actuality are ultimately required when evaluating the test results of individual clients. A valid question therefore concerns the validity of clinical conclusions about individuals. The ability to draw correct conclusions about individual persons is, in fact, "critical to neuropsychology" at a level well beyond corrections for age, education, and gender. The field was well aware of the critical importance of being able to reach correct clinical judgments long before issues of age and education were raised. In 1964 Reitan tested the accuracy of the Halstead-Reitan Battery, depending only on evaluation and interpretations of the test results without the use of age, education, or gender corrections, and the resulting findings would suggest that such adjustments are hardly "critical to neuropsychology" as claimed.

Based on neurological, neurosurgical, and neuropathological sources of information, we identified four individuals, each with either an intrinsic tumor, an extrinsic tumor, a focal traumatic lesion, or a focal vascular lesion, within each of the four lesion categories, which consisted of left anterior, left posterior, right anterior, and right posterior cerebral locations. We therefore had a total of 64 subjects with variously located cerebral lesions. Three additional groups were also included in the study: 16 subjects with diffuse cerebral damage or dysfunction due to cerebral vascular disease, 16 subjects with closed head injuries, and 16 subjects with multiple sclerosis. The inclusion of these groups provided a total of 112 patients. The results of the Halstead-Reitan Neuropsychological Battery were then reviewed to make "blind judgments" about the type and location (focal or diffuse) for each subject. The purpose, of course, was to determine the accuracy of inferences about brain lesions when based only on neuropsychological test data.

Using the neurological ratings as the criterion, there were 64 subjects with focal cerebral lesions. Of these subjects, 57 were classified correctly on the basis of psychological testing, and 7 were judged to have diffuse damage. In the group of 48 subjects with diffuse cerebral damage, 46 were classified correctly and 2 were judged to have focal lesions. The number of correct classifications on the basis of psychological test results for each of the lesion locations was as follows: left anterior, 9; left posterior, 11; right anterior, 7; and right posterior, 15. Thus, 42 of 64 patients were placed in their correct groups. Adding to this the correct classification of 46 of the 48 subjects with diffuse cerebral involvement, 88 of the 112 subjects were correctly classified.

For type of lesion, the 112 subjects were classified as follows on the basis of complete medical and surgical findings: intrinsic tumor, 16; extrinsic tumor, 16; cerebral vascular disease, 32 (16 focal, 16 diffuse); head injury, 32 (16 focal, 16 diffuse); and multiple sclerosis, 16. The number of correct classifications on the basis of psychological inferences was as follows: intrinsic tumor, 13; extrinsic tumor, 8; cerebral vascular disease, 28; head injury, 30; and multiple sclerosis, 15. Thus, 94 of the 112 patients were correctly classified according to type of lesion. Additionally, 13 of the 16 subjects with focal cerebral vascular disease were classified correctly, and 12 of these 13 were judged to have focal lesions. Of the 15 subjects with diffuse cerebral vascular disease who were correctly placed in this category, 14 were judged to have diffuse cerebral vascular disease. Thirty of the 32 head injury subjects had been identified correctly on the basis of their psychological test results, and 27 of these 30 had been classified correctly according to whether the lesion was focal or diffuse.

This degree of concurrence between neurological and neuropsychological ratings could scarcely have happened by chance. While adequate specificity statistics had been produced in earlier studies for brain-damaged and control groups, these results produced impressive specificity findings for both localization and type of brain damage based on HRB test scores. The results also confirmed that neuropsychological test results are differentially influenced by (1) focal and diffuse lesions, (2) which cerebral hemisphere is damaged, (3) frontal and nonfrontal lesions within the hemisphere involved, (4) intrinsic tumors, extrinsic tumors, cerebral vascular lesions, head injuries, and multiple sclerosis, (5) focal occlusion as compared with generalized insufficiency in cerebral vascular disease, and (6) focal as compared with diffuse damage from head injuries.

These findings, as noted, were based on test results that had not been adjusted for age or education, although information about these variables was known for each person at the time the ratings were made. Evidence regarding the extent to which neuropsychological findings relate to the differential aspects of brain pathology is particularly important in the forensic area inasmuch as critical questions so often concern the questions about possible brain damage.

The Use of Age and Education in Clinical Evaluations

The reader may be interested in the manner in which age and education data are used in clinical evaluation of the individual person, since such data frequently do serve a useful role. Many examples have been provided in addition to a review of published studies (Reitan, 1979). It should be recognized that we have routinely found these variables to be related to neuropsychological test scores among non-brain-damaged persons, and have never proposed that brain-damaged subjects are entirely immune to these effects.

Our position in this regard, as stated previously, hinges on the sensitivity and specificity of tests to brain damage. Among persons with brain damage, the more sensitive and specific the test is to brain damage, the less the available variance that may be determined by other variables such as age and education. After all, sensitivity and specificity are critical requirements of neuropsychological tests. If neuropsychological tests were not sensitive to brain damage, but were influenced principally by other factors, they could hardly serve as indicators of brain status.

Thus, we propose that an inverse relationship exists between the sensitivity and specificity of a test to brain damage and the extent to which age and education may determine the results of that test among persons with brain damage. This proposal carries with it many complexities, considering the different ways in which subcategories of brain damage (left versus right cerebral lesions, for example) may affect test scores differentially. In general, however, neuropsychological interpretation of the HRB in the individual case provides relevant conclusions that clarify considerations of this type.

Basically, the procedure in interpretation of individual protocols is first to determine whether evidence of brain damage is present. (Unfortunately, many "flexible" batteries cannot provide an answer to this question.) If brain damage is present, one can expect the tests most sensitive to brain damage to be impaired, regardless of the age or education of the client, and one would proceed to consider the full range of additional dimensions included under the general rubric of brain damage.

The effects of brain damage are obviously quite variable, depending upon location, type, status, etc. of damage caused by the injury or disease, but these effects occur essentially regardless of age or education. It should be noted that tests which are not specifically sensitive to brain damage may be relatively spared, and may still largely reflect an individual's age and educational status. Other tests may occupy an intermediate position, still retaining age and/or education effects even though they have been influenced to a degree by brain damage. Clearly, as with all clinical evaluations, the knowledge and competence of the clinician are major factors in understanding the complexities implicit in the test data. Unless the clinician is informed about the ways in which age and education variables affect the test results, and which tests are more and less sensitive to brain damage, the above recommendations can hardly be implemented.

It is beyond the scope of this paper to attempt a full exposition of the processes by which valid clinical conclusions are reached in evaluation of neuropsychological test data (see Reitan & Wolfson, 1986b, 1988, 1993). With respect to age, education, and brain damage, the general rule is to look for definitive indications of brain damage which, if found, would stand as valid indications regardless of the subject's age or education. If no indications of brain damage are found, the usual effects of age and education should be considered as appropriate.

Our usual procedure is to review the raw-score test data initially and to record written conclusions at least in brief form, and then turn to any available history and/or neurological diagnostic information for correlation with the test results. The sequence with which these categories of information are reviewed varies among individual neuropsychologists. If knowledge of the presence of brain damage is known before the neuropsychological test records are reviewed, such information may provide a basis for expecting the effects of the damage to have a distinct influence on any effects of education or age.

Neuropsychologists were aware of the importance of understanding the differential and interactional effects of age and education variables long before the current issues about corrections or adjustments arose, and many studies were performed to evaluate the effects of age and education with relation to psychological and neuropsychological test results. Some of this research has been summarized by Reitan (1970a) and Reitan and Wolfson (1986a). Implementation of the plan for clinical interpretation, briefly described above, is greatly facilitated by knowledge of which neuropsychological tests are most sensitive to brain damage, which tests are most affected among normals by aging, and which tests are relatively unaffected by brain damage but are influenced by age and education. Clinical experience involving many brain-damaged and non-brain-damaged persons is essential in learning to understand the interactions of test results, and this type of knowledge can only be gained in any detail by using a constant basic test battery.

There is nothing at this point to restrict comparison of the individual's test results with those of persons either with or without brain damage. In fact, such comparisons represent the fundamental basis for conclusions about the presence or absence of brain pathology. The point of this paper is not to limit or restrict such comparisons; our concern is to make such comparisons with a full and accurate knowledge of the groundwork and procedures on which the comparisons are based. If, for example, a subject's occupation was known to have a specific and relatively invariant influence on the score obtained on a particular test, the neuropsychologist would want to consider occupation as a variable and the possible differential significance of the particular occupation among persons with normal as opposed to damaged brains. Obviously, consideration of the subject's occupation would lose interest if research results had shown that the particular occupation lost its significance regarding the test score if brain damage was present.

Conceptual Considerations Concerning Age and Education Transformations

Table 2 of Heaton, Matthews, et al. (1996) makes it quite clear that the raw Average Impairment Rating (AIR) scores of the least educated and older subgroups were lower than for any other subgroup, and the presumption was that this was due to limited education and to older age. Thus, their rationale would be that raw scores in these subgroups were in need of "correction." However, there are many reasons for delving a little deeper into this explanation.

Corrections for age and education, even among supposedly normal persons, may represent a significant conceptual error, at least among older and less educated persons. Declining performances with advancing age among normals may, in fact, represent a correction for brain impairment. Neurological studies, completely separate from neuropsychological testing, have shown unequivocal findings of structural, physiological, and neurobehavioral changes that occur in the aging process. Many of these studies have been reviewed by Reitan and Wolfson (1992).

For example, findings on autopsy of presumably normal persons show many changes of the type seen in Alzheimer's disease, not only among subjects in North America, Great Britain, and European countries, but also in Japan. Matsuyama and Nakamura (1977) reported on their examination of 617 brains in Tokyo. These investigators deliberately excluded brains of patients who had evidence of psychosis or any conditions known to be predisposed to the types of pathological changes seen in Alzheimer's disease. Thus, these patients presumably represented a sample of subjects who might manifest the effects of normal aging but not brain disease.

The results of this study indicated a strong association between advancing age and the occurrence of neuropathological changes, including neurofibrillary tangles and senile plaques, which are the major pathological hallmarks of Alzheimer's disease. Such changes were present in approximately 50% of cases aged from 50-59, in more than 80% of cases aged 60-69, and in almost every patient over age 70. Changes of this kind represent a difference in degree of pathology in aging and Alzheimer's disease, but no apparent difference in kind.

Obviously, it is difficult to relate autopsy findings to the performance capabilities of living persons. However, it has been possible to compare the type and degree of neuropsychological deficits found with younger brain-damaged persons and the neuropsychological test results among older neurologically-normal persons. Reed and Reitan (1963) found statistically significant evidence that the neuropsychological tests that were most sensitive to brain damage in younger persons were also the tests that most strikingly reflected intellectual and cognitive impairment among older persons, even though this latter group was composed of persons who appeared, on clinical neurological examination and by history, to be aging normally. Thus, both the pathological changes of the brain and the neuropsychological findings among supposedly normally aging people are similar to findings of groups with definite brain damage.

The adjustments made among the elderly by the Heaton, Grant, and Matthews procedures are aimed at eliminating the effects of age and education differences. In effect, the "corrections" eliminate the neuropsychological impairment seen among older persons because this impairment is presumably "normal," even though independent evidence indicates that it is associated with pathological brain changes. It hardly seems surprising, then, that very poor scores are classified as normal when the HGM "corrections" are made. This effect is illustrated by the fact that a 55-year-old man with 17 years of education can still make 80 errors on the Category Test and be classified by the HGM transformation as having performed normally. The validity of a result of this kind is difficult to understand, considering that Reitan (1955) found that a control group made an average of 32 errors and a brain-damaged group made an average of 62 errors!

Thus, while HGM corrections are intended to eliminate the effect of age and education differences, thus providing for comparable T-scores across each distribution, it would seem that they are, in effect, at least partially eliminating the effects of brain pathology on neuropsychological test scores. This procedure would have its major effect in producing normal scores among brain-damaged persons, since the normative data on which the adjustments were based were derived, in part, from older persons and less-well educated adults whose poorer raw scores were very probably determined, in part, by brain impairment.

In summary, an abundance of evidence has linked older age to brain pathology, even among older persons without a clinical diagnosis such as Alzheimer's disease. There can hardly be any disagreement that these neuropathological changes are the bases for neuropsychological impairment. In fact, the authors of the special article on neuropsychological assessment (Neurology, 1996) state that aging negatively affects intellectual abilities, complex attentional processes, some aspects of memory, psychomotor speed, assessing word knowledge, visuospatial skills, some forms of reasoning, and complex motor problem-solving. Presumably, some factors related to the impairment may derive from limited sensory or motor functions, but a major component of the cognitive decline is due to gradually progressive brain impairment.

The fact that a substantial part of the need for corrections for education came from persons with lesser education calls for a similar explanation. Obviously, there are reasons that these people achieved a lesser education, and some of the reasons have to do with circumstances that would have no connection with the adequacy of brain functions. It is likely, however, that the lower educational level of these adults was due, in part, to limitations of brain functions, which in turn were expressed as lowered intelligence, lessened academic aptitude, and poorer test scores.

Heaton et al. (1991) do not seem to have appreciated the relationship of advanced age and limited education to impairment of brain functions, even though their sample had been determined to be neurologically normal. In the first paragraph of their book they cite the usefulness of neuropsychological tests in evaluation of brain disorders, but then state, "However, neuropsychological tests are not exclusively sensitive to brain pathology. It has long been recognized that performances on most of these instruments are strongly related to age and education . . ." (p. 3). This statement appears to deny the distinct likelihood that age and education, particularly at the extremes of the two distributions (older age and lesser education), almost certainly, in many cases, have a definite relationship to the biological status and adequacy of brain functions.

What, then, is the result of developing raw score/T-score transformations based on age and education? The result is to produce normal T-scores particularly for persons with older age and lesser education. In effect, the result is to produce normal scores for the parts of the distribution in which raw scores have been poorest, and poorest, at least in part, because of brain impairment. What, then, would be likely to happen if such adjustments were made in a group of persons who all have brain impairment? The expected result would be that they would score better when the adjustments were made (T-scores) than when no such adjustments had been made (raw scores. The above rationale is consistent with the findings reported by Reitan and Wolfson (1995a) in their study of relationships between age and education and GNDS scores. Among their control (non-brain-damaged) subjects, the correlation between age and education differed sharply for the older and younger halves of the distribution. The older half had a coefficient of 0.68 (showing that GNDS scores were poorer as age increased), whereas the younger half had a coefficient of only 0.10.

These adverse effects of aging, as shown on the GNDS, were entirely consistent with the expected neuropathological changes that develop with advancing age, even among persons classified as controls. Age adjustments in this group, done to eliminate age effects, are in large part equivalent to eliminating the effects of brain impairment. The very reason that neuropsychological tests are used is to identify (not eliminate) the effects of brain impairment. Thus, for the older segment of the so-called "normal" population, the transformations proposed by Heaton et al. (1991) appear to be conceptually flawed, inasmuch as aging is a variable that causes subtle brain damage in many so-called "normal" people.

The same line of reasoning applies to the segment of the "normal" population with limited education, even though neuropathological studies have not focused on this group as they have for the aged. As noted previously, multiple factors may be responsible for limited educational achievement, but adequacy of brain functions, resulting in limitations of intelligence and academic aptitude, is surely among them. For example, on brain-sensitive tests, would one normally expect persons with a low educational level to perform as well as persons with more education? Reitan and Wolfson (1995a), when dividing their control group at the 50% percentile for educational level, found that the lower 50% had a correlation of -0.56 (p<.01) between education and GNDS (the lower the education, the poorer the GNDS score), whereas the upper 50% had a correlation of 0.27, a value that did not reach significance considering the size of the group.

Undoubtedly, the overall role of age and education among neurologically-normal groups is hardly to be explained by postulating that subtle impairment of brain functions is the only (or perhaps, even major) factor for elderly persons or for the lesser educated. It would appear very likely, however, that brain impairment is one of the relevant variables, and the above considerations were presented in order to recognize the complexity of the entire situation. Can comparable procedures for effecting age and education "corrections" be used across the entire span of the age and education distributions, even among "normals?" Can age and education adjustments, aimed at removing the effects of these variables, be acceptable when they actually remove, at least in part, the effects of brain impairment - the very condition that the tests were designed to detect?

In summary, when one adjusts neuropsychological test scores for age, one adjusts the test scores for all of the additional variables that are significantly associated with age. These additional factors will be related to neuropsychological test scores to a varying degree. Some variables, such as reduced physical strength, reduced sexual drive, greying of hair, etc. may have little or no relationship to test scores, and therefore the adjustments of test scores for age will have little if any consequence. Motor speed and dexterity, reaction time, reasoning and logical analysis, and other neuropsychological functions may be more adversely affected by age, and poor scores on tests that require these skills would be changed more strikingly by age adjustments in the direction of normality. Brain impairment, which is generally but variably more pronounced with aging, could be a major factor, and tests that are particularly sensitive to brain impairment could be done very poorly and still be considered normal after age adjustments had been made.

Since both older age and limited education appear to be associated with variables that produce poorer raw scores on neuropsychological tests (such as pathological brain changes), the extent to which these associated variables produce lower scores will determine the degree of adjustment necessary to bring the performances up to a mean T-score of 50. In other words, the more sensitive the test is to these associated variables, the poorer the raw score can be and still be classified as normal after the adjustment is made.

Results of a Current Study

The present investigation was concerned with a direct comparison of transformations (called "corrections" by Heaton et al. [1991]) of raw scores from the Halstead-Reitan Neuropsychological Test Battery for Adults to T-scores by using the HGM tables and NDS scores by using the table provided by Reitan and Wolfson (1993). To summarize the information given previously, the HGM procedure first transforms raw scores into scaled scores, and the scaled scores are used to obtain T-scores (with a mean of 50 and a standard deviation of 10) depending upon the gender, age, and education of the subject.

The Reitan and Wolfson procedure makes no adjustment for age or education, but merely uses the raw score to determine the corresponding NDS score. NDS scores range from 0 (perfectly normal) to 3 (severely impaired), and were devised originally to provide information about the clinical significance of raw scores on each of the 42 variables in the HRB and to place the score for each test on a standard scale so that the scores could be added and thus provide a total score for the entire battery (the General Neuropsychological Deficit Scale score).

Purpose. Although there are many ways in which comparisons of the two methods of transforming raw scores could be evaluated, it seemed of value initially to use a procedure that utilized the fact that each method identified a cutting-point for separation of "normal" and "impaired" scores (HGM method: T-scores of 39 and less; Reitan and Wolfson method: NDS scores of 2 and 3). The purpose of this study was to determine and compare the frequency with which "normal" and "impaired" scores occurred in a sample of non-brain-damaged controls and a sample of persons with diffuse cerebral vascular disease.

Description of Groups. In order to avoid any possibility that group composition might have been influenced by the purposes of this study, the groups compared were ones that had been used in a prior study done years before the question of normative transformations was even proposed (Reitan, 1970b). The control group was composed of 26 men who were functioning normally in their occupations and had no reports of illness or injuries they felt might have compromised their intellectual and cognitive competence. The group had a mean age of 54.00 years (SD, 4.22) and a mean education of 14.73 years (SD, 2.75).

The group with diffuse cerebral vascular disease consisted of 26 men who had a mean age of 53.54 years (SD, 6.28) and a mean education of 11.00 years (SD, 3.03). The difference in age distributions between the two groups was negligible (t ratio, 0.27; p<.70). Education differences, however, were significant (t ratio, 4.55; p<.001).

It must be pointed out, as was done in the original report, that a conclusion of diffuse, generalized, or multifocal cerebral vascular disease is relatively difficult to reach as a firm differential neurological diagnosis. For this reason, detailed results of the clinical neurological examination, specialized neurological diagnostic procedures, frequency of complaints of a neurological nature reported by the patients, and associated conditions or diagnoses were included in the description of the group (Reitan, 1970b, pp. 159-160). The diagnosis of diffuse cerebral vascular disease was not dependent on any single one of these sources of information, but instead represented the definitive conclusion of the examining neurologist.

Procedure. In accordance with the purpose of this study, the raw scores for nine variables were transformed to both HGM T-scores and to Reitan and Wolfson NDS scores for each group. The number of scores falling in the impaired and normal ranges was determined for both groups according to each method. Differences in the distributions using the two methods were tested for significance using a chi-square test based on a four-celled table. These results permitted an inference of whether the distribution of normal and impaired scores, produced by the two methods, differed beyond chance expectancy.

Results. Table 2 presents means and standard deviations for the nine tests used in this study, calculated from raw scores, from T-scores derived from the HGM transformations and from Reitan and Wolfson's NDS scores. We have not included statistical comparisons of the groups, since this information was included in the original report (Reitan, 1970b). However, the controls consistently performed better than the group with cerebral vascular disease at statistically significant levels.

Our present purpose was to determine the frequencies with which normal and impaired scores were obtained when using the HGM and RW methods, and this information is presented in Table 3.

The control group, which had no evidence of brain involvement, consistently showed a greater frequency of normal scores than impaired scores, regardless of the transformation method that was used. Overall, the HGM method classified 87% of these scores in the normal range (T-scores of 40 or higher) and the RW method classified 76% in the normal range (NDS scores of 0 or 1). Thus, more of the controls showed no impairment using the HGM method than the RW method. A chi-square test of these distributions yielded a value of 15.39 (p<.001).

Conversely, the RW method consistently classified persons with diffuse cerebral vascular disease as having impaired scores more frequently than did the HGM method. The RW method classified 82% of the scores in the CVD group as being impaired, whereas the HGM method classified 63% as being impaired. The chi-squared comparison of these differences yielded a value of 16.74 (p<.001).

It is apparent from these results that the two methods, using the same raw-score data, differ significantly in the frequency with which the transformations identify impaired and normal scores. The HGM method identifies more scores as normal, whereas the RW method identifies more scores as impaired. The consistency of this observation is indicated by the finding that in the control group the HGM method showed a greater percentage of subjects earning normal scores on 8 of the 9 variables (with one tie) than did the RW method; in the CVD group, all nine tests showed a greater frequency of impairment using the RW method than when using the HGM method. There would seem to be no doubt that the two methods produce results that differ in their clinical significance.

Discussion. Some controls will, of course, produce some scores in the impaired range, and some brain-damaged subjects will produce some scores in the normal range. However, there is relatively little available information to use as a guideline for expectation of the frequency of impaired scores for persons who qualify as normal controls versus expectation of the frequency of normal scores for persons with evidence of brain damage (and even less information for persons with diffuse cerebral vascular disease). It might be noted, however, that there is a long historical record in clinical interpretation of indicating that control subjects may have Impairment Indexes of up to 0.3 (30% of the tests in the impaired range), with an Impairment Index of 0.4 (40% of the tests in the impaired range representing the borderline range.

On this basis, it might seem that as few as 13% of the scores in the impaired range among control subjects, as found with the Heaton, Grant, and Matthews transformations, may represent an underestimate of the number of "impaired" scores, especially considering the age of the subjects in this study. The standard interpretation of the Impairment Index, however, is based on test results that were not adjusted for age and education, and the age and education corrections of the HGM method may be producing more accurate estimates of improved normality and functional capabilities. The incidence of 82% of the patients with cerebral vascular disease earning impaired scores when using the Reitan and Wolfson criteria hardly seems surprising, and the finding that only 63% of the scores for the CVD group fell in the impaired range when using the HGM method may well be an underestimate.

Certain implications of the present findings, based on the groups studied, are quite clear. Using the HGM method to transform HRB raw scores results in generation of more scores in the normal range for controls than did using the Reitan-Wolfson method. Conversely, the Reitan-Wolfson method produced more scores in the impaired range for a group with diffuse cerebral vascular disease.

If these results are confirmed and are determined to represent a general conclusion, the findings will have distinct and definite significance not only in clinical evaluations, but also for forensic interpretations. HGM transformations of data will tend to indicate that a client is unimpaired, whereas Reitan-Wolfson transformations of data will tend to emphasize impairment. If a plaintiff is seeking compensation for brain impairment, use of HGM transformations would be of benefit for the defense, inasmuch as indications of impairment would tend to be minimized. However, in the same case, the indications of impairment might be emphasized through use of the Reitan-Wolfson transformations because the client's age and education were not represented in the scores. Obviously, if one method of transformation tilts the interpretation either toward normality or toward impairment more significantly than another method of transformation, resolution of the conflict is needed.

It must be pointed out explicitly that the empirical results reported in the study represented a limited investigation of this complex issue. The basic purpose of this paper was to stimulate further investigation, and this should be quite possible, considering the wealth of data on the Halstead-Reitan Battery that is available in the field. Not only the effect on test scores (as studied in this paper), but the effect on classifications of groups of brain-damaged and control subjects, according to age, type of lesion, lateralization and location of lesion, etc. must also be investigated. The urgency of such investigations is implicit, inasmuch as clinical conclusions are routinely being drawn daily, using transformations of raw scores, based on the age and education of non-brain-damaged subjects, applied to brain-damaged subjects, with limited empirically-derived knowledge of the accuracy of these transformations.

The need for such additional investigations is evident both for clinical interpretation and for use of neuropsychological data in a forensic setting. Many neuropsychological tests, over and beyond those referred to in this paper, may also be influenced by the age and education of the particular person being evaluated, and attenuation of age and education effects may be differentially affected in accordance with the sensitivity of the test to brain damage generally and, more specifically, to variables such as diffuse versus focal damage, the particular location of damage, and the status of the brain disease or injury. In addition, the interaction of age and education with variables describing brain involvement may differ depending upon whether the individual is young or old, well educated or poorly educated. Real-life variables are complex and intertwined. While the overall aim of science is to gain valid generalizations, expert and valid conclusions as clinicians and forensic experts focus on the individual and we must continually be aware of the uniqueness of every individual person.

References

Assessment: Neuropsychological testing of adults. Considerations for neurologists. 1996. Report of the Therapeutics and Technology Assessment Subcommittee of the American Academy of Neurology. Neurology, 47, 592-599.

Fasteneau, P.S., & Adams, K.M. 1996. Heaton, Grant, and Matthew's Comprehensive Norms: An overzealous attempt. Journal of Clinical and Experimental Neuropsychology, 18, 444-448.

Golden, C.J., & van den Brock, A. 1998. Potential impact of age- and education-corrected scores with HRNB scores patterns in participants with focal brain injury. Archives of Clinical Neuropsychology, 13, 683-694.

Heaton, R.K., Grant, I., Anthony, W.Z., & Lehman, R.A. 1981. A comparison of clinical and automated interpretation of the Halstead-Reitan Battery. Journal of Clinical Neuropsychology, 3, 121-141.

Heaton, R.K., Grant, I., & Matthews, C.G. 1991. Comprehensive norms for an expanded Halstead-Reitan Battery. Odessa, FL: Psychological Assessment Resources.

Heaton, R.K., Matthews, C.G., Grant, I., & Avitable, N. 1996. Demographic corrections with comprehensive norms: An overzealous attempt or a good start? Journal of Clinical and Experimental Neuropsychology, 18, 449-458.

Heaton, R.K., Ryan, L., Grant, I., & Matthews, C.G. 1996. Demographic influences on neuropsychological test performance. In I. Grant & K. Adams (Eds.), Neuropsychological assessment of neuropsychiatric disorders. (2nd ed.) (pp. 141-163). New York: Oxford University Press.

Jarvis, P.E., & Barth, J.T. 1994. The Halstead-Reitan Neuropsychological Battery: A Guide to Interpretation and Clinical Application. Odessa, FL: Psychological Assessment Resources.

Matsuyama, H., & Nakamura, S. 1977. Senile changes in the brain in the Japanese: Incidence of Alzheimer neurofibrillary and senile plaques. In R. Katzman, R.D. Terry, & K.L. Bick (Eds.), Alzheimer's disease: Senile dementia and related disorders. Aging, Vol. 7 (pp. 287-297). New York: Raven Press.

Oestreicher, J.M., & O'Donnell, J.P. 1995. Validation of the General Neuropsychological Deficit Scale with nondisabled, learning-disabled, and head-injured young adults. Archives of Clinical Neuropsychology, 10, 185-191.

Reed, H.B.C., & Reitan, R.M. 1963. A comparison of the effects of the normal aging process with the effects of organic brain damage on adaptive abilities. Journal of Gerontology, 18, 177-179.

Reitan, R.M. 1955. An investigation of the validity of Halstead's measures of biological intelligence. Archives of Neurology and Psychiatry, 73, 28-35.

Reitan, R.M. 1964. Psychological deficits resulting from cerebral lesions in man. In J.M. Warren & K.A. Akert (Eds.), The frontal granular cortex and behavior. New York: McGraw-Hill.

Reitan, R.M. 1970a. Measurement of psychological changes in aging. Duke University Council on Aging and Human Development, Proceedings of Seminars.

Reitan, R.M. 1970b. Objective behavioral assessment in diagnosis and prediction. In A.L. Benton (Ed.), Behavioral change in cerebrovascular disease (pp. 155-165). New York: Medical Department, Harper & Row.

Reitan, R.M. 1979. Neuropsychology and aging. Tucson, AZ: Neuropsychology Press.

Reitan, R.M., & Wolfson, D. 1986a. The Halstead-Reitan Neuropsychological Test Battery and aging. In T.L. Brink (Ed.), Clinical gerontology: A guide to assessment and intervention (pp. 39-62). New York: Haworth.

Reitan, R.M., & Wolfson. D. 1986b. Traumatic brain injury. Vol. I. Pathophysiology and neuropsychological evaluation. Tucson, AZ: Neuropsychology Press.

Reitan, R.M., & Wolfson, D. 1988. Traumatic brain injury. Vol. II. Recovery and rehabilitation. Tucson, AZ: Neuropsychology Press.

Reitan, R.M., & Wolfson, D. 1992. Neuroanatomy and neuropathology: A clinical guide for neuropsychologists (2nd ed.). Tucson, AZ: Neuropsychology Press.

Reitan, R.M., & Wolfson, D. 1993. The Halstead-Reitan Neuropsychological Test Battery: Theory and clinical interpretation (2nd ed). Tucson: Neuropsychology Press.

Reitan, R.M., & Wolfson, D. 1995a. Influence of age and education on neuropsychological test performances of older children. Child Neuropsychology, 1, 165-169.

Reitan, R.M., & Wolfson, D. 1995b. Influence of age and education on neuropsychological test results. The Clinical Neuropsychologist, 9, 151-158.

Reitan, R.M., & Wolfson, D. 1996a. Can WISC-R IQ values be computed validly for learning-disabled children? Applied Neuropsychology, 3, 15-20.

Reitan, R.M., & Wolfson, D. 1996b. Differential relationships of age and education to WAIS subtest scores among brain-damaged and control groups. Archives of Clinical Neuropsychology, 11, 303-311.

Reitan, R.M., & Wolfson, D. 1996c. The diminished effect of age and education on neuropsychological performances of learning-disabled children. Child Neuropsychology, 2, 11-16.

Reitan, R.M., & Wolfson, D. 1996d. Relationships of age and education to Wechsler Adult Intelligence Scale IQ values in brain-damaged and non-brain-damaged groups. The Clinical Neuropsychologist, 10, 293-304.

Reitan, R.M., & Wolfson, D. 1997. The influence of age and education on neuropsychological performances of persons with mild head injuries. Applied Neuropsychology, 4, 16-33.

Reitan, R.M., & Wolfson, D. 1999. The influence of age and education on neuropsychological performances of persons with mild traumatic brain injuries. In M.J. Raymond, T.L. Bennett, L.C. Hartlage, & C.M. Cullum (Eds.), Mild traumatic brain injury. Austin: Pro-ed.

Rojas, D.C., & Bennett, T.L. 1995. Single versus composite score discriminative validity with the Halstead-Reitan Battery and the Stroop Test in mild head injury. Archives of Clinical Neuropsychology, 2, 101-110.

Rourke, B.P. 1975. Brain-behavior relationships in children with learning disabilities: A research program. American Psychologist, 30, 911-920.

Rourke, B.P. (Ed.). 1985. Neuropsychology of learning disabilities: Essentials of subtype analysis. New York: Guilford Press.

Rourke, B.P. 1989. Nonverbal learning disabilities: The syndrome and the model. New York: Guilford Press

Wheeler, L., Burke, C.J., & Reitan, R.M. 1963. An application of discriminant functions to the problem of predicting brain damage using behavioral variables. Perceptual and Motor Skills, [Monograph supplement], 16, 417-440.


 

Table 1. Test results for a 51-year-old man in which the Heaton, Grant, and Matthews
T-scores and the Reitan and Wolfson NDS scores disagree.
Test Raw Score HGM T-Score R & W NDS
Category 82 42 3
Tactual Performance Test
       Total Time 21.7 44 2
       Memory 6 42 1
       Localization 2 46 3
Speech-sounds Perception Test 20 33 3
Rhythm Test 23 43 2
Finger Tapping - Preferred Hand 49 46 2
Impairment Index 0.9 29 3


Table 2. Means and standard deviations for a control group and a group with diffuse cerebral vascular disease, representing raw scores, Heaton, Grant, and Matthews transformations, and Reitan and Wolfson transformations.
 
Group Category TPT Time TPT Mem TPT Loc Rhythm Speech-
Sounds
Tapping Trails A Trails B
Raw Scores
Controls M 56.60 14.30 6.96 3.15 27.23 4.15 52.73 33.08 74.77
  SD 29.05 3.78 1.18 1.87 1.99 2.98 3.96 10.15 23.63
CVD M 85.00 33.05 4.58 1.69 19.27 17.38 38.27 84.15 284.42
  SD 36.81 13.56 2.67 1.89 4.39 13.99 12.48 67.53 268.45
HGM T-scores
Controls M 44.35 52.50 45.00 48.58 55.08 53.35 52.38 48.00 51.31
  SD 11.48 7.52 9.38 7.35 9.99 8.42 6.73 9.20 7.67
CVD M 37.34 35.92 35.42 42.50 39.00 37.04 36.46 34.54 34.65
  SD 11.06 13.79 14.66 9.25 14.00 7.03 11.67 10.00 14.76
RW NDS Scores
Controls M 1.73 1.31 0.92 1.27 0.69 0.15 1.00 1.00 1.04
  SD 1.04 0.79 0.84 0.92 0.76 0.46 0.69 0.80 0.96
CVD M 2.54 2.50 1.88 2.65 2.12 2.00 2.35 2.38 2.50
  SD 0.71 0.71 1.14 0.75 1.14 0.94 0.94 0.80 0.99


 

Table 3. Percentage of scores for 26 persons which were classified as Normal or Impaired using the Heaton, Grant, and Matthews transformations and the Reitan and Wolfson transformations.
 
  Category % TPT Time % TPT Mem % TPT Loc % Rhythm % Speech-
sounds %
Tapping % Trails A % Trails B % Total %
HGM Transformations

Controls
 
Normal 69 96 69 92 92 100 96 77 92 87
Impaired 31 4 31 8 8 0 4 23 8 13
RW Transformations

Controls
 
Normal 65 65 69 85 85 96 77 77 65 76
Impaired 35 35 31 15 15 4 23 23 35 24
HGM Transformations

CVD
 
Normal 50 42 35 46 38 31 42 31 19 37
Impaired 50 58 65 54 62 69 58 69 81 63
RW Transformations

CVD
Normal 12 12 27 8 27 27 23 12 12 18

 
Impaired 88 88 73 92 73 73 77 88 88 82

 

© Copyright Reitan Neuropsychology Laboratory 2003