T1-weighted sagittal magnetic resonance images showing cerebellar regions of interest. 1 indicates primary fissure; 2, prepyramidal fissure. Lobuli VI-VII are located between areas 1 and 2. Left, A 13-year-old male subject with acute lymphoblastic leukemia. Right, A 13-year-old male control subject.
T1-weighted coronal magnetic resonance images from a volume acquisition representing prefrontal cortex regions of interest. Left, A 10-year-old female subject with acute lymphoblastic leukemia. Right, A 10-year-old female control subject.
Lesnik PG, Ciesielski KT, Hart BL, Benzel EC, Sanders JA. Evidence for Cerebellar-Frontal Subsystem Changes in Children Treated With Intrathecal Chemotherapy for LeukemiaEnhanced Data Analysis Using an Effect Size Model. Arch Neurol. 1998;55(12):1561-1568. doi:10.1001/archneur.55.12.1561
Copyright 1998 American Medical Association. All Rights Reserved. Applicable FARS/DFARS Restrictions Apply to Government Use.1998
Following brain insult in early childhood, the later maturing neocerebellum and frontal lobes frequently show abnormalities.
To investigate the morphologic characteristics and function of a proposed cerebellar-frontal subsystem in children treated for acute lymphoblastic leukemia (ALL) with intrathecal methotrexate using quantitative magnetic resonance imaging, neuropsychological measures, nonlinear multiple regression analysis, and a statistical effect size model that augments interpretive validity of nonsignificant statistical findings, particularly from small sample size studies.
Comparison and relationship of magnetic resonance imaging morphometry of cerebellar lobuli I-V and VI-VII and prefrontal cortices, and performance on 5 neuropsychological tests assessing visual-spatial attention, short-term memory, and visuomotor organization and coordination between childhood survivors of ALL and a matched control group.
Ten childhood survivors of ALL treated between 1982 and 1989 with standard 3-year intrathecal chemotherapy, and matched control subjects.
Main Outcome Measures
Morphometric results of cerebellar lobuli I-V and VI-VII and prefrontal cortices, and results of Trail-Making Tests, Rey-Osterreith Complex Figure Test, WISC-III Coding.
Significant effect size model values for outcome measures in the ALL group support deficits in lobuli VI-VII and prefrontal cortices, and neuropsychological performance. Multiple regression analysis results were consistent with hypothesized involvement of a cerebellar-frontal brain subsystem.
Treatment of children with ALL with intrathecal methotrexate before 5 years of age has structural and functional effects on the developing neocerebellar-frontal subsystem.
INTRATHECAL methotrexate impairs proliferation of leptomeningeal leukemic cells, and is a major component of preventive central nervous system (CNS) treatment for childhood acute lymphoblastic leukemia (ALL).1 Reported neuroanatomical changes following methotrexate treatment include cerebral calcifications and atrophy,2,3 and children treated with both cranial irradiation (240 Gy) and methotrexate chemotherapy before the age of 5 years have shown decreases in measures of cerebellar morphology.4,5
Consistently, results from neuropsychological studies on clinical populations have shown the cerebellum and prefrontal lobes related to deficits in attention, spatial working memory, and planning.6- 8 Attention and short-term memory deficits, particularly in the visual-spatial-motor domain, have previously been reported in survivors of ALL.3- 5,9- 11
The present study investigated the morphometry of the cerebellar vermis in children who were treated before the age of 5 years with intrathecal chemotherapy, but with no cranial irradiation. We hypothesized that if cerebellar changes as a consequence of exposure to intrathecal methotrexate were found, they would follow our developmental chronometry hypothesis,5 which predicts that the later developing neocortical posterior vermis (lobuli VI-VII) would be more affected than the earlier maturing anterior part (lobuli I-V). Further, it was hypothesized that the relatively later developing cerebellar lobuli VI-VII and prefrontal cortices are components of the brain that, influenced by their corresponding protracted course of development and neural connections, may organize into a subsystem; ie, interrelated and interacting components of the CNS involved in a particular process or function.12 Accordingly, it was hypothesized that if the later developing neocerebellum was found to be abnormal, there would be a corresponding abnormality in later developing prefrontal cortices, and poorer performance on visual-spatial attention, short-term memory, and visuomotor organization and coordination tests.
We used a statistical effect size (ES) model of enhanced data description and analysis by reporting ES and post hoc power analyses,13,14 and confidence intervals. "Effect size" is a standardized quantitative index that can represent the magnitude of change that one variable produces in another variable as reflected in the difference between 2 means.13,15 Power is the probability that a statistical test will yield a statistically significant result and the null hypothesis will be rejected.13 The most feasible way for a researcher to increase the power of a statistical test is to increase sample size.13,16 Because often only small samples are available in clinical neuroscience research, the knowledge of ES model values is particularly valuable for interpreting results. For example, a nonsignificant P value may be interpreted as evidence for no treatment effect and/or no difference between means. However, ES model measures may reveal moderate to large ES and confidence interval values, but the study may have relatively low power. In a replication study with more subjects, power would increase, and thus the likelihood would increase of finding statistically significant results. Effect size model measures, then, provide additional quantitative information that can be used to augment statistical inferences drawn from the outcome of a single study, and, compared with the passive use of "significant" vs "nonsignificant" P values to make decisions concerning treatment effects and/or differences between means, allow for a more active process of reasoning about and interpretation of one's results.
Written informed consent was obtained from all children and parents before testing, and all children and parents participated in a clinical interview related to prenatal, perinatal, and early postnatal periods of development. Ten survivors of childhood ALL (4 boys and 6 girls; age range, 6 years 9 months to 13 years 5 months) were recruited from The University of New Mexico Pediatric Oncology Program, Albuquerque. The present study reflects all available children who underwent a homogeneous 3-year protocol for CNS prophylaxis between 1982 and 1989 and were classified as low-risk, non-T, early B cases. Treatment for ALL over a 3-year period included 4 components: CNS prophylaxis, induction, consolidation, and maintenance. All children received cytosine arabinoside, hydrocortisone, and methotrexate intrathecally for CNS prophylaxis (with no irradiation). Selection criteria for the present study included the following: 1 to 5 years of age at diagnosis, 3 years or more in continuous remission after discontinuation of ALL treatment, no premorbid sensory or motor disabilities, no developmental psychiatric or metabolic disorders or focal brain lesions, and no family history of alcoholism or other drug abuse.
Ten healthy controls (4 boys and 6 girls; age range, 6.0 years to 13.0 years) were selected to match the ALL group in age, sex, and socioeconomic status.17 In several cases, the relatives of ALL subjects were available as control subjects. The selection criteria described above, except for details about the ALL treatment, also applied to the control group. The present report used data from those children who successfully completed neuropsychological testing and magnetic resonance imaging (MRI) of the brain, with images clearly readable for quantitative morphometry.
Magnetic resonance imaging scans of the head were performed with a Siemens 1.5-T imaging system in the New Mexico Veterans Affairs Imaging Center, Albuquerque. Acquired were axial spin echo images using repetition time (TR) of 2500 milliseconds, echo time (TE) of 22 and 90 milliseconds, and 5-mm slice thickness with 0.4-mm interslice gap and sagittal T1-weighed sequence (TR, 40 milliseconds; TE, 6 milliseconds; T1 inversion time, 300 milliseconds; flip angle 40°, 1.5-mm slice thickness and no interslice gap). Coronal images were reconstructed from the 3-dimensional sagittal data set at 1-mm slice thickness with no interslice gap.
No sedation was used, but prior to MRI acquisition each child met with one of us (K.T.C.) for a short relaxation session. To help each child in minimizing head movement while in the magnet, a red dot 1 cm in diameter was placed on the ceiling of the MRI tube. Children were instructed to focus on the dot during scanning, and prepare a short story to tell after the imaging ended. Scanning took about 28 minutes per subject.
Planimetric measurements were performed on commercially available computer hardware (model 7100/66, Apple Computer Inc, Cupertino, Calif; http://www.apple.com/) using the public domain National Institutes of Health program, version 1.52 (developed at the US National Institutes of Health and available on the Internet at http://rsb.info.nih.gov/nih-image/). For all neuroanatomical variables, 3 separate measurements of each region of interest were performed, and a mean area (squared millimeters) from these values was then calculated. Two independent raters performed all neuroanatomical measurements, with an average correlation of 90% interrater reliability and 93% intrarater reliability.
Measurements of anterior vermal lobuli I-V (lingula, central lobule, and culmen) and superior-posterior VI-VII lobuli (declive and folium, including tuber) were performed on the midsagittal image that most clearly showed the cerebral aqueduct and the longest primary and prepyramidal fissures. Vermal measurements were obtained by tracing the boundaries of vermal lobuli I-V, bordered inferiorly by the primary fissure, and lobuli VI-VII, bordered inferiorly by the prepyramidal fissure. Measurement of the pons followed the boundaries of the pontomesencephalic junction, the dorsal tegmentum, the pontomedullary junction, and the ventral basal pons.4,18 The pons, located in the same topographical brain area as the cerebellum, served as a regional anatomical control measure to further investigate our hypothesis that the structures of the developing brain are affected selectively by toxic effects of methotrexate, determined by their individual properties and course of maturation.
To obtain an estimate of left and right prefrontal cortex volume, a consistent method of demarcation was used on the midsagittal slice, and corresponding coordinate values (x, y) were extrapolated to coronal slices. On the midsagittal slice, the posterior boundary (x) was defined as 5 mm anterior to the genu of corpus callosum, and a line through the posterior and anterior comissures was drawn across the medial frontal lobe; this line provided a y-coordinate value for the coronal slices, and served as an inferior boundary. Starting at the coronal slice indicated by the x-coordinate of the midsagittal posterior boundary, and moving anteriorly at 5-mm slice intervals until cortex was no longer visible (8-9 slices), the lateral boundary of cortex was traced downward from the longitudinal fissure until the inferior boundary line was met, then, moving back upward, the remaining cortex was traced. Each coronal mean area measurement was multiplied by 5 mm, and the resulting values were summed to obtain an estimate of volume. Extrapolation from the midsagittal demarcation boundaries to coronal slices indicates that measures of prefrontal association cortex largely comprised Brodmann areas 9, 10, and 46.19Figure 1 shows cerebellar regions of interest, and Figure 2 shows prefrontal regions of interest.
Whole brain volumes (WBVs) were estimated from 10 coronal slices. The first anterior slice where brain tissue initially appeared was determined, and the fourth slice posterior to this point was used as the first WBV slice. The last posterior slice where brain tissue clearly could be identified was determined, and the fourth slice anterior to this point was used as the 10th WBV slice. The remaining slices were spaced at equal intervals between the anterior and posterior slices. The hemispheres, and when apparent the brainstem (bordered inferiorly by the pontomedullary junction), and cerebellum were outlined. Ventricular planimetric areas were subtracted out. From the first slice to the next to last slice, the mean area was multiplied by the distance between it and the next consecutive slice. The resulting values were summed to yield an estimate of total volume.
Central to the purposes of this study were tests that provide information on visual-spatial attention, short-term memory, and visuomotor organization and coordination skills. The tests included Trail-Making Test, parts A and B (TMT-A, TMT-B)20; Rey-Osterreith Complex Figure 1 Test, Copy (CFT-C) and Immediate Recall (CFT-IR)21; and Wechsler Intelligence Scale for Children, Third Edition (WISC-III) Coding.22 (Detailed information on these tests can be found in texts by Lezak23 and Spreen and Straus.24)
The TMT assesses visual-spatial attention, search, and sequencing; visuomotor coordination; and cognitive flexibility.23,24 The test has been shown to load on "rapid visual search" and "visual-spatial sequencing" constructs in factor analysis,25 and to correlate highly with other visual search tests, but not with verbal tests.26 Part A requires the individual to connect consecutively numbered circles and measures the ability to track a single train of numbers. Part B requires individuals to consecutively connect numbered and lettered circles (1-A-2-B, etc), and involves the ability to flexibly switch between sets of stimuli by inhibiting the irrelevant set and attending to the alternative set. Speed is an important indicator of performance, and although errors are recorded, the main measure is the total time taken for task completion.
The CFT-C and CFT-IR provide information on visual-spatial attention, visuomotor and perceptual organization and integration, visual-spatial memory, and planning skills.23,24 A detailed, multi-element geometric design is presented to copy, and immediately afterward the individual is required to reproduce it from short-term memory. The number of correctly reproduced details of the figure is used as a measure of performance.
Finally, coding is a symbol substitution task intended to provide information on attention, visual scanning and tracking, short-term memory, and cognitive flexibility.27 Within WISC-III, coding correlates high with symbol search (a task that involves attention and concentration, perceptual discrimination, short-term memory, cognitive flexibility, and visuomotor coordination) and low with the verbal scale.27 For younger children (6-7 years of age; coding A), the test requires individuals to match and place simple visual symbols (eg, lines, circle) within geometrical figures (eg, triangle, square), and for older children (8-16 years of age; coding B), to match a particular symbol with a particular number. The number of correct pair matches completed within 120 seconds is used as the measure of performance.
Data analyses were performed with SPSS version 6.1 for the computer software program UNIX (SPSS Inc, Chicago, Ill). For each family of analyses, Student 2-tailed t tests for independent samples were computed, and compared with a Bonferroni-adjusted α level. For all analyses, the Levene test for equality of variance was not significant; therefore, t values were based on pooled variance estimates. Reported P values can be considered as providing an index as to whether replication of the present study using a different sample would produce similar significant results in the same direction, ie, one group higher and one group lower on some measure.28,29 Effect size model values also were computed. The d statistic13,14 was used as a measure of ES. Interpretation of ES is based on a convention suggested by Cohen13—0.20 is considered a "small" effect; 0.50, "medium"; and 0.80 or greater, "large." Post hoc power analyses were performed using the d statistic to estimate power from Cohen's tables by linear interpolation.13 With very large ESs, d was used to calculate Cohen's f and used with more extensive tables or to calculate ϕ, and used with Pearson-Hartley charts to estimate power.13,16 In computing and interpreting power, sample ES values are assumed to generalize to the population. Finally, 95% confidence intervals were computed for ES14 and mean differences.
Hierarchical regression analysis30 and partial correlation31 techniques were used to explore the hypothesis that a cerebellar-frontal subsystem may be involved in the processes of visual-spatial attention, and visuomotor organization and coordination. A representative measure, Frontal-Spatial (F-S), was formed by combining mean performance on the present neuropsychological tests, which assess functions in the visual-spatial-motor domain. Our composite label F-S was chosen to distinguish it from other functions that may be associated with frontal subsystems, eg, frontal-verbal. The subsystem model assumed that lobuli VI-VII development would relatively precede extended prefrontal cortical development; therefore, the lobuli VI-VII variable was entered first in all analyses. Proportion of variance accounted for (R2), expressed as a percentage, was used as a measure of association between the neuroanatomical variables and F-S. To obtain a measure of the proportion of variance in F-S accounted for by the interaction between the neuroanatomical variables, partial correlation analyses were performed between F-S and the product of lobuli VI-VII and either the left or right prefrontal measure, controlling simultaneously for both separate lobuli VI-VII and the respective prefrontal variable.31 The result can be interpreted as the amount of variance in F-S accounted for solely by the interaction of lobuli VI-VII and prefrontal variables.31
Table 1 presents demographic data for the 2 groups. To determine whether the ALL and control groups were adequately matched, a multivariate analysis of variance was conducted on the differences between the 2 groups with respect to age, handedness, sex, and socioeconomic status.17 The main effect of group was not significant (P=.94).
Table 2 presents morphometric measures. Results for each family of variables were compared with a Bonferroni-adjusted α level (B-α) of .05/2=.03. First, for cerebellar vermis, the lower ALL group mean for cerebellar lobuli VI-VII was significant (P=.02, d=1.14, power approximately 0.67). In contrast, the lower ALL group mean for lobuli I-V was not significant (P=.14, d=0.68, power approximately 0.30). Second, for left and right prefrontal lobe volume, the lower ALL group mean for left prefrontal cortex was significant (P=.005, d=1.41, power approximately 0.84). Similarly, the lower ALL group mean for right prefrontal cortex was significant (P=.02, d=1.12, power approximately 0.65). Pons and WBV data were analyzed. Complete WBV data were not available for 2 control subjects, and were not included in the analysis. Group differences for the pons were not significant (P=.28, d=0.49, power approximately 0.18). Likewise, WBV differences were not significant (P=.74, d=0.16, power approximately 0.06).
Table 3 shows the neuropsychological results. Test raw scores were converted to standard z scores using published test norms. Data not available were TMT-A for 1 subject with ALL, TMT-B for 1 control subject and 1 ALL subject, and CFT-C for 1 control subject, and therefore were not included in their respective analyses. Results were compared with B-α=.01. For TMT-B, lower mean performance by the ALL group was significant (P=.009, d=1.40, power approximately 0.80). Lower mean performance by the ALL group approached significance for CFT-IR (P=.03, d=1.07, power approximately 0.63) and WISC-III Coding (P=.02, d=1.12, power approximately 0.67). Lower ALL group performance was not significant for CFT-C (P=.05, d=0.96, power approximately 0.53) and for TMT-A (P=.99, d=0.005, power <0.05).
Table 4 shows that neuropsychological tests and F-S were highly correlated with one another, suggesting that F-S may generally represent one functional construct. Table 5 shows the correlation matrices for neuroanatomical measures and F-S. For the ALL group, the linear shared proportions of variance (r2) between the neuroanatomical measures and F-S were nearly zero. However, examination of scatterplots and curve-fitting analysis suggested that a cubic polynomial provided a better description of these data. Polynomial regression analysis indicated that the lobuli VI-VII combined linear, quadratic, and cubic components (its compound variable)32 accounted for about 17% of F-S variance, the compound left prefrontal measure accounted for about 14%, and the compound right prefrontal measure accounted for about 26%. The combination of lobuli VI-VII and left prefrontal compound variables accounted for about 39% of F-S variance, and lobuli VI-VII and right prefrontal accounted for about 56%. The percentages of F-S variance accounted for by combination and interaction of compound lobuli VI-VII and left (LF) and right (RF) prefrontal cortex measures for both groups of subjects are shown in the tabulation below.
For the control group, it was found that a cubic polynomial also provided a better description of the data. The lobuli VI-VII compound variable accounted for about 44% of F-S variance, the compound left prefrontal measure accounted for about 93%, and the compound right prefrontal measure accounted for about 48%. The combination of lobuli VI-VII and left prefrontal compound variables accounted for about 99% of F-S variance, and lobuli VI-VII and right prefrontal accounted for about 79%.
For the ALL group, the lobuli VI-VII and left prefrontal interaction accounted for about 20% of F-S variance; the lobuli VI-VII and right prefrontal interaction accounted for less than 1%. For the control group, the lobuli VI-VII and left prefrontal interaction accounted for about 41% of F-S variance; the lobuli VI-VII and right prefrontal interaction accounted for about 21%.
The present study found that in children treated for ALL with chemotherapy only before the age of 5 years, the posterior cerebellar vermis (lobuli VI-VII) and left and right prefrontal association cortices were significantly reduced, concurrent with neuropsychological deficits in visual-spatial attention, short-term memory, and visuomotor organization and coordination. It should be emphasized that one aim of the present study was to explore whether these data are consistent with the hypothesis that the later developing cerebellar lobuli VI-VII and prefrontal cortices may be particularly vulnerable to the effects of intrathecal methotrexate, and may comprise an interrelated and interacting subsystem, which may produce an effect different from that of either structure alone.12 The present subsystem analyses using multiple regression and partial correlation techniques yielded results that are compatible with this hypothesis; for both groups, the combination of both lobuli VI-VII and prefrontal cortex measures accounted for substantially greater amounts of F-S function variance than either variable alone. Except for the ALL group lobuli VI-VII and right prefrontal cortex, the interaction of lobuli VI-VII and prefrontal cortex measures accounted for notable amounts of F-S variance. These results suggest that a cerebellar-frontal deficit is involved in brain sequelae of ALL survivors, and provide further evidence indicating a cerebellar-frontal relationship to cognitive deficits, including deficits in visual-spatial attention and memory tasks.8,33- 35
The neurogenesis and migration of granule cells in the human cerebellum continue through the first several years of life, causing an increase in the size of this structure. It is essential for the normal development of the cerebellum that during the early postnatal period there is normal structural and functional interactions of granule cells with Purkinje cells and radial glial fibers.36,37 Early disruption of this interaction by toxic insult, such as irradiation and/or chemotherapy, may inhibit the development of interactions between granule and Purkinje cells, and result in cerebellar hypoplasia and atrophy.38
A study in monkeys documented the transport of herpes simplex virus from prefrontal cortex to the lateral cerebellum, demonstrating a prefrontal-cerebellar connection.39,40 In humans, results of a functional MRI study have shown that neural systems including the cerebellum and dorsolateral prefrontal cortex are involved in cognitive reasoning tasks.41 An abnormally developing cerebellum, then, may contribute to deficits in higher cognitive functions, due to an anatomical and functional cerebellar-frontal association.42,43
Our developmental chronometry hypothesis5 posits that as the neocerebellum and frontal areas are relatively later-developing structures of the brain, a neocerebellar-frontal subsystem with a slow rate of maturation may have prolonged vulnerability to damage, and as such may be a common nonspecific site of abnormality in many disorders of childhood. Toxic insult to this subsystem in young children, when the processes of development are prolific, may manifest itself later in ontogeny as structural, functional, and interconnection abnormalities. Although our present observations suggest that cerebellar-frontal subsystem abnormality may be common among survivors of ALL, this may not necessarily be a specific diagnostic marker for this population; cerebellar-frontal brain subsystem deficits may occur in the course of the developing brain of any child following substantial genetic, viral, toxic, or traumatic insults.
Although ALL survivors show some deficits in verbal tasks, more severe deficits are noted in visual-spatial orientation, visuomotor coordination, spatial memory, and arithmetic,44 particularly in children treated before 5 years of age. Such deficits have been interpreted as reflecting abnormalities in the right hemisphere and in white matter integrity.45 These suggestions are derived from neurobehavioral data, and do not yet have direct empirical support from neuroanatomical data. For the ALL group in the present study, the neuropsychological profile of deficits shows some similarities to findings that others have related to right brain deficits.46 Although frontal measures for both right and left hemispheres were abnormal, the left hemisphere appears to be more affected (d=1.41, mean difference=43.59) than the right (d=1.12, mean difference=38.61). The left anterior brain has been related to processing sequentially presented stimuli and to programming rapid motor sequence responses.47,48 The present tests, particularly TMT and Coding, strongly require sequencing ability. Given that sequencing abnormalities in the ALL population have been reported,11 and that most children surviving ALL are considerably more deficient in the visual-spatial rather than the verbal domain,44 the observed left brain deficit may be more strongly related to poor sequencing in the visual-spatial domain, rather than to poor verbal abilities.
Pragmatically, the question of interest of the present study is whether the experimental results make it reasonable to conclude that if children are treated with chemotherapy before the age of 5 years, it is likely that they will subsequently demonstrate deficits in areas vital to normal growth and development, thus requiring special remediation in those areas. Our findings include some nonsignificant P values regarding the answer to this question; however, it does not necessarily follow that the conditions have been met to confidently conclude that deleterious treatment effects are not actually occurring in the population. Effect sizes and confidence intervals for group differences in neuropsychological performance (Table 3) indicate that potentially large and clinically important differences may actually exist in the population, but may not have been detected at a level of significance in this particular study due to low sample size and subsequent low to moderate statistical power. Based, then, on additional information from ES model values, the present data can reasonably be interpreted as providing persuasive evidence that intrathecal methotrexate treatment of children before the age of 5 years has both structural and functional effects on the developing brain. Because frontal and other cortical areas of the developing brain reflect a dynamic and changing system, with the potential for functional reorganization at least into late childhood (for review, see Joseph49), neuropsychological-cognitive rehabilitation should be provided as early as possible after completion, or near completion,50 of the treatment protocol to optimize the developmental potential of frontal functions in survivors of childhood ALL.
Accepted for publication April 20, 1998.
We thank William Orrison, MD, and the staff of the New Mexico Veterans Affairs Imaging Center for their help in completing the brain imaging part of this study; staff from the Department of Pediatrics, University of New Mexico Hospital for referring children treated for ALL; Richard J. Harris, PhD, for comments on the final version of the manuscript; and Barbara Brooks and Dina Hill for help in psychometric testing.
Comments on the effect size model can be addressed to Paul G. Lesnik, MS, at e-mail: firstname.lastname@example.org.
Reprints: Kristina T. Ciesielski, PhD, Clinical Neuroscience Laboratory, Department of Psychology, The University of New Mexico, Albuquerque, NM 87131 (e-mail: email@example.com).