Regions of interest in 9 axial slices on the right hemisphere. The analogous regions of interest in the left hemisphere were obtained by applying the mirror image of the map of regions of interest. The number appearing in each region of interest refers to the Brodmann areas of that region in Table 1.
Composite activation maps based on median pixel values for the Word Serial Position Task (WSPT) in 3 averaged anatomical slices for the control (n=12) and the patient (n=12) groups, illustrating the hypoperfusion in the left inferior frontal gyrus in the schizophrenic group. The bar charts (with SE bars) represent pixel density in the regions of interest corresponding to Brodmann area inferior lateral 6, area 44, and area 45. Reduced activation in the superior temporal gyri is also evident. Threshold for pixels was set at t-value=0.2 and a cluster filter of 10 pixels. Approximate Talairach z coordinate is given for each slice. TSPT indicates Tone Serial Position Task; R Inf, right inferior; and L Inf, left inferior.
Composite activation maps during the Tone Serial Position Task (TSPT) in 2 slices from the control group and schizophrenic group showing the failure of activation in medial area 6. The bar chart (with SE bars) illustrates the hypoactivation in this region that appeared in the patient group only during the TSPT. Both groups showed above-threshold activation in this region during the Word Serial Position Task (WSPT). R Med indicates right medial; L Med, left medial.
Stevens AA, Goldman-Rakic PS, Gore JC, Fulbright RK, Wexler BE. Cortical Dysfunction in Schizophrenia During Auditory Word and Tone Working Memory Demonstrated by Functional Magnetic Resonance Imaging. Arch Gen Psychiatry. 1998;55(12):1097-1103. doi:10.1001/archpsyc.55.12.1097
Copyright 1998 American Medical Association. All Rights Reserved. Applicable FARS/DFARS Restrictions Apply to Government Use.1998
Verbal learning and memory deficits are among the most severe cognitive deficits observed in schizophrenia. We have demonstrated that such deficits do not extend to working memory for tones in a substantial number of patients even when verbal working memory is impaired. In this study we used functional magnetic resonance imaging to study the neural basis of this dissociation of auditory verbal and nonverbal working memory in individuals with schizophrenia.
While undergoing functional magnetic resonance imaging, 12 schizophrenic patients and 12 matched control subjects performed auditory Word Serial Position Task and Tone Serial Position Task.
Both tasks produced activation in frontal cortex and temporal and parietal lobes of the cerebrum in both groups. While robust activation was observed in the left inferior frontal gyrus (areas 6, 44, and 45) in the control group during the Word Serial Position Task, activation in the patient group was much reduced in these areas and failed to show the same task-specific activation as in controls. Reduced activation in patients was not confined to the inferior frontal gyrus, but also extended to a medial area during the Tone Serial Position Task and to premotor and anterior temporal lobe areas during both tasks.
These findings support the hypothesis that abnormalities in cortical hemodynamic response in the inferior frontal gyrus underlie the verbal working memory deficit in schizophrenia. The relationship of verbal working memory deficits to other cognitive functions suggests that abnormal functioning in the speech-related areas may reflect a critical substrate of a broad range of cognitive dysfunctions associated with schizophrenia.
ALTHOUGH schizophrenia disrupts a broad range of perceptual, sensorimotor, and cognitive functions, tasks assessing verbal learning and memory appear to be particularly disturbed.1- 6 In our companion article,7 we report that auditory verbal working memory is selectively impaired in schizophrenic patients with basic perceptual and attentional competence while working memory for nonverbal auditory stimuli (ie, tones) remains intact. This suggests that the neural systems for processing auditory verbal and nonverbal material are segregated at some level and that the former can be selectively impaired in schizophrenia.
Functional imaging studies of verbal working memory tasks consistently report activation in the inferior and middle frontal cortex, and in posterior temporoparietal areas in control subjects,8- 12 results consistent with reports that verbal deficits are associated with left inferior frontal and posterior temporal lesions.13 In contrast, imaging work suggests that tonal memory involves cortical and subcortical areas partially lateralized to the right hemisphere,14 consistent with clinical studies demonstrating right temporal and frontal lobe involvement in pitch discrimination and memory.15- 17
Although there are no published imaging studies of auditory verbal working memory per se in schizophrenia, a number of studies have reported frontal and temporal lobe abnormalities during word recognition,18 word generation,19- 22 and auditory selective attention.23 In addition, several imaging studies have reported decreased frontal lobe activation in schizophrenic patients during performance of neuropsychological tasks sensitive to frontal lobe lesions.23- 26
The present study sought to examine cortical hemodynamic activity in schizophrenia during a verbal working memory task as well as during a nonverbal working memory task that patients performed normally.
All subjects provided informed consent before the study. The patient group consisted of 14 medicated outpatients (8 men) who met Structured Clinical Interview for DSM-III-R27 criteria for schizophrenia. Symptom scores (mean±SD) on the Positive and Negative Symptom Scale were as follows: positive, 16.1±6.0; negative, 17.2±4.5; general, 32.1±9.0; and total, 65.0±18.4.
Fourteen subjects (8 men) without history of psychiatric disorder served as controls. None was taking medications that affect the central nervous system. All patients and control subjects were right-hand dominant, as assessed by writing and performing at least 5 of 7 daily activities with their right hand. Subjects with histories of neurologic disorder or trauma, or substance abuse within the last 6 months, were excluded from the study. As assessed by Student t test (P<.05), the control and schizophrenia groups did not differ in age (controls, 36.8±11.8 years; patients, 39.6±8.6 years), years of education (controls, 13.9±1.1 years; patients, 14.5±1.9 years), or parent education (controls, 13.5±2.3 years; patients, 13.0±3.2 years). All subjects scored at least 90% correct on a screening test of auditory pitch discrimination, described in our companion article,7 assuring that all subjects could discriminate the tonal stimuli used in the Tone Serial Position Task (TSPT).
Subjects were trained on the auditory Word Serial Position Task (WSPT) and TSPT before scanning as described in our companion article.7 In the WSPT, subjects were presented with lists of 4 words followed by a 9-second retention interval. One of the words was then repeated and subjects indicated its position (first, second, third, or fourth) in the list by briefly extending the corresponding number of fingers on their left hand. The TSPT was procedurally identical to the WSPT but consisted of lists of 3 tones instead of 4 words. Tones within each list differed from the next closest tone by a frequency ratio of 0.85 (eg, 500 Hz, 425 Hz, 361 Hz). Fewer tones than words were used in lists to improve the overall level of performance of the task.
Scanning took place in a 1.5-T Signa magnetic resonance imaging system (General Electric Corp, Milwaukee, Wis) equipped with echo planar imaging hardware (Advanced NMR Inc, Wilmington, Mass). Anatomical localizer scans were acquired in the sagittal plane, using conventional T1-weighted spin-echo sequence: echo time, 11 milliseconds; repetition time, 667 milliseconds; field of view, 24 cm2; slice gap, 0 mm; acquisition matrix, 256×128 with 1 excitation; and slice thickness, 5 mm. Ten T1-weighted oblique axial slices were acquired with the following imaging sequence: echo time, 13 milliseconds; repetition time, 500 milliseconds; field of view, 40 cm2; acquisition matrix, 256×192 with 2 excitations; and slice thickness, 9 mm. Slice orientation paralleled the plane transecting the anterior and posterior commissures, and the interslice distance was varied by subject to maintain the same location of slices in all subjects.
Functional images were acquired with an echo planar imaging gradient echo sequence: flip angle, 60°; echo time, 45 milliseconds; repetition time, 1500 milliseconds; field of view, 40×20 cm; acquisition matrix, 128×64 with 1 excitation; and voxel volume, 9×3.125×3.125 mm. Eighty images per slice were acquired during each scan.
Image processing was performed with statistical programs written in MATLAB 4.0 (Math Works, Inc, Natick, Mass). All images were screened for motion and other artifacts and contaminated images were eliminated. Subjects were eliminated if movement of the center of mass of activation was greater than 1 pixel across the session, or greater than 0.5 pixel within a scan. Two controls and 2 patients were eliminated because of failure to meet the above criteria, yielding the final sample of 24.
Subjects underwent 2 scans for each task. Computer-generated stimuli were presented via shielded speakers, through air conductance tubes, to headphones, which provided external noise reduction. The volume was adjusted to the comfort of each subject. A WSPT scan consisted of 4 trials each separated from the next by a 14-second baseline interval. The same procedure was used with the TSPT, except that 5 trials were conducted during each TSPT scan separated by 12-second intertrial intervals. The 6 images acquired during the 9-second retention interval preceding the target and the 6 images acquired during the final 9 seconds of the baseline period were compared in the data analysis.
After coregistration of the anatomical images and echo planar imaging functional maps from individual subjects, the images were transformed into the standard 3-dimensional coordinate system of Talairach and Tournoux.28,29 Slices were interpolated into the 10 most superior slices of Talairach space, centered at z=+69, +60, +51, +42, +33, +23, +14, +5, −5, and −16. The slices covered the entire cerebrum except for a portion of the anterior and inferior aspect of the temporal pole. Thirty regions of interest (ROIs) in each hemisphere were defined within the Talairach space. The ROIs were constructed from rectangular volumes (8×8×9 mm) and corresponded to Brodmann areas (Figure 1).
A 2-step analysis was carried out to calculate task-related activation in each individual subject; the values obtained were then used as the dependent measures in repeated-measures analysis of variance. First, t-values were calculated for each pixel during each task by comparing the 6 images acquired during the retention intervals with the last 6 images acquired during the baseline periods.28 A criterion for activation was set at a t-value greater than 1 on both scans of a task, with a minimum cluster size of 4 contiguous pixels meeting this same t-value threshold. A pixel was assigned the lesser t-value of the 2 scans for each task. The t-value criteria were selected because lower thresholds included activation in ventricles and white matter, while higher values eliminated all but a few pixels in some subjects.
Next we calculated the active pixel density in all ROI volumes in each brain for each task. First the number of active pixels in each ROI was divided by the total number of pixels contained in that ROI to adjust for differences in the sizes of the ROIs. Individual differences in global activation were removed by dividing the above ratio by the ratio of the total number of active pixels in the brain to the total number of pixels in the brain. For each ROI, an active pixel density of 1 is equivalent to the mean density of active pixels for the whole brain. To identify regions that showed clear task-related activation, an ROI was retained for further analysis if its mean pixel density was greater than 2 (twice the density of the whole brain) in at least 1 of the groups during 1 of the tasks.
The statistical significance of regional activation differences between groups was evaluated by subjecting pixel density scores for retained ROIs to 2×2×2 repeated-measures analysis of variance with group (patients vs controls) as a between-subject factor and task (WSPT vs TSPT) and hemisphere (left vs right) as within-subject factors. A minimum significance value of .05 was used throughout.
The pattern of behavioral deficits on the WSPT and sparing of TSPT performance reported in the larger sample1 was confirmed in the present sample. On the prescan WSPT test, the patient group performed at a mean of 85% correct, significantly worse than the 94% mean accuracy among the controls (F1,22=5.718; P=.03). Performance in the scanner was excellent for both groups on the limited number of WSPT trials, on which the patient group and control group performed at 91% and 95% mean accuracy, respectively (F1,22=0.88; P=.36). In training before scanning, the patient group averaged 68% correct on the TSPT compared with 71% for the controls (F1,22=0.217; P=.64). During scanning, the groups averaged 69% and 75%, respectively (F1,22=0.33; P=.59). Equivalent levels of performance by patients and controls during scanning rule out a failure by the patient group to attend to the stimuli or other extraneous factors as the basis of activation differences between groups.
Prescan group differences on the WSPT task were mirrored by cortical hemodynamic differences during WSPT scans. A cluster of ROIs in the left inferior frontal gyrus was preferentially activated by the WSPT task in the controls but not in patients (Figure 2). Analyses of variance confirmed these observations (Table 1). The activation centroids are presented in Table 2. The WSPT produced significantly greater activation than the TSPT in area 44 and inferior lateral area 6. Interactions between group and task in these 2 areas, as well as in area 45, indicated that increased activation was specific to the controls. The WSPT also produced significant effects of hemisphere, reflecting left lateralization of activation in area 44 (P<.004) and lateral inferior area 6 (P=.03). Similarly, in area 45, a 3-way interaction emerged between group, task, and hemisphere. Follow-up analyses of variance indicated that only the control group produced significantly greater activation during WSPT than during the TSPT in area 44 (F1,11=14.087; P<.004), area 45 (F1,11=7.642; P=.02), and inferior lateral area 6 (F1,11=11.227; P<.007) (Figure 2).
The TSPT task produced significantly greater activation than the WSPT in area 10 and medial area 6 in the frontal lobes, although the greater activation during the TSPT in medial area 6 appeared only in controls (Table 1). Separate analyses of the groups confirmed that controls produced greater activation during the TSPT than during the WSPT (F1,11=8.917; P<.01) in medial area 6, while the patients showed no such increase (F1,11=0.297; P=.59) (Figure 3).
Finally, a main effect of group appeared in lateral area 6, which was activated by both the WSPT and TSPT tasks in the control group to a greater degree than in the patient group.
The correlation between WSPT performance outside the scanner was compared with activation in ROIs that showed task×hemisphere interactions reflecting greater activation during the WSPT. Activation in left area 44 correlated with WSPT performance (Pearson r=0.45; P<.02), as did inferior lateral area 6 (Pearson r=0.49; P<.02) in the overall sample. Correlations in patients and controls separately were of equivalent magnitude. The correlation between left medial area 6 activation during the TSPT scan and TSPT behavioral performance was not significant (Pearson r=0.15; P=.49).
Both the WSPT and TSPT produced substantial above-threshold activation throughout the superior temporal gyrus. Unlike the frontal lobe, temporal cortical areas did not show significant group×task interactions (Table 1). The WSPT produced significantly greater pixel density than the TSPT throughout the superior temporal gyrus in anterior and posterior area 22 as well as in the primary auditory areas (41/42) (Figure 2). The control group had somewhat higher pixel densities than patients throughout the superior temporal gyrus during both tasks, but group differences reached significance only in anterior area 22. Hemispheric effects of task were confined to area 41/42, reflecting a relative right hemisphere lateralization of activation in this area during the TSPT and a greater pixel density in the left hemisphere during the WSPT. No other effects in the temporal lobe ROIs reached significance.
Group differences failed to appear in the parietal regions, although task-specific activation was present in area 7. A significant interaction of task and hemisphere reflected greater pixel density in the right hemisphere during the TSPT relative to the WSPT, while there was comparable pixel density in the left hemisphere during both tasks (Table 1). As expected, no ROI in the occipital lobe approached the pixel density threshold of 2 for inclusion in the analyses.
The present study indicates that areas in left inferior frontal cortex that are active during verbal working memory tasks in normal subjects9,12,30,31 are dysfunctional in schizophrenic patients who have specific performance deficits in verbal but not nonverbal working memory.1 The WSPT-specific hypoactivity in the left inferior frontal cortex of the schizophrenic group suggests that these language-related regions may be critically related to their verbal working memory deficit. This assertion is supported by the significant correlation between 2 of these ROIs (left areas 44 and inferior lateral 6) and performance on the WSPT assessed before scanning. It is not possible in this study to determine whether the marked failure of activation in the left inferior frontal gyrus during the WSPT scans was caused by abnormalities in the inferior frontal gyrus, failure by patients to engage these areas during the WSPT task because of strategic factors, or a result of failures at earlier stages of processing. Data from previous positron emission tomographic imaging studies of word generation tasks reported comparable activation in patient and control groups in the left inferior frontal gyrus when matched for rate of production, but abnormalities in superior temporal gyrus.19,20 Thus, under conditions of lexical retrieval, the frontal speech areas appeared normal, but in the present study, which emphasizes encoding and maintenance, these areas appear dysfunctional. It is important that we failed to observe activation in area 46, a site commonly reported active in other studies of working memory. This may reflect differences in stimuli used in the WSPT and other tasks, and, in addition, the heavy reliance on covert rehearsal during the WSPT, which would favor more inferior and specifically speech-related regions as compared with the more commonly used "N-back" task.12
Other cortical areas also showed signs of abnormal function, including areas activated by the behaviorally spared TSPT. The TSPT provided an opportunity to assess cortical hemodynamic abnormalities on a task that is not impaired in these schizophrenic patients.1 The 2 areas of TSPT-related activation were in the anterior frontal lobe (area 10) and in the supplementary motor and premotor area of the superior frontal gyrus (medial area 6). Connections between area 10 and the superior temporal gyrus have been observed in the rhesus monkey,32 but activation in area 10 has not been reported in previous functional imaging studies of auditory processing in humans.33 In medial area 6, the patients failed to produce increased activation during the TSPT scans, as seen in controls, but showed comparable activation during the WSPT scans. Performance of the TSPT task was not impaired either during scanning or on extensive prescan testing. Therefore, the decreased hemodynamic response in the patient group in medial area 6 during the TSPT cannot be attributed to a performance deficit, as appeared to be the case during the WSPT in the inferior frontal gyrus. Similar cases of hypoperfusion and spared behavior have been previously reported in schizophrenia.34
The schizophrenia group consistently produced less temporal lobe activation than the controls during both tasks, although a group difference reached statistical significance only in anterior area 22 bilaterally. These results suggest that, while abnormalities are present in temporal lobe regions in schizophrenia,35- 39 they may not compromise auditory perception and memory during processing of relatively simple stimuli (ie, tones), as demonstrated by the sparing of the TSPT task. The verbal memory processes, which are widely considered essential for comprehension and reasoning,40,41 may rely more heavily on the frontal areas activated in the control subjects of the present study. Given that the WSPT deficit suggests that verbal encoding and maintenance are disrupted in the patients, the inferior frontal areas may also be critical for these processes. However, these patients were selected on the basis of their relatively good attention, and therefore the results must be generalized with caution. In addition, further understanding of the neural systems underlying the distinctions between verbal and auditory processing will be necessary to bear out the present interpretations.
Accepted for publication August 7, 1998.
This study was supported by a National Alliance for Schizophrenia and Depression Young Investigator Award (Dr Stevens) and a Research Scientist Development Award, National Institute of Mental Health, Rockville, Md (Dr Wexler).
Reprints: Alexander A. Stevens, PhD, Department of Psychiatry, CR139, Oregon Health Sciences University, 3181 SW Sam Jackson Park Rd, Portland, OR 97201-3098 (e-mail: firstname.lastname@example.org).