Likeman M, Anderson VM, Stevens JM, Waldman AD, Godbolt AK, Frost C, Rossor MN, Fox NC. Visual Assessment of Atrophy on Magnetic Resonance Imaging in the Diagnosis of Pathologically Confirmed Young-Onset Dementias. Arch Neurol. 2005;62(9):1410-1415. doi:10.1001/archneur.62.9.1410
Copyright 2005 American Medical Association. All Rights Reserved. Applicable FARS/DFARS Restrictions Apply to Government Use.2005
To investigate the diagnostic accuracy of visual inspection of magnetic resonance imaging (MRI) in a range of pathologically confirmed diseases causing young-onset dementia and to assess the sensitivity and specificity of atrophy patterns for Alzheimer disease (AD) and frontotemporal lobar degeneration (FTLD).
Sixty-two patients with pathologically confirmed diseases that may present as young-onset dementia were selected from a biopsy and postmortem series. The first diagnostic T1-weighted volumetric MRI was obtained for each patient, together with images from 22 healthy control subjects. All MRIs were assessed for regional atrophy independently by 3 neuroradiologists, blinded to all clinical details except age. Observers were also asked to use their clinical judgment to form a diagnosis.
Eighty-seven percent of dementia cases were distinguished from controls after visual inspection of MRI, and a correct pathologically confirmed diagnosis was given in 58% of cases. Hippocampal atrophy was noted in 92% of AD cases but was commonly seen in other dementias and controls. A bilateral symmetrical pattern of hippocampal atrophy discriminated AD from FTLD with 47% specificity, while posterior greater than anterior gradient of atrophy was 92% specific for AD. Atrophy of the anterior, inferior, and lateral temporal lobes was suggestive of FTLD pathology (≥90% sensitivity), while anterior greater than posterior gradient of atrophy and hemispheric asymmetry of atrophy were each at least 85% specific for FTLD.
Despite variation and overlap of atrophy patterns, visual inspection of regional atrophy on MRI may aid in discriminating AD and FTLD.
Early and accurate differential diagnosis of dementing diseases is increasingly important to guide treatment and provide prognostic information and care. Alzheimer disease (AD) and frontotemporal lobar degeneration (FTLD) are 2 of the most common causes of young-onset (<65 years) dementia,1 although clinical diagnosis in these conditions at an early stage can be problematic. Definitive diagnosis requires histopathologic examination of brain tissue. In AD, this involves confirmation of sufficient numbers of amyloid plaques and neurofibrillary tangles, while several pathologic processes may underlie a clinical diagnosis of FTLD.2,3 Examination of brain tissue is rarely performed in life, however, and diagnosis has traditionally relied on clinical assessment and established diagnostic criteria supported by neuroimaging.4,5 Although investigations have found that National Institute of Neurological and Communicative Disorders and Stroke–Alzheimer’s Disease and Related Disorders Association criteria6 are sensitive to discriminating pathologically confirmed AD, specificity is often poor, because patients with FTLD and other dementias may also fulfill AD clinical criteria.7,8 Magnetic resonance imaging (MRI) aids in the differential diagnosis of these diseases through exclusion of alternative diagnoses and supports a clinical diagnosis through detection of different atrophy patterns.9 Pathological and imaging studies10- 13 have shown AD to be associated with bilateral hippocampal atrophy and diffuse symmetrical atrophy, while frontotemporal lobar atrophy with left to right asymmetry on imaging in part supports a clinical diagnosis of FTLD.14
Many previous methods of evaluating atrophy patterns on MRI in patients with dementia involve time-consuming or sophisticated techniques, such as manual outlining of regions, voxel compression mapping, or voxel-based morphometry. Although these provide objective measures of atrophy, they are largely limited to research groups in specialist centers. Visual inspection of MRIs by an experienced radiologist is the method of evaluation commonly used in clinical practice. Discrimination of patients with dementia from control subjects on the basis of visual assessment of global atrophy is subject to wide interrater variation,15 but rating of regional volume changes may be more useful.16 Few neuroimaging studies have addressed the diagnostic power of visual inspection of MRIs to differentiate between causes of dementia in pathologically confirmed cases.16- 20 Moreover, most studies do not include a wide mix of causes of cognitive impairment, a more realistic model of clinical practice. The differential diagnosis in early-onset dementia is particularly difficult because AD accounts for a smaller proportion of cases in younger populations. The present study evaluated visual inspection of MRIs in the diagnosis of patients with a range of pathologically confirmed diseases causing young-onset dementia and, particularly, in the discrimination of patients with AD or FTLD.
This study included 62 patients (39 men and 23 women), who had fulfilled criteria for dementia21 in life and had pathological confirmation of disease, and 22 healthy control subjects (11 men and 11 women). Patients with dementia were recruited from the postmortem (50 cases) and biopsy (12 cases) series of the Dementia Research Centre, London, England. Consent had been obtained for biopsy or brain donation for research. The dementia group consisted of 25 cases with AD, 17 with FTLD, and 20 with other dementias (8 with prion disease; 5 with vascular dementia, including 2 with CADASIL [cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy]; 3 with progressive supranuclear palsy; 3 with corticobasal degeneration; and 1 with multiple system atrophy). The mean age of patients in each group was younger than 60 years. Although some patients had mixed pathology, the primary pathology was used to group patients, and there were no cases of mixed AD and FTLD. The AD group included 4 cases with pure or mixed dementia with Lewy bodies. Although dementia with Lewy bodies is neuropathologically distinct from AD, MRI was not expected to distinguish cases because of the similarity of atrophy patterns and frequent co-occurrence of AD in patients with dementia with Lewy bodies.17 The control group consisted of healthy volunteers without cognitive impairments and with Mini-Mental State Examination22 (MMSE) scores of 28 or higher of a possible 30. Demographic and clinical characteristics are summarized in Table 1.
Patients with dementia had T1-weighted volumetric MRI performed as part of a diagnostic workup when they presented with cognitive complaints to the National Hospital for Neurology and Neurosurgery. Control subjects had T1-weighted volumetric imaging performed as part of longitudinal research studies. Images were acquired on 1.5-T MRI scanners (Horizon Echospeed, version 5.7 or LX; GE Medical Systems, Milwaukee, Wis) as part of clinical imaging protocols. All were acquired in the coronal plane using spoiled gradient echo techniques (spoiled gradient recalled echo and magnetization-prepared rapid gradient echo) and single excitation using a 24-cm field of view to give 124 contiguous 1.5-mm sections. Other acquisition parameters varied depending on the scanner and clinical protocol used, with a repetition time between 13 and 35 milliseconds, an echo time between 4 and 9 milliseconds, and a matrix size of 256 × 256, 200 × 200, or 180 × 180 pixels.
Magnetic resonance images were viewed on a Sun workstation (Sun Microsystems, Inc, Santa Clara, Calif) using software that allows images to be viewed in 3 orthogonal planes.23 Images were assessed for atrophy in random order and independently by 3 experienced neuroradiologists (M.L., J.M.S., and A.D.W.) who were blinded to all clinical details except age at the time of MRI. To assess intraobserver reliability, 15 cases (chosen arbitrarily by a fourth observer [V.M.A.]) appeared twice in the series, making a total of 99 scan assessments for each observer. Only the first assessment of these 15 cases was included in the pathological-radiological analysis. Assessments of whole-brain gradient (anterior-posterior) and asymmetry (left-right hemispheric) of atrophy were made. The frontal lobes were assessed for the presence or absence of atrophy, while 4 regions of the left and right temporal lobes were rated using a modification of the rating scales by Scheltens18 and Galton16 and their colleagues. Hippocampal atrophy was rated when visualized in the coronal plane at the level of the hippocampal head on a scale of 0 to 4 (0, none; 1, minimal; 2, mild; 3, moderate; and 4, severe). Anterior temporal lobe (ATL) or amygdala, lateral temporal gyri (LTG) or fusiform gyrus, and parahippocampal gyrus atrophy were assessed on a scale of 0 to 3 (0, none; 1, mild; 2, moderate; and 3, severe). Each observer formed a diagnosis for each patient, based on regional atrophy patterns. Observers knew the possible diagnoses but did not know the distribution of cases among them. A category of “nonspecific changes” was included for imaging results that were abnormal but that could not be placed in a specific diagnostic category. The FTLD category consisted of several associated pathologic substrates, including Pick disease,24 dementia lacking distinctive histology,25 and dementia with ubiquitin-positive tau-negative inclusions26 (motor neuron disease–inclusion dementia27). However, histologic findings do not necessarily predict clinicoradiological features in individual cases, and diagnoses were based on 3 clinical subtypes (frontal lobe dementia [FLD], primary progressive nonfluent aphasia, and semantic dementia).14
Characteristics of the 4 subject groups at the time of MRI were compared by means of pairwise t tests. Intraobserver and interobserver agreements were expressed as percentages. κ Statistics were calculated to correct for “chance” agreements. Using the pathologically confirmed diagnosis as the gold standard, the percentage of cases correctly classified by each observer after MRI assessment was calculated, and the mean was reported together with the between-observer standard deviation. The sensitivity and specificity of atrophy patterns for pathologically confirmed AD and FTLD were calculated. Although observers were asked to rate multiple features at image assessment, the way in which these were weighted to reach a radiological diagnosis was left to their clinical judgment. To identify independent predictors of disease, multiple logistic regression analysis was used to relate dichotomous outcomes for AD (AD vs controls, AD vs all other cases, and AD vs FTLD) to atrophy rating scores. Bootstrap confidence intervals, incorporating allowance for clustering of diagnoses for the same subject, were calculated to allow for the nonindependence introduced by multiple observers. Bootstrap confidence intervals were bias corrected and used 2000 replications. Nonstatistically significant predictor variables were jointly omitted from the multiple regression model. As a check, these variables were reintroduced one at a time and tested for statistical significance.28
Groups were matched for age and disease duration. Mini-Mental State Examination scores were significantly reduced in dementia groups compared with controls, and in AD compared with FTLD and other dementias (Table 1). Although image quality varied, all images allowed adequate visual assessment of atrophy. The mean (SD) intraobserver agreement of radiological diagnoses was 84% (7.7%), and interobserver agreement (between pairs of observers) based on all 84 subjects in the study was 63% (6.6%). Correcting for chance agreement between observers, κ was 50% (P<.001). Among all subjects, a mean (SD) of 87% (7.3%) of dementia cases were distinguished from controls on visual assessment of MRIs; pathologically confirmed diagnoses of 58% (3.6%) were correctly identified from imaging.
Of 25 cases of primary AD, a mean (SD) of 84% (10.6%) showed abnormal findings, and 67% (8.3%) were correctly diagnosed as AD, distinct from those with other dementias and controls. Of misdiagnosed AD cases, 17% were thought to be normal, 9% FTLD, and 7% other pathologies. Table 2 lists the sensitivity and specificity for visual assessment of regional atrophy in pathologically confirmed AD and FTLD. Hippocampal atrophy (left or right, any severity) was 92% sensitive for AD. However, specificity was 62% when AD was compared with controls and decreased to 6% when compared with FTLD. Specificity improved when moderate to severe (left or right) hippocampal atrophy (atrophy graded as ≥3) was considered, but sensitivity decreased to 41%. Bilateral symmetrical hippocampal atrophy (any severity) was 71% sensitive for AD and 70% and 47% specific for AD when compared with controls and patients with FTLD, respectively. Only posterior greater than anterior gradient of atrophy was highly specific for AD when compared with other pathologies, and sensitivity was moderate. Combining bilateral symmetrical hippocampal atrophy and posterior greater than anterior gradient of atrophy gave 87% sensitivity for AD, with moderate specificity when compared with other pathologies.
All 17 patients with FTLD pathology were distinguished from controls after MRI assessment, and a mean (SD) of 61% (14.8%) were correctly diagnosed as having FTLD pathology. Of those misdiagnosed, 85% were thought to be AD. Atrophy (any severity) of the ATL or amygdala, LTG or fusiform gyrus, and parahippocampal gyrus had high sensitivity (≥90%) for FTLD; specificity was moderate but decreased to 32%, 35%, and 39%, respectively, when FTLD was compared with AD (Table 2). Considering moderate to severe atrophy of these regions increased specificity for FTLD compared with AD to more than 75%, but sensitivity was reduced. Frontal lobe atrophy had moderately good sensitivity (76%) and specificity for FTLD when compared with all other cases and AD cases (74% and 60%, respectively). Atrophy patterns specific for FTLD when compared with other pathologies were anterior greater than posterior gradient of atrophy and asymmetry of atrophy; however, sensitivity was less than 40%. Combining anterior greater than posterior gradient of atrophy and asymmetry of atrophy resulted in a higher sensitivity of 59% for FTLD, while specificity remained high at 84% when AD and FTLD were compared.
Table 3 lists the results of multiple logistic regression analysis performed on AD vs controls, AD vs all other cases, and AD vs FTLD cases. When comparing atrophy patterns in AD and controls, 11 assessments of AD cases rated as having asymmetry of atrophy, a feature that is 100% specific for AD, were excluded from the analysis. Subsequent analysis identified frontal lobe, ATL or amygdala, and hippocampal atrophy as statistically significant independent predictors of AD compared with controls. In distinguishing AD from all other cases, statistically significant atrophy patterns were posterior greater than anterior gradient of atrophy and hippocampal atrophy, which multiplied the odds of having AD by factors of around 5.0 and 1.5, respectively. Isolated unilateral parahippocampal gyrus atrophy was associated with a reduced risk of having AD compared with other diagnoses. In assessing atrophy patterns distinguishing AD from FTLD, posterior greater than anterior gradient of atrophy was statistically significant, with the odds of having AD multiplied 10-fold. In addition, anterior greater than posterior gradient of atrophy and parahippocampal gyrus atrophy were statistically significant independent predictors of a diagnosis of FTLD.
This study investigated the capacity of simple visual inspection of MRIs to distinguish diseases causing young-onset dementia in patients with a range of pathologically confirmed dementing diseases. We demonstrated that visual assessment of MRIs is a sensitive method for identifying dementia and that there are several global and regional atrophy patterns more commonly associated with a pathologically confirmed diagnosis of AD or FTLD.
Hippocampal atrophy of any severity was 92% sensitive and 62% specific for AD compared with controls. Our results are similar to those found in a study18 of clinically diagnosed AD and controls that rated medial temporal lobe atrophy and found 81% sensitivity and 67% specificity for AD. However, our study found that specificity of hippocampal atrophy for AD was poor when AD and FTLD were compared, and it was a nonsignificant factor in multiple logistic regression analysis. Although moderate to severe atrophy of the hippocampus can discriminate AD from controls with 98% specificity, discrimination from other pathologies remained poor, and sensitivity was reduced. This is in accord with studies that demonstrated hippocampal atrophy in FTLD,13,16,19,29,30 vascular dementia,19 and normal aging.18,31 The pattern of atrophy within the hippocampus is important, with bilateral symmetrical hippocampal atrophy occurring in AD,17,29,32 in contrast to asymmetrical atrophy that occurs in FTLD.13,16 In agreement with this, bilateral symmetrical hippocampal atrophy had the highest sensitivity and specificity for AD when compared with FTLD. Atrophy of the parietal cortex has also been associated with AD,20,33 and although sensitivity was moderate in our study, posterior greater than anterior gradient of atrophy was highly specific for AD, especially when compared with FTLD. Combining bilateral symmetrical hippocampal atrophy and posterior greater than anterior gradient of atrophy may ensure a higher specificity for discriminating AD from FTLD and other pathologies than that achieved by hippocampal atrophy alone.
Although several pathologic substrates underlie a clinical diagnosis of FTLD, this study based diagnosis on 3 established clinical subtypes (FLD, primary progressive nonfluent aphasia, and semantic dementia)14 that appear to be associated with different atrophy patterns involving the frontal or temporal lobes, often with left to right asymmetry.11,13,14,30 In our study, the ATL or amygdala, LTG or fusiform gyrus, and parahippocampal gyrus were rated atrophic in 90% or more of the patients with FTLD, and multiple logistic regression analysis identified atrophy of the parahippocampal gyrus as a significant predictor of FTLD. Previous studies have identified the amygdala,11,13 LTG or fusiform gyrus,13,30 and parahippocampal gyrus13,16 as commonly affected in clinically diagnosed FTLD. Chan et al13 found fusiform gyrus atrophy to be highly specific for semantic dementia compared with AD; in contrast, our study found the specificity for FTLD (FLD, primary progressive nonfluent aphasia, or semantic dementia) of atrophy in these regions was low to moderate, highlighting the overlap of atrophy patterns in pathologically confirmed AD and FTLD. Early changes in the parahippocampal gyrus (particularly the entorhinal cortex) have been associated with AD13,34; reports suggest that atrophy is more severe in FTLD, particularly on the left.11,13,29,32 A study16 comparing semantic dementia, FLD, and AD found that moderate to severe parahippocampal gyrus atrophy was highly sensitive for semantic dementia and found in only 17% of the subjects with AD. In our study, moderate to severe atrophy of the ATL or amygdala, LTG or fusiform gyrus, and parahippocampal gyrus each increased specificity for FTLD to more than 75%, with moderate sensitivity, which may reflect the fact that our FTLD group consisted of more FLD and primary progressive nonfluent aphasia than semantic dementia clinical variants, as Galton et al16 found that a much smaller proportion of subjects with FLD than those with semantic dementia had moderate to severe ATL or LTG atrophy.
Compared with other pathologies, anterior greater than posterior gradient of atrophy and asymmetry of atrophy were highly specific for FTLD but of low sensitivity, which may reflect the difficulty of assessing global atrophy patterns in these patients, some of whom may have only mild atrophy. A recent study20 comparing clinically diagnosed FTLD, AD, and vascular dementia found a similarly high specificity and low sensitivity (100% and 38%, respectively) for FTLD of asymmetrical atrophy. Rating of severe frontal atrophy in these cases demonstrated that it was 93% specific and 52% sensitive. The study20 also showed that combining asymmetry and frontal atrophy ratings increased sensitivity to 71%, with specificity remaining high at 93%, a result similar to ours when combining anterior greater than posterior gradient of atrophy and asymmetry of atrophy. Another study35 found that a discriminant function that includes values of frontal or temporal asymmetry could distinguish FTLD from AD with 90% sensitivity and 93% specificity. Anterior greater than posterior gradient of atrophy, asymmetry of global atrophy, and moderate to severe ATL or amygdala atrophy may be additive in their detection of FTLD. Indeed, the highest sensitivity among the 3 observers for the detection of FTLD was 76%, with a similarly high specificity.
Findings from our study can also be compared with those obtained from region of interest measurements, a more precise quantification of atrophy. Several studies29,34,36 investigating hippocampal volume measurements to classify subjects with clinically diagnosed AD from controls found that sensitivity ranged from 75% to 80%, with specificity ranging from 76% to 90%. In contrast, our visual inspection study found slightly lower sensitivity but higher specificity. Investigating discrimination of subjects with clinically diagnosed FTLD from controls, one study29 found that volumetric measures of entorhinal cortex distinguished only 50% of subjects, for a specificity of 90%. This contrasts with the 92% sensitivity and 61% specificity found in our study of visual ratings of parahippocampal gyrus to discriminate FTLD from other dementias and controls (arguably a more difficult distinction). Few volumetric studies have attempted to use measures to discriminate between dementing diseases.13,34,36 One study looked at the benefit of using volumetric measures or visual ratings of the medial temporal lobe in addition to MMSE scores to discriminate AD from other dementias.19 Volumetric measures yielded no diagnostic gain over the MMSE scores (68% sensitivity and 53% specificity), whereas visual ratings did (78% sensitivity and 64% specificity).19 Although quantitative techniques may be more precise in their measurement of atrophy, unlike visual assessment, they do not consider the relative state of atrophy within the whole brain and are more labor and equipment intensive. Visual inspection appears to perform well in comparison, especially given that the technique is more applicable in a clinical setting.
The mix of patients in this study, young age at onset, and inclusion of atypical presentations (postmortem or biopsy is more likely to be performed when a clinical diagnosis is uncertain) enabled visual inspection of MRIs to be tested in a demanding and clinically realistic manner. Furthermore, although mixed pathology is common and it is important for the primary pathology to be identified, radiological diagnosis may have been more challenging in these cases. Although the patients in our study had imaging performed as part of a diagnostic workup, the mean duration from earliest symptom to MRI was around 4 years. Although early symptoms may be subjective, subtle, and only elicited on direct questioning of the patient and relatives, MRI observations would be most useful at this time. The inclusion of patients at a more advanced stage of disease may reduce the relevance of our findings to the earliest detection of dementias but reflects clinical practice, in which there may be a considerable delay between first symptoms and brain imaging and finally to diagnosis. A previous study has shown a mean time from symptoms to diagnosis of around 4 years.37 Although the difference in MMSE scores between the AD and FTLD groups suggests that patients with AD were more severely affected, this may instead reflect the limitations of assessing disease severity in FTLD using MMSE scores.14
Visual inspection of MRIs is a quick and widely available method of analysis in common clinical practice. Although it may be subject to interrater variation, regular multidisciplinary meetings providing feedback on visual analysis would help to minimize this and provide training. Regional atrophy that is highly sensitive for a disease may not provide a conclusive diagnosis, but visual inspection methods can be diagnostically useful in cases in which regional atrophy highly specific for one disease is present. When used alone or in combination, regional atrophy patterns with high specificity or sensitivity for dementia pathologies can produce moderately good discrimination between AD and FTLD.
Correspondence: Valerie M. Anderson, BSc, Dementia Research Centre, Institute of Neurology, Queen Square, London WC1N 3BG, England (email@example.com).
Accepted for Publication: March 24, 2005.
Author Contributions:Study concept and design: Likeman, Anderson, Stevens, Godbolt, Rossor, and Fox. Acquisition of data: Anderson, Stevens, Waldman, and Fox. Analysis and interpretation of data: Likeman, Anderson, Stevens, Frost, and Fox. Drafting of the manuscript: Likeman, Anderson, and Fox. Critical revision of the manuscript for important intellectual content: Likeman, Anderson, Stevens, Waldman, Godbolt, Frost, Rossor, and Fox. Statistical analysis: Frost. Obtained funding: Rossor. Administrative, technical, and material support: Rossor and Fox. Study supervision: Waldman, Frost, and Fox.
Funding/Support: This study was funded by EU contract QLK3-CT-2001-02362/LSHM-CT-2003-503330 (administered by VERUM Foundation, Munich, Germany)from Early Diagnosis of Alzheimer’s Disease and Related Dementias/Abnormal Proteins in the Pathogenesis of Neurodegenerative Disorders (DIADEM/APOPIS); the Alzheimer’s Disease Society, London; Alzheimer’s Research Trust, Cambridge, England; and Medical Research Council, London. Dr Fox holds a Medical Research Council Senior Clinical Fellowship.
Additional Information: Dr Likeman and Ms Anderson contributed equally to this study.
Acknowledgment: We thank the patients and their families for consenting to biopsy and brain donation. We acknowledge the following neuropathology departments for performing postmortem and histologic examinations: Institute of Psychiatry, London; National Hospital for Neurology and Neurosurgery; Queen Square Brain Bank for Neurological Disorders, London; Addenbrooke’s Hospital, Cambridge; and Radcliffe Infirmary, Oxford, England.