Fox NC, Cousens S, Scahill R, Harvey RJ, Rossor MN. Using Serial Registered Brain Magnetic Resonance Imaging to Measure Disease Progression in Alzheimer DiseasePower Calculations and Estimates of Sample Size to Detect Treatment Effects. Arch Neurol. 2000;57(3):339-344. doi:10.1001/archneur.57.3.339
Copyright 2000 American Medical Association. All Rights Reserved. Applicable FARS/DFARS Restrictions Apply to Government Use.2000
To evaluate the rate of brain atrophy calculated from serial magnetic resonance imaging (MRI) registration as a surrogate marker of disease progression for use in clinical trials in Alzheimer disease (AD).
Eighteen patients with mild to moderate AD and 18 age-matched normal controls underwent 2 MRI brain scans separated by a 12-month interval. Each individual's later scan was registered to their first scan, and the volume of cerebral tissue loss calculated directly from the registered and subtracted MRI scan pairs. The mean and SD of the rate of brain volume changes were used to estimate the sample sizes that would be needed in a clinical trial with a drug anticipated to modify disease progression by varying degrees. Comparable sample size estimates were performed with data for other methods of monitoring rates of brain atrophy, extracted from published papers.
The mean (SD) rate of brain atrophy for the patients with AD was 2.37% (1.11%) per year, while in the control group it was 0.41% (0.47%) per year. Based on these figures, to have 90% power to detect a drug effect equivalent to a 20% reduction in the rate of atrophy, 207 patients would be needed in each treatment arm. This assumes a 1-year placebo-controlled trial with a 10% patient dropout rate, and that 10% of scan pairs are unusable.
Registration of serial MRI volume images provides a powerful method of quantification of brain atrophy that can be used to monitor progression of AD in clinical trials.
ALZHEIMER DISEASE (AD) is characterized by gradual cognitive decline and progressive cerebral atrophy. At autopsy, the histopathological disease hallmarks of amyloid plaques and neurofibrillary tangles are accompanied by widespread synaptic and neuronal loss. Atrophy is a macroscopic consequence of these cellular losses that can be visualized in life using computed tomography or magnetic resonance imaging (MRI).1
A wide range of drugs is currently under development for treating AD.2 The most successful class of compounds to date has been the cholinesterase inhibitors. These agents, which include tacrine,3 donepezil,4 rivastigmine,5 and metrifonate6,7 have all produced statistically significant symptomatic improvement over 6 to 12 months in randomized, placebo-controlled clinical trials. The clinical trial designs used, eg, placebo-controlled, parallel group studies and assessed efficacy using end-point measurements. With the cholinesterase inhibitors, patients on active treatment seem to remain unchanged, as a group, while those who receive placebo decline. This has been widely interpreted as a delay in symptomatic progression. A vital question that now needs to be addressed is what effect are these compounds having on the biological progression of the disease; ie, are they able to alter the rate of neurodegeneration? Addressing this issue in clinical trials is the next major issue in drug development for AD.8,9
Cholinesterase inhibitors as a class were originally thought to be a symptomatic treatment only, yet there is now a suggestion from these early symptomatic protocol trials, and from in vitro research on amyloid processing, that there may be a disease-modifying effect. Moreover, there are numerous other potential candidates for disease-slowing effects, including neuroprotective,10 hormonal,11 and anti-inflammatory12 agents whose primary effects may be to modify biological progression. There is currently no established method to directly track the cellular changes that underlie the clinical manifestations of the disease. However, neuroimaging can provide a surrogate marker of cell loss in the form of rates of cerebral atrophy.13 In addition, Fox et al14 showed a strong correlation between atrophy measured from serial MRI and cognitive decline given by the Mini Mental State Examination (MMSE) score. Measurement of rates of tissue loss have, until now, focused on improving diagnosis and have measured specific structures—particularly medial temporal lobe regions,15- 20 ventricular volume,21,22 or the total brain.23 We have used registration of serial MRI to increase the sensitivity and reproducibility of measurements and have shown very significant differences between normal aging and AD, even in early cases.13,24,25 The development of a sensitive, reliable, and well-validated technique for monitoring rates of cerebral atrophy in populations of patients participating in clinical trials would provide a valuable surrogate measure of biological disease progression.26
It is important that clinical trials attempting to measure cerebral atrophy as a marker of progression are of sufficient power to detect a treatment effect. We therefore undertook a study to determine rates of brain loss using registration of serial MRI brain scans in untreated, clinically diagnosed patients with mild to moderate AD, typical of those likely to be included in therapeutic trials. The results from this study were used to estimate the necessary numbers for trials to show a putative treatment effect with drugs of various magnitudes of effect. We further compared these sample size estimates with those derived from other methods of measuring progressive brain atrophy using published rates of ventricular enlargement and hippocampal atrophy.
The Dementia Research Group at The National Hospital for Neurology and Neurosurgery, London, England, is involved in a wide range of longitudinal neuroimaging studies and clinical drug trials in Alzheimer disease. Patients recruited into these longitudinal studies routinely have MRI brain scans at 12-month intervals. The diagnosis of Alzheimer disease is made according to National Institute of Neurological and Communicative Disorders and Stroke–Alzheimer's Disease and Related Disorders Association criteria,27 and disease severity is estimated using formal neuropsychometry and clinical assessment including the MMSE.28 All data are stored in a centralized database (Microsoft Access 95 and SQL Server 6.5; Microsoft Corp, Redmond, Wash, 1995, 1996).
A database search was performed to identify all patients with clinically diagnosed AD who had undergone 2 MRI scans with approximately a 12-month interval between scans, and who had mild to moderate dementia (MMSE score, 10-26) at the date of their first scan. The search provided 18 patients with AD, 2 familial and 16 sporadic, with a mean interval between scans of 11 months. None of the patients was known to be taking active medication. Age-matched control subjects were selected from our pool of normal volunteers. Demographic details are given in Table 1.
All subjects had given their consent both for the MRI and for participation in longitudinal research studies. All studies had the approval of the Local Research Ethics Committee.
T1-weighted, volumetric MRI scans were performed on a 1.5-T Signa unit (General Electric, Milwaukee, Wis) yielding 124 contiguous 1.5-mm coronal slices through the head with a 256 × 128 image matrix (acquisition parameters: time to repeat/echo time/number of excitations/FLIP ANGLE, 35/5/1/35). Axial dual-echo sequences (T2- and proton-density weighted) were also acquired.
Magnetic resonance images from the volumetric sequences were transferred to a Sun workstation (Sun Microsystems Inc, Mountain View, Calif) where registration and quantification of atrophy were performed retrospectively in a randomized and blinded fashion. Only the brain is used for the registration procedure, and semiautomated 3-dimensional image analysis procedures were used for approximate delineation of the brain from the rest of the MRI (scalp, neck, etc).13,29 The registration algorithm is fully automated and uses the complex structure of the whole brain, considered as a volume rather than a set of slices, to determine the rotations and translations necessary to place 1 brain image set accurately on the other.13,30 Each individual's follow-up scan was registered to their baseline scan. The change in cerebral volume was computed directly from the registered image sets using a validated technique for automatically calculating the amount that cerebral boundaries have changed between scans.25 Cerebral volume loss was expressed as a percentage of initial total brain volume and annualized to give a rate of global atrophy.
Data were analyzed using Stata version 5.0 (Stata Corporation, College Station, Tex, 1997). Sample size requirements were estimated using the standard formula31: Sample size = (u + v)2 × (σ12 + σ22)/(µ1 − µ2)2, where u = 1.28 to provide 90% power and v = 1.96 to test at the 5% level; µ and σ are the mean and SDs of rates of atrophy in the treatment and placebo groups (assumes µ1 ≈ µ2)
The following modifications were included:
The difference in annual rates of tissue loss between patients with AD and healthy age-matched controls was taken to represent the maximum possible effect a treatment could have (ie, 100% impact). If this assumption is not made, the potential for a therapeutic effect may be overestimated. A reduction of 20% in progression was therefore considered to be equal to 20% of this difference rather than 20% of the total loss in patients with AD.
The sample size thus derived was increased by 10% to allow for losses to follow-up.
The sample size was then further increased assuming that about 10% of scan pairs could not be used (eg, too much movement on 1 of the scans).
All calculations were performed based on the requirement that a trial should have 90% power to detect the specified treatment effect when a 2-sided 5% level of significance is used.
The AD and control groups were closely matched for age, sex, and interval between scans (Table 1). The mean (SD) rate of cerebral atrophy was 2.37% (1.11%) of brain volume per year for the AD group and 0.41% (0.47%) per year for the control group. The change in mean (SD) MMSE scores was from 19.6 (4.1) at baseline to 17.1 (6.3) at second scan in the patients with AD and 29.2 (1.0) to 29.2 (1.1) in the control group.
The results of the sample size calculations for the serial MRI registration technique are shown in Table 2. This table can be interpreted for a drug with an anticipated ability to reduce the rate of cerebral atrophy by 20% over 1 year as follows: To have 90% power to detect a drug effect, allowing 0.24% per year for normal aging, a 10% dropout rate, and assuming that 10% of scan pairs are unusable, 207 patients will be needed in each treatment arm.
The calculations presented above suggest that this method of measuring rates of cerebral atrophy may be feasible for use in clinical phase 2/3 drug trials. The calculations take into account subject attrition and technical problems with scan acquisition, yet still indicate a realistic sample size given the cost of including serial MRI in a large clinical trial. The therapeutic effects used in the examples given ranged from 10% to 50%. Less than a 10% disease-slowing effect would be difficult to show and might not be considered clinically relevant. A disease-slowing effect of more than 50% is unlikely to need a surrogate marker of progression because the clinical effects would be so dramatic.
Serial MRI registration is a comparatively new technique for measuring rates of cerebral atrophy. Although a large number of articles describe the cross-sectional features and correlates of neuroimaging changes and volumetric measures in AD, only a relatively small number of longitudinal structural imaging studies of appropriate design have been published with atrophy rates and their SDs reported. A number of studies20,21,32- 36 where comparable data are available are summarized in Table 3. The paucity of longitudinal studies is surprising considering their importance.
Comparisons of the different techniques used to measure atrophy are difficult because the patient data sets are necessarily different. Based on the published rates of atrophy of the structures measured, sample size calculations were carried out using an identical method to that used for the MRI registration technique. These sample size estimates are summarized in Table 4.
Most studies focused on ventricular enlargement. Of those listed, only 2 studies used MRI.20,32 Kaye et al32 studied elderly patients who were initially recruited "predementia" and observed to 12 months postdiagnosis. Rates of atrophy might therefore be expected to be higher if based only on the period postdiagnosis. Jack et al (1998)20 conducted a careful longitudinal study measuring hippocampal and temporal horn volumes from volumetric MRI in 24 patients with AD and 24 controls. The interval between scans was almost 2 years. Registration was used prior to manual segmentation of the regions of interest resulting in high levels of reproducibility. The rate of hippocampal atrophy in the patients with AD (4% per year) was only 2½ times that of the control group (1.6% per year), and the relatively small difference between patients and controls increases the sample size requirements. The factors that are important in determining sample size are first the difference between AD and control rates of loss for the structure to be measured; the maximum treatment effect that could be expected is to reduce the rate of loss to that of age-matched controls (100% disease slowing). The second factor is the heterogeneity or variability (the SD) of the measured rates of atrophy in the AD group. This variability is the result of genuine differences between individuals, dependent on stage of disease and so forth, as well as the measurement error of the technique used. The more homogeneous the measured rate of atrophy in the patients, the easier it is to detect a small treatment change.
Bearing these issues in mind, what then are the reasons for the wide range (6-fold) in the estimated sample sizes required by the different techniques? It may be that the patients included in some of the studies had intrinsically higher heterogeneity. The patients in our study were fairly carefully assessed reflecting a relatively pure group of clinically diagnosed probable AD. In addition, our patients as a group were younger than in many studies. While this may not affect the rate of atrophy in the AD group, it may mean that the controls, being younger, had lower rates of loss than might be seen in older controls. For instance, the mean age of the control group of Jack et al20 was 81 years. Rates of whole brain atrophy might similarly be higher in an older group. However, a recent article by Mueller et al37 suggests that healthy oldest-old subjects do not show greater rates of brain loss compared with younger elderly. These issues are controversial, and we are currently investigating how rates of cerebral atrophy in AD and in normal aging are affected by age.
Alternatively, techniques that measure brain substructures may be subject to greater measurement error because of the difficulty in manually segmenting (outlining) these structures. This may be particularly relevant for computed tomographic studies and for the earlier published studies where measurement techniques may have been less sophisticated. The use of manual methods of segmentation and volume measurement introduces the issue of interobserver and intraobserver variability. Measurement errors are likely to be greater for smaller and less easily defined neuroanatomical structures, particularly if an arbitrary cutoff to the structure is used (eg, the posterior boundary of the temporal lobe or hippocampus). Furthermore, a cerebral substructure may have a variable rate of loss as the disease progresses. For example, it may be that while the hippocampus is severely affected early in the disease, it may not lose tissue as rapidly in the progression from mild to moderate AD, whereas other areas may only show significant changes later in the disease. For instance, Jack et al19 found that in very mild AD (mean MMSE, 22/30), the mean hippocampal volume was already significantly smaller (1.75 control SDs) than that of controls. More severely affected patients (mean MMSE, 18/30) had a mean hippocampal volume that was 1.99 control SDs below the control mean—in effect only slightly smaller than that of the very mildly affected patients. By contrast, measuring total cerebral atrophy may even out these differences but lack sensitivity in very early cases.
The MRI registration technique addresses several of these issues. The measurement of total cerebral atrophy includes all areas of loss and avoids the necessity of making a priori decisions about relevant regions of interest. The technique has high repeatability: scan-rescan reproducibility based on subjects having 2 scans on the same day shows a mean absolute difference between the 2 measures of under 2 mL (< 0.2% of brain volume). The technique is relatively insensitive to the operator because both registration of the scans and quantification of atrophy are performed virtually automatically. The only operator input is in the preprocessing step, where scalp and nonbrain tissues are roughly removed in a semiautomated procedure that typically takes less than 10 minutes. The registration process also compensates for "scanner drift" or variation in voxel size; conventional volumetric MRI-based measurements do not allow for these variations.
Alzheimer disease is inevitably a heterogeneous condition, and rates of disease progression between subjects are very variable. An enhancement of this technique for monitoring the disease progression might be to use subjects as their own controls for rate of atrophy. This would require subjects to have a 3- to 6-month placebo run-in with 2 MRI scans to establish their individual rate of atrophy. A further MRI scan would then be performed after 6 to 12 months on either active treatment or placebo, with the outcome assessed by comparison with their baseline rate of atrophy.
In terms of practical application of the registration technique, our experience has shown that it is possible to successfully scan patients with MMSEs as low as 10/30. By collecting only the coronal volumetric sequence, it was possible to reduce the duration of the scan to as little as 10 minutes, which substantially improved tolerability. Techniques that reduce patient movement during scans (including faster scans) are very important in longitudinal studies because both of a subject's scans must be usable in order to calculate a rate of loss. The figure of 10% of unusable scan pairs is likely to be an underestimate unless great care is given to patient compliance and, where necessary, scans are repeated. It should be noted that a larger sample is needed if 10% of all scans (as opposed to scan pairs) were unusable and these scan failures were randomly distributed. For studies requiring 2 scans on each patient, this amounts to an increase of approximately 25%. For "within patient" trials requiring 3 scans, this amounts to an increase of approximately 35%. Improvements in MRI techniques mean that faster scanning with similar resolution is now possible, which, together with careful explanation of the scanning procedure, may reduce problems of movement artifact and patient dropout. These advances, together with improving image analysis, should mean that imaging studies of atrophy progression in AD are increasingly feasible in trials examining disease progression.
The absence of methods to measure disease progression at a molecular level has increased interest in using MRI-based measures of atrophy. However, there are some caveats to be attached to the use of this method as a surrogate marker of disease progression. First, cerebral atrophy is an indirect measure of pathological processes occurring at the cellular level. Second, therapeutic disease-slowing effects at the molecular or cellular level may in theory take some time to feed through to a slowing in the rate of atrophy. Nonetheless, macroscopic atrophy seems to be an inevitable concomitant of the cellular destruction, synaptic loss, and dendritic pruning of AD. If a therapeutic intervention in AD were to slow rates of cerebral atrophy as well as produce symptomatic benefit, this would be strong evidence for a disease-modifying effect. Registration of serial MRI offers a powerful method of tracking changes in rates of atrophy with potential for use in clinical trials.
Serial MRI registration offers an accurate, powerful method for measuring rates of atrophy in AD. The results of this study suggest that the technique could be effectively used in a clinical trial setting evaluating the efficacy of a drug in modifying the biological progression of the disease. By comparison with the few published longitudinal imaging studies, quantification of total rates of cerebral atrophy based on registered MRI seems to require the smallest sample size, and may thus be the most cost-effective method available. Further research comparing longitudinal measurements of regional (eg, hippocampal) and global atrophy at different stages of the disease are needed to guide the choice of surrogate markers for forthcoming trials in AD.
Accepted for publication, April 26, 1999.
This study was supported by the Medical Research Council, London, England. (Dr Fox is supported by a Medical Research Council Clinician Scientist Fellowship. Dr Harvey is supported by an Alzheimer's Disease Society Research Fellowship, London, England. Dr Rossor holds a Medical Research Council Program Grant.)
Reprints: Martin N. Rossor, MD, FRCP, Dementia Research Group, The National Hospital for Neurology and Neurosurgery, Queen Square, London WC1N 3BG (e-mail: email@example.com).