Funnel plot assessing potential publication bias. The horizontal line indicates the average effect size as the log of the odds ratio (OR); diagonal lines, approximate 95% confidence intervals for estimates.
Forest plots for random effects meta-analyses. CI indicates confidence interval.
Ownby RL, Crocco E, Acevedo A, John V, Loewenstein D. Depression and Risk for Alzheimer DiseaseSystematic Review, Meta-analysis, and Metaregression Analysis. Arch Gen Psychiatry. 2006;63(5):530-538. doi:10.1001/archpsyc.63.5.530
A history of depression may increase risk for developing Alzheimer disease (AD) later in life. Clarifying this relation might improve understanding of risk factors for and disease mechanisms in AD.
To systematically review and complete a meta-analysis on the relation of depression and AD.
We conducted electronic bibliographic searches of MEDLINE, PsychLit, EMBASE, and BIOSIS using search terms sensitive to studies of etiology combined with searches on terms related to depression and AD and reviewed reference lists of articles.
Studies with data contrasting depressed vs nondepressed patients who did and did not later develop AD were included. Studies that related continuous measures of depression and cognitive status were excluded.
Numerical data were independently extracted by 3 reviewers. They also rated studies on a scale that assessed quality indicators for observational studies. Data on the interval between observation of depression and the diagnosis of AD were collected when available.
Meta-analytic evaluation with random-effects models resulted in pooled odds ratios of 2.03 (95% confidence interval, 1.73-2.38) for case-control and of 1.90 (95% confidence interval, 1.55-2.33) for cohort studies. Findings of increased risk were robust to sensitivity analyses. Interval between diagnoses of depression and AD was positively related to increased risk of developing AD, suggesting that rather than a prodrome, depression may be a risk factor for AD.
A history of depression may confer an increased risk for later developing AD. This relation may reflect an independent risk factor for the disease.
Depression is a critically important issue for those working with the elderly and especially those working with older persons with Alzheimer disease (AD). Depression affects a large number of elderly people1 and has been associated with poor cognitive function.2 Its treatment has been associated with improved functional status.3 It is a significant behavioral aspect of the symptomatology of AD4 that affects the cognitive and functional status of patients with AD.5 A better understanding of the relation of AD and depression thus might have important clinical and research implications.
A personal history of depression has been related to increased risk for developing AD later in life, although this finding has not been universal. For example, some case-control studies have found a relation between a history of depression and risk for AD.6- 9 A number of these studies, however, have potential biases that limit their interpretability. Most notably, case-control studies that are inherently retrospective may be biased by a greater recall of history of depression in patients with AD (the cases), who often undergo more careful evaluation than do control subjects. Even some case-control studies have yielded negative results.10- 14 Cohort studies, in which a group of persons undergoes evaluation and then prospective follow-up, have also yielded inconsistent results, with some indicating a statistically significant increased risk for AD with history of depression15- 19 and others not finding it.20- 26 Because depression affects a large number of patients with AD and has been associated with increased morbidity and mortality in the elderly, an improved understanding of the relation of depression to AD may have important public health implications.
The relation between depression and risk for later development of dementia is thus unclear. Other study-related factors may have affected these studies' outcomes. Some studies of depression and AD evaluated the relation between the number of depressive symptoms (rather than categorical diagnosis) and diagnosis of AD,27 whereas others studied the relation of symptoms and measures of cognitive function, such as the Mini-Mental State Examination,28 rather than AD diagnosis. Two studies19,29 found that the relation between depression and risk for AD existed only for men, although other studies found a relation between number of depressive symptoms and subsequent poorer cognitive function only in women.2,30
Understanding the relation of depression and AD is further complicated by the possibility that depression may be a prodromal symptom of AD, so that its appearance before the recognition of other illness-defining symptoms of AD constitutes a harbinger of rather than a risk factor for the disease. One reviewer,31 for example, concluded that there was insufficient evidence to determine whether depression represents an independent risk factor or an early symptom in AD. Other researchers19,24,32 have more recently discussed the uncertainty that exists on this issue.
Previous reviews31,33,34 on the topic did not include several recent references, did not complete a systematic review of data sources, and may have included data from studies of patient groups with mixed psychiatric histories. The primary purposes of the present study were thus to systematically review studies on the relation of a history of depression to the risk for subsequent diagnosis of AD and to investigate the relations among observed risk for AD and other study variables. A secondary purpose was to investigate the relation of the interval between the diagnoses of depression and AD to observed risk for AD. We hypothesized that a negative relation (an inverse relation between the interval and increased risk for AD, implying that depression was more likely to occur near the time of AD diagnosis) would support the interpretation of depression as a prodromal symptom. A positive relation, with a larger interval between the diagnoses of depression and AD related to increased risk for AD, would then support the hypothesis that depression is an independent risk factor for AD.
Studies were found through a variety of methods that included computerized bibliographic searches and review of reference lists of pertinent articles. Consistent with results of a study35 that showed the importance of searching multiple databases to find the maximum number of relevant citations, we searched MEDLINE, PsychLit, EMBASE, and BIOSIS using a search strategy with maximum sensitivity for studies of etiology.36 Searches initially used the following strings: risk* (in title or abstract) OR risk* (as a Medical Subject Heading [MeSH] term, not exploded) OR cohort studies (as a MeSH term) OR group* (as a text word). Results of these searches were combined with sets created with depression OR depressive AND Alzheimer disease OR dementia OR dementing. Bibliographies of located articles were reviewed for possible data sources, as were the bibliographies of articles thus located.
We included studies that provided sufficient information to allow the calculation of crude odds ratios (ORs) for the risk of developing AD or AD-like dementia in persons with a history of depression compared with the same risk in persons without this history. Studies thus were required to provide data on the history of clinically diagnosed depressive disorder at some time before the clinical diagnosis of AD or dementia. Studies were included if they provided a description of diagnostic criteria that required the presence of symptoms consistent with major depressive disorder, even if they did not specifically describe use of criteria from International Classification of Diseases (ICD) or DSM.37- 39 In similar fashion, studies were included if they included a description of diagnostic criteria for AD or AD-like dementia (eg, emphasized gradual progression of cognitive deficits), but they were excluded if diagnostic criteria were vague (eg, organic dementia) or included patients with vascular as well as AD-type dementias. Studies were excluded if they did not include a control group for comparison or if they provided data on continuous measures of depression (eg, number of depressive symptoms) and thus did not establish the clinical diagnosis of depression or cognitive status (eg, the Mini-Mental State Examination) and similarly did not establish the clinical diagnosis of dementia. (A list of excluded studies with explanations for exclusions is available from the authors.) Studies using continuous measures were excluded because of the diversity in use of measures and the fact that many studies only reported results adjusted for covariates such as age. Differences in measures of cognitive status and depression, as well as differences in statistical methodologies, made inclusion of these studies in the meta-analysis impossible.
Data from studies meeting inclusion criteria were extracted independently by 3 reviewers (3 from the group of us 5 authors for each extraction). In addition to the number of patients, studies were categorized as case-control or cohort, and if cohort, whether they were prospective or retrospective. Prospective cohort studies were defined as those that identified a group of participants and followed them up through time, whereas retrospective cohort studies were defined as those that identified a group of participants and used existing records to evaluate their clinical characteristics and course. The Newcastle-Ottawa Scale40 was used to assess the quality of each study. This measure assesses aspects of methodology in observational studies related to study quality, including selection of cases, comparability of populations, and ascertainment of exposure to risks. Where possible, an estimate of the interval between the diagnoses of depression and AD was extracted. In all cases, disagreements among raters were resolved through discussion so that a consensus was obtained.
Before combining studies in the meta-analysis, we evaluated the presence and possible causes of heterogeneity in risk for AD associated with a history of depression. The presence of heterogeneity in study effects was evaluated first. Because some evidence was found for the presence of heterogeneity in study outcomes, subsequent pooled analyses used random-effects estimating methods since this approach is more statistically appropriate41,42 for the question addressed in this study (ie, assuming a sample of studies, what is their likely pooled effect?). We evaluated the possible effects of dichotomous variables (such as case-control vs cohort study methodology or studies that used structured diagnostic criteria vs those that did not use structured criteria) in separate analyses stratified on the dichotomous variable. The effects of continuous variables, such as the interval between the diagnoses of depression and dementia, were evaluated via metaregression. The influence of specific study characteristics were evaluated through ratings on the Newcastle-Ottawa Scale. This scale assesses aspects of observational studies (eg, completeness of follow-up) often related to study quality, such as appropriateness of selection criteria or comparability of patient groups. Analyses of the relation of Newcastle-Ottawa Scale items to study outcomes were completed via metaregression analyses.
Publication bias was evaluated by inspection of a funnel plot that related studies' standard errors to their effect sizes. The underlying notion is that small studies are more likely not to have been published but would generally have had larger standard errors. If the funnel plot shows an asymmetry, it suggests that studies that might have reported negative results may not have been published.43 Because inspection of the funnel plot suggested the possibility of publication bias, we then applied a specific statistical technique that evaluated the effect of possible publication bias on estimates of pooled ORs.44 Finally, to address the question of whether a history of clinical depression was a prodrome of or a risk factor for AD, we conducted a separate metaregression analysis to test the hypothesis that the interval between the diagnoses of depression and AD might be related to the likelihood of developing AD.
Data were subjected to meta-analyses stratified by study type (case-control vs cohort study) to obtain composite estimates of ORs separately for each study type and for all studies combined. Study heterogeneity was assessed through inspection of the funnel plot and by the Egger test.45 We used this test rather than the χ2 test for heterogeneity because of its greater sensitivity to the presence of heterogeneity.46 Odds ratios corrected for possible publication bias were calculated44 to assess the effects of this potential bias. Sensitivity analyses included comparisons of risks in studies that used structured diagnostic criteria for the diagnoses of AD and depression compared with those that used less specific criteria and for prospective vs retrospective cohort studies. Sensitivity analyses also included assessment of the influence of each study on overall estimates of risk by recalculating ORs with 1 study removed and all others included from the pooled estimate over multiple iterations.
Metaregression analyses allow the application of regression techniques to the prediction of study outcomes so that the relation of other variables of interest to outcomes can be evaluated. The technique is similar to standard linear regression except that it is usually modified to include a term, τ,41 that estimates the magnitude of variability among studies. Because of its explicit allowance for and estimation of variability among studies, it is similar to random-effects analysis of variance and thus may be called random-effects metaregression. The effects of several variables in combination on study outcome can be evaluated and corrected for, providing estimates of the size of pooled study effects after correction for the variables of interest. In this way, metaregression analysis provides the ability to evaluate the relation of potential covariates and confounding variables to study effects.
To assess the effects of study characteristics on risk, we calculated random-effects metaregression analyses of ORs adjusted for each of the study characteristics assessed by the Newcastle-Ottawa Scale for case-control and cohort studies. Consistent with recommendations against using composite scale scores to control for study quality in meta-analyses and for the use of specific study characteristics for this purpose,47 we used the presence or absence of each study characteristic in the metaregression. To assess the effect of the interval between depression and AD diagnoses on risk, we completed a random-effects metaregression analysis with the interval between the diagnoses of depression and AD predicting the risk of developing AD. All analyses were completed using routines available for Stata software, version 8.2 (StataCorp, College Station, Tex).
We identified 153 potentially relevant studies from results of electronic searches and reviews of bibliographies. Of these, 16 were review articles that did not include data; 53 provided data but did not allow group comparisons; 43 provided data that were not relevant to the relation of depression and AD; 17 used continuous measures; and 2 included data that appeared to overlap with another study. Twenty studies provided data that could be used in calculating crude ORs. Two additional studies provided data on AD risk in twin samples as unadjusted ORs with 95% confidence intervals (CIs) that could be used in pooling estimates48,49; these were judged potentially relevant but, because the comparison groups differed substantially from those in the other 20 studies, they were excluded from pooled data analyses. We therefore report analyses of these 2 studies separately.
The remaining 20 studies provided data from an aggregate sample of 102 172 persons in 8 countries. Study characteristics are listed in Table 1 and ratings of study quality for each of the Newcastle-Ottawa criteria are presented in Table 2. We completed analyses of effect heterogeneity with case-control and cohort studies in combination and separately for each type of study. For all studies combined, evidence of significant heterogeneity was observed (Egger test, t=2.75; P=.01). For separate analyses by study type, evidence of heterogeneity of borderline statistical significance was found for case-control studies (P=.10), a probability larger than the generally accepted value of .05, but suggestive of the presence of heterogeneity, given the low power of these analyses stemming from the small number of studies involved. Significant heterogeneity was found for cohort studies (P=.02). Given these findings, we therefore completed subsequent meta-analyses of ORs using random-effects models stratified by study type and for all studies combined. A funnel plot was obtained (Figure 1) to visually assess the presence of publication bias. Inspection of this plot suggested the possible presence of publication bias (ie, failure to find studies with negative results). We used the “trim-and-fill” method proposed by Duval and Tweedie44 to obtain estimates of ORs corrected for possible publication bias.
Forest plots for random-effects meta-analysis stratified by study type and for all studies combined are presented in Figure 2. The OR for case-control studies was 2.03 (95% CI, 1.73-2.38; z = 8.65; P<.001) and for cohort studies, 1.90 (95% CI, 1.55-2.33; z = 6.15; P<.001). The estimate for all studies combining both case-control and cohort studies was 2.02 (95% CI, 1.80-2.26; z = 12.07; P<.001). Stratified analyses for prospective vs retrospective cohort studies yielded pooled OR estimates of 1.78 (95% CI, 1.16-2.73; z = 2.64; P = .008) for 6 prospective cohort studies and 2.11 (95% CI, 1.82-2.45; z = 9.73; P<.001) for 5 retrospective studies.
Stratified analyses for use of structured criteria (DSM or the National Institute of Neurological and Communicative Disorders and Stroke–Alzheimer's Disease and Related Disorders Association) for AD diagnosis yielded pooled ORs of 2.22 (95% CI, 1.98-2.49; z = 13.67; P<.001) for studies not using structured criteria and 1.91 (95% CI, 1.62-2.26; z = 7.54; P<.001) for studies using structured criteria. Evaluation of studies that did not use specific diagnostic criteria for depression (DSM or ICD) yielded an OR of 1.85 (95% CI, 1.58-2.17; z = 7.68; P<.001) whereas that for studies that did use specific criteria was 2.23 (95% CI, 2.00-2.48; z = 14.68; P<.001). The pooled OR for the 4 studies that used structured or specific diagnostic criteria for both depression and AD diagnoses was 2.30 (95% CI, 1.71-3.09; z = 5.53; P<.001). Only studies that found sex differences in ORs reported data on sex differences in AD risk, so that pooled comparisons of risk by sex might have yielded misleading results because of reporting bias.
Evaluation of possible publication bias suggested its presence, and we used a technique that estimates the number and magnitude of missing studies and then calculates a corrected OR to further evaluate the importance of this possible bias.44 Random-effects ORs corrected in this way for publication bias were 1.96 (95% CI, 1.68-2.30; z = 8.50; P<.001) for case-control studies, 1.90 (95% CI, 1.55-2.33; z = 6.16; P<.001) for cohort studies, and 1.98 (95% CI, 1.76-2.24; z = 11.19; P<.001) for all studies combined. Correction for possible publication bias thus did not substantially change estimates of ORs.
We recognized that some of the 6 prospective cohort studies might have included persons with a history of depression in their comparison groups. The reports of 2 of the prospective cohort studies explicitly controlled for this possibility, in one instance by excluding persons with a history of depression from their comparison group24 and in the other by characterizing patients as having early- or late-onset depression.21 Reports of the remaining 4 studies did not explicitly state how this issue had been dealt with. The OR for the 2 prospective cohort studies that accounted for personal history of depression was 1.58 (95% CI, 0.83-3.02) whereas the OR for the prospective studies that did not state how they accounted for a history of depression was 1.92 (95% CI, 1.34-2.74).
We evaluated the relation of study quality and observed risk through metaregression of study characteristic ratings and log ORs. This analysis was undertaken to further assess the possibility that the observed increase in risk of developing AD in depressed patients might be related to specific study characteristics not accounted for in stratified analyses. The OR for case-control studies adjusted in a metaregression analysis for the presence of quality characteristics from the Newcastle-Ottawa Scale (Table 2) was 4.14 (95% CI, 1.14-15.11; z = 2.16; P = .03). The similarly adjusted OR for cohort studies was 3.85 (95% CI, 1.88-7.89; z = 3.69; P<.001). In the regression model for case-control studies, none of the individual study characteristics was independently significantly related to the OR. In the combined regression of all cohort study variables on the OR, 1 study characteristic was significantly and negatively related to the predicted OR. This characteristic was the completeness with which study cohorts were followed up (eg, whether all participants who began the study were accounted for at the study's end; coefficient, −1.48; SE, −0.53; z = −2.79; P = .005). Correction of ORs for case-control and cohort studies for characteristics assessed by the Newcastle-Ottawa Scale, whether or not the characteristics were significant predictors of the log OR, did not change the inference that a history of depression may confer an increased risk for developing AD.
We completed influence analyses by recalculating pooled ORs for the sample on multiple occasions with 1 of the studies removed at each iteration. These analyses were especially important because several studies15,18 included samples that were substantially larger than most of the other studies and thus may have exerted large effects on overall effect estimates. For all studies, these analyses yielded ORs ranging from 1.93 (95% CI, 1.69-2.20) to 2.12 (95% CI, 1.94-2.31). Influence analyses stratified by study type yielded a range of ORs from 1.96 (95% CI, 1.52-2.52) to 2.07 (95% CI, 1.75-2.45) for case-control studies and from 1.81 (95% CI, 1.45-2.24) to 2.03 (95% CI, 1.71-2.41) for cohort studies.
As noted, 2 studies48,49 reported results from samples of twins in the form of ORs and CIs so that pooled estimates could be calculated. These results were judged potentially relevant but difficult to compare with other studies. The pooled OR for AD with a history of depression for these 2 studies was 2.53 (95% CI, 1.00-6.39; z = 1.97; P = .05).
Thirteen studies, including 2 case-control and 11 cohort studies, provided data on the interval between the diagnoses of depression and AD. Metaregression of the log OR for each study on the interval in years showed a positive and statistically significant relation (Table 3). Because whether a study was carried out retrospectively or prospectively might have been related to the length of the observed interval between the diagnoses of depression and AD, a second metaregression analysis was completed. This analysis showed that even after correction for this element of the study design, the relation between interval length and risk for AD continued to be significant and positive (coefficient, 0.003; SE, 0.001; z=2.01; P=.05). The interval between the diagnoses of depression and AD was thus positively related to increased risk for developing AD. We explored the possibility of a nonlinear relation between interval and risk in the metaregression via regression models that included fractional polynomial coefficients.56 In no case did adding coefficient terms of higher powers significantly improve fit, as assessed by change in the deviance statistic, of regression models when compared with the more parsimonious baseline linear coefficients model.
The purpose of this study was to investigate the relation of a history of depression to risk for subsequent development of AD. Results of this meta-analysis show that persons with a history of depression were more likely to be diagnosed as having AD later in life. This finding was robust across analyses stratified by study type, retrospective vs prospective data collection, and strictness of diagnostic criteria used for AD and depression. The ORs were still significantly greater than 1 when adjusted in separate metaregression analyses for case-control and cohort studies for 8 quality indicators from the Newcastle-Ottawa Scale. The ORs were also significantly greater than 1 in metaregression analyses that controlled for the interval between the diagnoses of depression and AD.
Influence analysis showed no substantial difference in pooled ORs when any single study was excluded. This was important because 1 cohort study18 contributed more than 65 000 individuals to the total sample of more than 100 000. Although heterogeneity was found across studies, all but 1 study examined found an increased (although sometimes nonsignificant) risk for AD in persons with a history of depression. Although significant heterogeneity was observed in effect sizes, it should be noted that 19 of the 20 studies used in these analyses yielded a positive relation of history of depression to risk for developing AD. Previous reports of negative findings may thus be attributable to failure to find statistical significance rather than reduced or no change in risk for AD in individuals with a history of depression.
A secondary purpose of this study was to evaluate whether observed risk for developing AD was related to the interval between diagnoses of depression and AD. This interval was positively and significantly related to the odds of developing AD in a metaregression analysis. This finding is consistent with the interpretation that occurrence of depression is a risk factor for AD rather than a prodrome of the disease, and it is also consistent with the results of 1 large study that specifically examined the issue.9 Given the small number of studies included in the metaregression, however, this interpretation must be tentative, but it further strengthens the observation that depression is a distinct risk factor for AD. It should be recognized as well, however, that these findings do not rule out the possibility that depression is both a remote risk factor for AD and a proximal prodromal feature of it.57
Limitations of this study should be acknowledged. These data relate only the occurrence of at least 1 episode of diagnosable depression to later risk for AD and neglect the fact that both depression and AD are heterogeneous entities that may reflect multiple underlying pathologies.58 Several studies used data on hospitalizations due to depression as an indicator of clinical depression whereas others recruited nonpatients and then examined them for depression on the basis of current symptoms. Groups of persons who were so severely depressed as to require hospitalization may not be equivalent to outpatients whose depression was discovered during study evaluations. Data used in the meta-analysis do not distinguish among the risk for AD in persons who may have had 1 episode of depression and recovered compared with those who have had multiple episodes or who may have had chronic minor depression. In this connection, 1 study59 found that risk for dementia increased with multiple episodes of depression. By choosing to include studies that allowed us to calculate crude ORs, we excluded studies that provided estimates of the relation between depression and AD risk in the form of adjusted ORs or on the basis of continuous measures. Exclusion of these studies may have biased our results.
These data do not provide information about why depression and AD may be linked. There is increasing awareness of the possible role of vascular disease in the expression of the clinical symptoms of AD, and it has been reported that AD and depression may share risk factors for vascular disease.60,61 One possible link may be long-term occurrence of inflammatory processes that may underlie depression and AD.62 Several proinflammatory cytokines (eg, tumor necrosis factor α) have been linked to both vascular disease and depression,63- 65 and the same cytokines may have direct effects on cognitive status.66 Possible genetic links between the 2 disorders have also been explored, most notably between apolipoprotein E ε4 and the disorders, because the ε4 allele is an established risk factor for AD.67 Research on the co-occurrence of apolipoprotein E in depression and AD has been inconclusive, with at least 1 study68 finding a cross-sectional relation between the ε4 genotype and depressive symptoms whereas other studies48,69 have not found a relation. Other authors61,70,71 have speculated on potential mechanisms for the link between mood disorder and AD. The findings of this meta-analysis and metaregression thus may have implications for those investigating the biochemical etiology of AD.
The clinical significance of the findings of the meta-analysis and metaregression should also be recognized. From one perspective, the absolute risk for AD conferred by a history of depression is small so that the importance of having a history of depression should not be overly emphasized in work with patients. On the other hand, however, depression may be viewed as a modifiable risk factor for AD if, in fact, depression is part of the same disease process that produces clinical AD, and its treatment will affect cognitive outcome. This possibility is intriguing in light of evidence that antidepressants can modify levels of inflammatory cytokines,72 but the possibility that treatment with antidepressants might reduce the risk for AD remains speculative without additional studies.
This meta-analysis and metaregression thus help clarify the relation between a history of depression and the subsequent risk of developing AD by drawing together data from numerous sources and exploring several possible reasons for the observed heterogeneity among studies. Results were found to be robust in several analyses stratified on study characteristics and in meta-analyses that controlled for study quality and explored the relation of the interval between the two diagnoses on risk for AD. These findings thus underscore the possible relation between the two disorders and the importance of continued research on the common and disparate factors in the etiology of depression and AD.
Correspondence: Raymond L. Ownby, MD, PhD, MBA, Department of Psychiatry and Behavioral Sciences, University of Miami Miller School of Medicine, MSMC Psychiatry, MRI Building 2nd Floor, 4300 Alton Rd, Miami Beach, FL 33140 (email@example.com).
Submitted for Publication: May 17, 2005; final revision received July 29, 2005; accepted September 15, 2005.
Funding/Support: This study was supported by grant K23 AG 19745 from the National Institute on Aging (Dr Ownby).
Acknowledgment: We thank Carmen Bou-Crick, MSLS, for her assistance in completing the electronic bibliographic searches reported herein.