Schjeide BM, Hooli B, Parkinson M, Hogan MF, DiVito J, Mullin K, Blacker D, Tanzi RE, Bertram L. GAB2 as an Alzheimer Disease Susceptibility GeneFollow-up of Genomewide Association Results. Arch Neurol. 2009;66(2):250-254. doi:10.1001/archneurol.2008.552
Genomewide association (GWA) studies have recently implicated 4 novel Alzheimer disease (AD) susceptibility loci (GAB2, GOLM1, and 2 uncharacterized loci to date on chromosomes 9p and 15q). To our knowledge, these findings have not been independently replicated.
To assess these GWA findings in 4 large data sets of families affected by AD.
Follow-up of genetic association findings in previous studies.
More than 4000 DNA samples from almost 1300 families affected with AD.
Main Outcome Measures
Genetic association analysis testing of 4 GWA signals (rs7101429 [GAB2], rs7019241 [GOLM1], rs10519262 [chromosome 15q], and rs9886784 [chromosome 9p]) using family-based methods.
In the combined analyses, only rs7101429 in GAB2 yielded significant evidence of association with the same allele as in the original GWA study (P =.002). The results are in agreement with recent meta-analyses of this and other GAB2 polymorphisms suggesting approximately a 30% decrease in risk for AD among carriers of the minor alleles. None of the other 3 tested loci showed consistent evidence for association with AD across the investigated data sets.
GAB2 contains genetic variants that may lead to a modest change in the risk for AD. Despite these promising results, more data from independent samples are needed to better evaluate the potential contribution of GAB2 to AD risk in the general population.
Although the heritability of late-onset Alzheimer disease (AD) is high,1,2 our knowledge of the underlying putative susceptibility genes remains incomplete. The only unequivocally established late-onset AD gene is APOE (107741) (encoding apolipoprotein E), whose ε4 allele increases the risk for AD by 4- to 15-fold in a dose-dependent manner.3,4APOE and most other genetic association findings in AD originate from candidate gene approaches (ie, studies that investigate certain genes based on a prior hypothesis regarding their potential involvement in pathogenesis).5 An alternative to this approach is afforded by recent advances in large-scale genotyping technologies enabling researchers to perform comprehensive unbiased genomewide association (GWA) analyses.6 To date, 3 groups have reported the results of AD GWA analyses.7- 9
The first group used a low-density design testing roughly 17 000 single-nucleotide polymorphisms (SNPs) in or near genetic coding regions in almost 4000 combined cases and controls, including several confirmed by autopsy.7 The only SNPs consistently associated with AD risk across different samples were located within or in proximity to APOE and most likely reflect linkage disequilibrium (LD) with the ε4 allele. Although several additional loci were highlighted as potential AD genes by the authors, none showed the same consistency of effect or level of statistical significance as the ε4-related variants.
The second group tested approximately 500 000 SNPs in roughly 1100 unrelated AD cases (all with neuropathologically confirmed diagnoses) and controls.8 Again, except for a single SNP in strong LD with APOE ε4, no other genomewide significant signals were observed. In a follow-up article,10 the same group reported evidence of an association between variants in GRB2-associated binding protein 2 (Gab2 [gene abbreviation, GAB2]) (OMIM 606203) on chromosome 11q14 and AD risk in the same 1100 neuropathologically confirmed persons and in approximately 360 clinically diagnosed AD cases and controls, but this was noted only in carriers of APOE ε4. The association was observed with 10 different SNPs in GAB2, all displaying high degrees of LD, indicating that they likely point to the same underlying signal.
Finally, the third group tested almost 500 000 markers in approximately 1500 clinically diagnosed AD cases and controls from Canada (replicated in approximately 670 AD cases and controls from the United Kingdom). They reported an association between variants in Golgi membrane protein 1 (encoded by GOLM1 [OMIM 606804], also known as GOLPH2) on chromosome 9q22 and 2 uncharacterized loci on chromosomes 9p and 15q.9 The findings from the 2 high-density GWA screens are summarized in Table 1.
In the present study, we assessed the 4 putative AD loci that emerged from 2 high-density 500 000 GWA screens.
The following 4 data sets tested in this project were initially collected for the study of genetic factors in AD: (1) Consortium on Alzheimer's Genetics (CAG),11 (2) National Institute on Aging (NIA [http://www.ncrad.org]), (3) National Institute of Mental Health (NIMH),12 and (4) National Cell Repository for Alzheimer's Disease (NCRAD [http://www.ncrad.org]). Characteristics of the samples are given in Table 2. All data sets represented primarily sibships and lacked parental genotypes. Except for the CAG data set, most pedigrees analyzed herein were nuclear families ascertained on the basis of multiple affected individuals. In addition to containing at least 1 affected relative pair, many pedigrees also had DNA samples available from additional affected or unaffected persons (mostly siblings). The diagnosis of definite, probable, or possible AD was made according to National Institute of Neurological and Communicative Disorders and Stroke–Alzheimer's Disease and Related Disorders Association criteria for affected individuals in all 4 data sets.12 For all 4 data sets, we included only families of white race/ethnicity and only families in which no affected person had an age at onset younger than 50 years. The NIMH families were collected as part of the NIMH Genetics Initiative.12 This data set is composed of 1528 persons from 457 families. Only families in which DNA samples were available from at least 2 affected family members were included in these analyses (1376 persons from 410 families). The NIA and NCRAD families were obtained from the NCRAD, and ascertainment and collection details can be found at the NCRAD Web site (http://www.ncrad.org). Our sampling rules yielded 1040 persons from 329 families for the NIA data set and 1108 persons from 331 families for NCRAD data set. The CAG families were recruited from multiple NIA-funded AD research centers under the auspices of the CAG.11 Probands were included only if they had at least 1 unaffected living sibling willing to participate in the study. This data set was composed of 483 persons from 215 families.
All genetic variants were genotyped on individually optimized single-base extension reactions detected by fluorescent polarization as previously described11 (genotyping details are available from the corresponding author). In GAB2, we chose a proxy (rs7101429) in lieu of the strongest GWA SNP (rs2373115) because this SNP showed better clustering in our preliminary genotyping. Both SNPs are in almost complete LD in persons of white race/ethnicity (r2 = 0.92 and D′ = 1.0 based on the most current genotype release from the International HapMap Project Web site [http://www.hapmap.org]) and can be used interchangeably for the purpose of association testing. Hardy-Weinberg equilibrium was determined separately for all markers in all 4 data sets using a computer program (Haploview, version 4.1; http://www.broad.mit.edu/mpg/haploview/).13
To test for association, we used statistical software (PBAT, version 3.6; http://www.biostat.harvard.edu/~clange/default.htm)14 using the same traits (affection status or age at onset) and genetic models as in the original GWA analyses (using an additive model as a substitute for the allelic tests). For GAB2, we also tested for association in families with at least 1 affected person carrying an APOE ε4 allele, similar to the stratification in the original article.10 All analyses were restricted to families of self-reported white race/ethnicity. To combine statistical evidence across the association analyses from each independent data set, we used the combined probability test by Fisher.15 Because the hypothesis of our study was to test for association in the same direction as that observed in the original articles, all P values are 1-tailed. P values were inversed (1 − P) when overtransmission to affected individuals was observed with the opposite allele compared with the original GWA finding. Odds ratios (ORs) were calculated by fitting a conditional logistic regression model to each data set, where family defines the stratum.16 Summary ORs were calculated for all tested variants using fixed-effects models, as no evidence of significant heterogeneity in the sample-specific ORs was observed across data sets. Results of power calculations (performed in PBAT) suggested that we had good to excellent (≥85%) power in the combined sample to detect the genetic effect sizes (Table 1) estimated in the original GWA studies8- 10 at α = .05 and a disease prevalence of 10% using the allele frequencies of the respective control populations.
Overall, genotyping efficiency for the 4 tested variants was 98.4%. The mean error rate (based on approximately 10% of the sample run in duplicate) was below 0.5%. None of the markers deviated significantly from Hardy-Weinberg equilibrium. The only variant to show nominally significant association when results were combined across all 4 data sets was rs7101429 in GAB2 (Table 3). Although the association was more pronounced in APOE ε4–positive families vs APOE ε4–negative families (similar to the original article10), the association was most significant in the unstratified families possibly because of increased power. The same allele was undertransmitted across all 4 data sets, although this association reached only nominal significance in 2 data sets (NIMH and NCRAD). The undertransmission of the minor allele is indicative of a “protective” effect (approximate summary OR across all 4 data sets, 0.76 [95% confidence interval, 0.62-0.94]) (Table 4). This is in line with the study by Reiman et al,10 who found an increased risk for AD associated with the opposite allele. However, the effect size estimated herein is about 1 order of magnitude smaller than that in the study by Reiman et al, possibly indicative of the “winner's curse” phenomenon.17 None of the other 3 tested SNPs (ie, those in GOLM1 and the 2 loci on chromosomes 9p and 15q) showed overall evidence of association across all data sets. Only the marker in GOLM1 (rs7019241) showed modest evidence for association in the NIMH data set (P =.09), whereas GWA_9p24 (rs9886784) showed nominal association in the CAG data set (P = .05) (Table 3).
To our knowledge, our study provides the first systematic assessment of genetic association findings that originated from high-density GWA analyses in AD. Although GWA studies represent a leap forward in the analysis of complex diseases such as AD, independent replication (as with earlier methods) remains the first step in distinguishing “real” from false-positive association findings.18 Based on the data generated in our study, the GAB2 association represents the most promising of the GWA findings in AD to date. In GWA data by Li et al,9 who used the same marker panel as Reiman et al,10 all GAB2 variants show nominally significant association on reanalysis of the publicly available genotype data (for more details, see the meta-analyses on the AlzGene Web site [http://www.alzgene.org]).5 One other study19 to date has investigated the potential association between GAB2 and AD risk in case-control samples from France and the United Kingdom. Although the authors reported an overall nonsignificant outcome, in 2 of 3 samples the allele and genotype distributions showed the same pattern as in the 2 GWA case-control studies9,10 and in our families with AD. Finally, based on systematic meta-analyses of all published genetic association studies in AD,5GAB2 effect sizes (OR range, 0.66-0.79 for protection [equivalent to 1.27-1.51 for risk for the opposite allele]) are among the strongest and most significant observed in any putative disease gene after APOE in the field of AD. Taken together, these findings suggest that GAB2 ranks among the top contenders for a second genuine AD susceptibility gene.
The lack of confirmation of the 4 GWA findings reported by Li et al9 in our sample could be related to several reasons, including low power (eg, because of overestimation of the effect size in the original study), locus or allelic heterogeneity (eg, because the risk effects exerted by some loci or alleles may be present in some populations but not in others), or LD with other as yet unidentified alleles (eg, because LD with the presumed risk alleles may be lower in our sample compared with that of Li et al, reducing our power to detect such effects). Alternatively, the lack of replication of the original findings—which is a common occurrence in genetic studies of complex diseases regardless of study design—could also be due to type I error (ie, false-positive results in the original study). More independent follow-up analyses using case-control and family-based designs are needed to more accurately assess these possibilities.
Gab2 is a member of a family of evolutionarily highly conserved scaffolding and adapter proteins that are involved in multiple signaling pathways and particularly in the transduction of cytokine and growth receptor signaling.20,21 Gab2 is ubiquitously expressed but is found at high levels in white blood cells, prefrontal cortex, and hypothalamus. Functionally, preliminary evidence reported in the original GWA article10 suggested that changes in Gab2 expression could potentially affect Gsk3-dependent phosphorylation of tau and the formation of neurofibrillary tangles. Moreover, growth factor receptor–bound protein 2, which binds Gab2, has been reported to bind tau,22 amyloid-β precursor protein,23 and presenilin 1 and 2.23 These interactions have been proposed to regulate signal transduction (eg, the ERK1/2 pathway).23 Therefore, GAB2 is a biologically plausible gene for involvement in the pathogenesis of AD.
In conclusion, results of this first systematic follow-up (to our knowledge) of recent high-density GWA analyses in AD suggest that GAB2 on chromosome 11q14, or a locus nearby, may harbor genetic variants that modulate the risk for AD. Additional independent replications and functional genetic analyses are warranted to elucidate the potential biochemical mechanisms and the epidemiologic relevance of this association.
Correspondence: Lars Bertram, MD, Genetics and Aging Research Unit, MassGeneral Institute for Neurodegenerative Disease, Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Massachusetts General Hospital, 114 16th St, Charlestown, MA 02129 (firstname.lastname@example.org).
Accepted for Publication: September 11, 2008.
Author Contributions:Study concept and design: Blacker, Tanzi, and Bertram. Acquisition of data: Schjeide, Hooli, Parkinson, Hogan, DiVito, Mullin, Blacker, Tanzi, and Bertram. Analysis and interpretation of data: Schjeide, Tanzi, and Bertram. Drafting of the manuscript: Schjeide and Bertram. Critical revision of the manuscript for important intellectual content: Schjeide, Hooli, Parkinson, Hogan, DiVito, Mullin, Blacker, Tanzi, and Bertram. Statistical analysis: Schjeide and Bertram. Obtained funding: Tanzi and Bertram. Administrative, technical, and material support: Schjeide, Mullin, and Bertram. Study supervision: Bertram.
Financial Disclosure: Drs Tanzi and Bertram are consultants to and equity holders in TorreyPines Therapeutics, Inc, and Prana Biotechnology, Ltd. Dr Tanzi is a cofounder of TorreyPines Therapeutics, Inc, and Prana Biotechnology, Ltd.
Funding/Support: This study was supported by grant 5R01AG23667 from the National Institute on Aging (Dr Bertram), by grant R37MH60009 from the National Institute of Mental Health (Dr Tanzi), and by grants from the Extendicare Foundation and the Cure Alzheimer's Fund (Dr Bertram). The AlzGene database project is funded by the Cure Alzheimer’s Fund (principal investigator Dr Bertram).
Additional Contributions: We thank the families for participating in the study.