Forest plot showing odds ratio estimates and 95% confidence intervals at p.E508K (squared boxes) from the 4 SIGMA studies, the SIGMA pooled mega-analysis, the replication studies, and the overall meta-analysis. Odds ratios for the meta-analyses are represented with a diamond. SIGMA mega-analysis represents the combined results from the 4 SIGMA studies. DMS indicates Diabetes in Mexico Study; MCDS, Mexico City Diabetes Study; MEC, Multiethnic Cohort; UIDS, Universidad Nacional Autónoma de México/Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán Diabetes Study; T2D-GENES, Type 2 Diabetes Genetic Exploration by Next-Generation Sequencing in Multi-Ethnic Samples. (a) Represents data from the current article.
The dimerization, DNA binding, and transactivation domains of the HNF-1A protein49-51 are highlighted. The position of the p.E508K mutation is shown as well as a common variant (p.I27L), MODY3 mutations studied (p.P112L, p.R229Q, p.P379fsdelCT, p.P447L, p.Q466X), and a rare variant associated with type 2 diabetes (p.M490T). The overlaid heat map illustrates how many of the amino acid residues of each HNF-1A domain have been reported to be mutated and hence to cause the monogenic diabetes form MODY3. Domain areas in red have a higher concentration of reported mutations than areas in orange and green. Pseudo POU indicates protein domain that includes short sequence motifs similar to regions in the POU family of transcriptional activators; Homeo, protein homeodomain that binds DNA in a sequence-specific manner.
HeLa cells were transiently transfected with nonmutant or mutant HNF1A plasmids and reporter plasmids pGL3-RA and pRL-SV40. Measurements are given in fold activity relative to wild-type. Each point represents the mean (error bars indicate 95% CIs) of 9 readings. TA indicates variants that affect the transactivation domain; DNAbind, the DNA binding domain; and pcDNA3.1, the empty pcDNA3.1 vector. All values were P < .05 compared with wild-type activity.
Xpress-epitope-tagged wild-type and p.E508K mutant proteins were incubated with a radiolabeled DNA fragment containing the HNF-1A-binding site in the rat albumin promoter. A, Two HNF-1A mutants (p.P112L and p.R229Q) with impaired DNA-binding were included as negative controls. Addition of the anti-Xpress antibody induced a supershift (a reduction in mobility of protein-DNA complex due to antibody binding, relative to protein-DNA complex alone) for the DNA-protein complexes, confirming the identity of HNF-1A within the complexes. B, A competition assay was performed by adding increasing amounts (0x, 10x, 50x, or 100x) of unlabeled DNA fragment, confirming the identity of the radiolabeled probe.
Cells were transfected for 48 hours and Xpress-epitope-tagged HNF-1A proteins detected with anti-Xpress antibody and Alexa488 (green). DNA staining (DAPI) is shown in blue. A previously reported HNF-1A mutant, p.Q466X, with impaired nuclear localization was included as a control. For the purpose of clarity, the nuclei have been marked with a solid white line. To illustrate cytosolic accumulation, the cell membrane has been marked with a dotted white line for mutants p.E508K and p.Q466X.
The scatterplot shows the age of onset and the body mass index (BMI) for each p.E508K carrier (filled circle) with type 2 diabetes in the discovery studies with data on age of onset and BMI available (n = 29). The vertical and horizontal lines represent classical thresholds for the clinical diagnosis of MODY3 (age of onset <25 years and BMI <25). Histograms showing distributions of BMI and age of diabetes onset in 1274 SIGMA discovery cohort participants (p.E508K carriers and noncarriers with type 2 diabetes) are shown on the left and below the scatterplot. In the box-and-whisker plots, the central horizontal line indicates median, with box extremes indicating the first and third quartiles. The whiskers indicate maximum and minimum values after removal of outliers (unfilled circles).
eMethods. Supplementary methods
eFigure 1. Principal component analysis including parental samples
eFigure 2. Proportions of Native American ancestry
eFigure 3. Principal component analysis of exome-sequenced samples in SIGMA
eFigure 4. Quantile-Quantile plot of observed vs expected test statistics
eFigure 5. Regional association plot of the HNF1A locus
eFigure 6. Transactivation and subcellular localization experiments
eTable 1. Study descriptives of replication studies
eTable 2. Functional annotation of variants identified in the discovery cohort
eTable 3. Intersection of known and novel variants ascertained by the SIGMA T2D exome project
eTable 4. Discovery stage results for all significant markers in the exome
eTable 5. Local ancestry results near the HNF1A p.E508K variant
eTable 6. Top burden association tests results for non-synonymous variants with MAF < 1%
eTable 7. Top burden tests for loss of function variants with MAF<1%
eTable 8. Gene-set association test results for non-synonymous variants with MAF < 1%
eTable 9. Gene-set association test results for loss of function variants with MAF < 1%
Association of a Low-Frequency Variant in HNF1A With Type 2 Diabetes in a Latino Population. JAMA. 2014;311(22):2305–2314. doi:10.1001/jama.2014.6511
Copyright 2014 American Medical Association. All Rights Reserved. Applicable FARS/DFARS Restrictions Apply to Government Use.
Latino populations have one of the highest prevalences of type 2 diabetes worldwide.
To investigate the association between rare protein-coding genetic variants and prevalence of type 2 diabetes in a large Latino population and to explore potential molecular and physiological mechanisms for the observed relationships.
Design, Setting, and Participants
Whole-exome sequencing was performed on DNA samples from 3756 Mexican and US Latino individuals (1794 with type 2 diabetes and 1962 without diabetes) recruited from 1993 to 2013. One variant was further tested for allele frequency and association with type 2 diabetes in large multiethnic data sets of 14 276 participants and characterized in experimental assays.
Main Outcome and Measures
Prevalence of type 2 diabetes. Secondary outcomes included age of onset, body mass index, and effect on protein function.
A single rare missense variant (c.1522G>A [p.E508K]) was associated with type 2 diabetes prevalence (odds ratio [OR], 5.48; 95% CI, 2.83-10.61; P = 4.4 × 10−7) in hepatocyte nuclear factor 1-α (HNF1A), the gene responsible for maturity onset diabetes of the young type 3 (MODY3). This variant was observed in 0.36% of participants without type 2 diabetes and 2.1% of participants with it. In multiethnic replication data sets, the p.E508K variant was seen only in Latino patients (n = 1443 with type 2 diabetes and 1673 without it) and was associated with type 2 diabetes (OR, 4.16; 95% CI, 1.75-9.92; P = .0013). In experimental assays, HNF-1A protein encoding the p.E508K mutant demonstrated reduced transactivation activity of its target promoter compared with a wild-type protein. In our data, carriers and noncarriers of the p.E508K mutation with type 2 diabetes had no significant differences in compared clinical characteristics, including age at onset. The mean (SD) age for carriers was 45.3 years (11.2) vs 47.5 years (11.5) for noncarriers (P = .49) and the mean (SD) BMI for carriers was 28.2 (5.5) vs 29.3 (5.3) for noncarriers (P = .19).
Conclusions and Relevance
Using whole-exome sequencing, we identified a single low-frequency variant in the MODY3-causing gene HNF1A that is associated with type 2 diabetes in Latino populations and may affect protein function. This finding may have implications for screening and therapeutic modification in this population, but additional studies are required.
The estimated prevalence of type 2 diabetes in Mexican adults was 14.4% in 2006,1 making it one of the leading causes of death in Mexico.2 Based on statistics from 1999-2002, the standardized prevalence of diagnosed diabetes was 10% in Mexican Americans and 5.2% in whites.3 Although environmental factors such as lifestyle and diet likely explain the majority of this health disparity, it was recently found that genetic variants in the gene SLC16A11 (NCBI NC_000017.11) were associated with higher rates of type 2 diabetes in Latinos.4 Latinos, defined as persons who trace their origin to Central and South America, and other Spanish cultures, fall on a continuum of Native American and European genetic ancestry.4 Identifying genetic factors associated with type 2 diabetes in Latino populations could increase understanding of its pathophysiology, improve risk prediction, and focus treatment choice based on knowledge of the underlying biology of the disease.
Type 2 diabetes is typically diagnosed after age 40 years, is caused by the combined action of genetic susceptibility and environmental factors, is associated with obesity, and is polygenic. Genome-wide association studies for typical type 2 diabetes forms have identified more than 70 distinct genetic loci carrying common variants that are associated with modest differences in prevalence of the disease.5-7 Because these common variants explain a small fraction of the estimated heritability, it is hypothesized that low-frequency or rare variants of strong effects, not captured by genome-wide association studies but amenable to sequencing approaches, contribute meaningfully to the genetic architecture of the disease. To date, low-frequency variants with near-complete penetrance have not been found in whole-exome sequencing studies of type 2 diabetes,8,9 although a recent whole-genome sequencing study found rare variants associated with type 2 diabetes prevalence in an Icelandic population.10
To explore the association of rare protein-coding genetic variants with type 2 diabetes in the Latino population, we performed whole-exome sequencing (which captures both common and rare genetic variants in the protein-coding regions of genes) on case-control studies composed of individuals of Mexican or another Latino ancestry, with replication in a separate multiethnic data set.
This study was performed as part of the Slim Initiative in Genomic Medicine for the Americas (SIGMA) Type 2 Diabetes Consortium, whose goal is to characterize the genetic basis of type 2 diabetes in Mexican and Latin American populations drawn from 4 studies4,11-13 (Table 1; details of these studies are provided in the Supplement). All participants had either Mexican or other Latino ancestry based on self-report and verification using principal component analysis of genotype data. Replication studies included individuals from a multiethnic study (Type 2 Diabetes Genetic Exploration by Next-Generation Sequencing in Multi-Ethnic Samples [T2D-GENES] and Genetics of T2D [GoT2D]) and an ongoing collection of Mexican participants from 18 indigenous groups for genetic studies (Diabetes in Mexico Study 2 [DMS2]) (eTable 1; details of these studies are provided in the Supplement). Diagnosis of type 2 diabetes followed the American Diabetes Association criteria. Each participant provided written informed consent for genetic investigation. All contributing studies were approved by their respective local ethics committees.
In total, 3862 samples were selected for whole-exome sequencing from a larger data set of 8214 samples previously genotyped with the OMNI 2.5 array (Illumina).4 To increase representation of genetic variation not queried in studies of European populations, selection criteria for whole-exome sequencing were based on the proportion of Native American ancestry estimated from principal component analysis of genotype data (eMethods section and eFigures 1 and 2 in the Supplement). Whole-exome sequencing was performed on blood DNA from these samples using Sure-Select Human All Exon v2.0 (Agilent Technologies), 44-Mb–baited target. Raw reads were mapped with the Burrows-Wheeler Aligner and reprocessed with Picard to recalibrate base quality scores and perform local realignment around known indels. Genetic variants were called with the Genome Analysis Toolkit Unified Genotyper module14 and were filtered to remove likely artifacts using several quality-control metrics such as mean coverage, concordance of nonreference genotypes with array data, and missing rate as specified in the eMethods section in the Supplement. Independent replication was sought in whole-exome sequence data from the T2D-GENES and GoT2D projects, which together sequenced 13 098 individuals from 5 ethnic groups (Europeans, East Asians, African Americans, South Asians, and Latinos).
We used the liability threshold model, which models participants as having an unobserved continuous phenotype called liability.15 We computed the residual value of the liability after accounting for the part that can be predicted by each participant's age and body mass index (BMI) using LTSOFT software (http://www.hsph.harvard.edu/alkes-price/software).16 Significance was evaluated with the residual liabilities as outcome using an expedited mixed linear model,17 which adjusts for sex, ancestry (eFigure 3 in the Supplement), and relatedness via a variance-component matrix with 2-sided tests. Odds ratios (ORs) were estimated using logistic regression models on type 2 diabetes status adjusting for age, BMI, and ancestry as specified in the eMethods section in the Supplement. The experiment-wide statistical significance threshold was set to P < 5 × 10−8 to adjust for the number of variants evaluated. In addition to single-variant testing, the sequence kernel association test18 and collapsing tests19 were used to test whether genes and groups of genes were associated with disease susceptibility via aggregation of rare variants.
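The liability threshold model described above can be illustrated with a minimal sketch. It assumes a standard normal liability and uses the 14.4% prevalence quoted in the introduction purely as an illustrative input; the function names are hypothetical, and the sketch does not reproduce LTSOFT, which additionally conditions the liability on age and BMI.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # assumption: liability follows a standard normal


def liability_threshold(prevalence: float) -> float:
    """Threshold t such that P(liability > t) equals the disease prevalence."""
    return STD_NORMAL.inv_cdf(1.0 - prevalence)


def mean_liability(prevalence: float) -> tuple[float, float]:
    """Expected liability of cases and of controls (truncated-normal means)."""
    t = liability_threshold(prevalence)
    density = STD_NORMAL.pdf(t)
    case_mean = density / prevalence              # E[L | L > t]
    control_mean = -density / (1.0 - prevalence)  # E[L | L <= t]
    return case_mean, control_mean


K = 0.144  # illustrative prevalence only, not the value used by the authors
t = liability_threshold(K)
case_mean, control_mean = mean_liability(K)
print(f"threshold = {t:.3f}")
print(f"mean liability: cases = {case_mean:.3f}, controls = {control_mean:.3f}")
```

In this simplified version every case receives the same expected liability; adjusting for covariates, as the study did, shifts the threshold per participant so that, for example, a lean young case carries a larger residual liability than an older obese case.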
Results of all functional experiments are expressed as means (SDs), and experiments were performed on at least 3 independent occasions unless otherwise specified. Statistical analyses were performed using the 2-tailed t test, and P <.05 was considered significant for these functional studies.
Details of functional studies are specified in the eMethods section in the Supplement. The human liver hepatocyte nuclear factor 1α (HNF1A) complementary DNA in expression vector pcDNA3.1/HisC (NCBI Entrez Gene BC104910.1) was used for all cell studies.20 Firefly luciferase reporter vectors (pGL3) included promoter sequences for the rat albumin (pGL3-RA), human HNF4A (NCBI Entrez Gene 3172) P2 (pGL3-HNF4AP2), and mouse Glut2 (pGL3-GLUT2) genes. Renilla luciferase reporter construct pRL-SV40 (GenBank AF025845.2) was used as an internal control. The HNF-1A mutants were made using the QuikChange Site-Directed XL Mutagenesis Kit (Stratagene). HeLa cells and MIN6 β-cells were grown as previously described,20,21 and transfected according to manufacturers’ recommendations using the Metafectene Pro (Biontex-USA) or Lipofectamine 2000 (Life Technologies), respectively.
Transcriptional activity was measured 24 hours after transfection using the Dual-Luciferase Reporter Assay System (Promega Biotech) on a Chameleon luminometer (Hidex). To measure HNF-1A protein levels, transfected HeLa cells were lysed in passive lysis buffer (Promega Biotech) and proteins were analyzed (from 2.5 µg of total protein) by SDS-PAGE and immunoblotting using an HNF-1A-tag (anti-Xpress antibody, Life Technologies).
The HNF-1A protein was produced in a coupled in vitro transcription/translation System (TnT-T7, Promega Biotech). The level of binding of HNF-1A proteins to a radiolabeled rat albumin oligonucleotide was investigated by electrophoretic mobility shift assays as previously described.22
Analysis of nuclear vs cytosol localization of HNF-1A proteins was performed in 500 cells using an HNF-1A-tag (anti-Xpress antibody) and Alexa Fluor 488 (Life Technologies) essentially as reported previously.20
Demographic and clinical characteristics of the 3756 participants in the discovery cohort are shown in Table 1. Only 2% of type 2 diabetes cases had onset before 25 years, and 81% of them were overweight or obese (BMI >25, calculated as weight in kilograms divided by height in meters squared).
Our hybrid selection libraries covered 76% of sequenced targets at 20x depth of coverage with a mean of 67.17x. The concordance of nonreference genotypes between the sequence data and the array data was 0.995. After quality control of sequence data, 1 190 196 variants were observed in the whole-exome sequencing data of 3756 samples (1794 type 2 diabetes cases and 1962 controls; eTable 2 in the Supplement). Of these, 264 995 variants were observed in at least 2 of our samples but absent in the 1000 Genomes Project23 and the Exome Sequencing Project24 (eTable 3 in the Supplement).
In our single-variant association analyses, a cluster of linked common missense variants in SLC16A11 was consistently associated with type 2 diabetes prevalence (P = 2.08 × 10−10) as had been previously reported in genome-wide association studies by the SIGMA T2D Consortium and others (eFigure 4A and eTable 4 in the Supplement).4,25
Among variants with minor allele frequency of less than 5%, a single missense variant departed from the null distribution (eFigure 4B in the Supplement). This variant encoded an NCBI NP_000536.5:p.E508K (p.E508K) substitution (NCBI NC_000012.12:c.1522G>A; chr12:121437091_G>A) in exon 8 of HNF1A, the gene responsible for the maturity onset diabetes of the young type 3 (MODY3) subtype of MODY (Mendelian Inheritance in Man No. 142410). The p.E508K variant was observed in 37 type 2 diabetes cases (1 in homozygous form) and in 7 participants without diabetes (OR, 5.48; 95% CI, 2.83-10.61; P = 4.4 × 10−7; Figure 1 and Figure 2 and eFigure 5 in the Supplement).
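As a rough check on these figures, the carrier counts can be turned into an unadjusted odds ratio. The sketch below uses the 1794 cases and 1962 controls of the discovery cohort with a Woolf (log-scale) confidence interval; the function name is hypothetical. Because it ignores the age, BMI, and ancestry adjustment used for the reported estimate, the crude value is expected to differ somewhat from the reported OR of 5.48.

```python
import math


def crude_odds_ratio(case_carriers, cases, control_carriers, controls, z=1.96):
    """Unadjusted carrier odds ratio with a Woolf 95% confidence interval."""
    a = case_carriers                # carriers with type 2 diabetes
    b = cases - case_carriers        # noncarriers with type 2 diabetes
    c = control_carriers             # carriers without diabetes
    d = controls - control_carriers  # noncarriers without diabetes
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    log_or = math.log(odds_ratio)
    lower = math.exp(log_or - z * se_log_or)
    upper = math.exp(log_or + z * se_log_or)
    return odds_ratio, lower, upper


or_crude, ci_low, ci_high = crude_odds_ratio(37, 1794, 7, 1962)
print(f"carrier frequency: cases {37 / 1794:.2%}, controls {7 / 1962:.2%}")
print(f"crude OR = {or_crude:.2f} (95% CI {ci_low:.2f}-{ci_high:.2f})")
```

The carrier frequencies printed here correspond to the 2.1% and 0.36% quoted in the abstract.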
In our replication effort, the p.E508K variant was found in the T2D-GENES Latino group26,27 but entirely absent in all other populations, showing a nominally significant association with increased prevalence for type 2 diabetes (7 affected carriers and 1 nonaffected carrier; OR, 5.61; 95% CI, 1.34-23.49; P = .0013). After de novo genotyping 1178 additional Mexican self-identified indigenous individuals (DMS2, further details are provided in the Supplement), we observed 8 affected carriers and 4 nonaffected carriers (OR, 3.50; 95% CI, 1.17-10.44; P = .0183). Combined, the 2 replication studies identified 15 affected carriers and 5 nonaffected controls (OR, 4.16; 95% CI, 1.75-9.92; P = .0013). Combining all available data yielded 52 affected carriers and 12 nonaffected controls (OR, 4.96; 95% CI, 2.93-8.38; an experiment-wide P = 2.39 × 10−9; Figure 1).
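One common way to combine study-level estimates like these is a fixed-effect, inverse-variance meta-analysis on the log odds ratio scale. The sketch below assumes that approach (the text does not state which method produced the combined replication estimate) and recovers each standard error from the reported 95% CI; the function names are hypothetical.

```python
import math

Z_95 = 1.96  # normal quantile for a 2-sided 95% interval


def log_or_and_se(odds_ratio, ci_low, ci_high):
    """Log odds ratio and its standard error recovered from a 95% CI."""
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * Z_95)
    return math.log(odds_ratio), se


def fixed_effect_meta(studies):
    """Inverse-variance pooled OR and 95% CI from (OR, low, high) tuples."""
    weighted_sum = 0.0
    total_weight = 0.0
    for odds_ratio, ci_low, ci_high in studies:
        log_or, se = log_or_and_se(odds_ratio, ci_low, ci_high)
        weight = 1.0 / se ** 2
        weighted_sum += weight * log_or
        total_weight += weight
    pooled = weighted_sum / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return (
        math.exp(pooled),
        math.exp(pooled - Z_95 * pooled_se),
        math.exp(pooled + Z_95 * pooled_se),
    )


replication = [
    (5.61, 1.34, 23.49),  # T2D-GENES Latino group, as reported above
    (3.50, 1.17, 10.44),  # DMS2, as reported above
]
pooled_or, pooled_low, pooled_high = fixed_effect_meta(replication)
print(f"pooled OR = {pooled_or:.2f} (95% CI {pooled_low:.2f}-{pooled_high:.2f})")
```

Under these assumptions the pooled estimate lands close to the combined replication OR of 4.16 (95% CI, 1.75-9.92) reported above, with small differences attributable to rounding of the published intervals.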
We found no evidence for p.E508K in the 1092 samples of the 1000 Genomes Project,23 the 6503 samples in the Exome Sequencing Project24 or in 11 160 non-Latino samples in the T2D-GENES and GoT2D data sets. Analysis of local ancestry in our data indicates that all p.E508K carriers in our studies carry at least 1 segment of inferred Native American ancestry (eTable 5 in the Supplement).
In group tests that included combinations of rare (MAF <1%) nonsynonymous variants, loss-of-function variants, or both in up to 15 469 genes (eTables 6 and 7 in the Supplement), we found no significant associations after removing the effect of the HNF1A p.E508K variant. The aggregated effect of these potentially functional variants in 2 gene-sets of 13 MODY genes and 70 previously implicated type 2 diabetes genes was similarly negative after removing the effect of the HNF1A p.E508K variant (eTables 8 and 9 in the Supplement).
Mutations in HNF1A that cause MODY diabetes alter protein function through reduced transactivation, decreased binding to DNA, or disrupted nuclear localization.20 Because p.E508K is located in the HNF-1A transactivation domain, we investigated its effect on transactivation using a reporter construct assay in HeLa cells. Protein carrying p.E508K was compared with a wild-type HNF-1A variant as well as 4 other HNF-1A variants in the DNA-binding or transactivation domains: p.M490T, which has been observed in 1 patient with type 2 diabetes,28 and 3 mutations (p.P447L, p.P379fsdelCT, and p.R229Q) previously identified in patients with MODY3.29 The p.E508K mutant demonstrated lower transcriptional activity on the HNF-1A-responsive rat albumin promoter than wild-type HNF-1A (P < .0001) or p.M490T. However, the 3 MODY3 mutants showed greater reductions in transactivation (Figure 3). Similar reductions in p.E508K transcriptional activation were found in MIN6 cells (eFigure 6A in the Supplement), and using 2 different reporter constructs (GLUT2 and HNF4A promoters; eFigure 6B in the Supplement). The p.E508K mutant protein bound to an HNF-1A binding site-containing oligonucleotide with equal affinity to the wild-type protein (Figure 4 and eFigure 6C in the Supplement), whereas 2 MODY3-associated mutants with mutations in the DNA-binding domain, p.P112L and p.R229Q, demonstrated impaired DNA binding (Figure 4).20
Compared with wild-type HNF-1A, the p.E508K mutant demonstrated slightly impaired nuclear targeting, with an increased proportion of cells displaying both cytosolic and nuclear staining. The shift in nuclear localization was less than that observed using the cytosol-retained HNF-1A mutant p.Q466X (Figure 5 and eFigure 6D in the Supplement). Expression of the p.E508K protein was 47.5% lower than that of wild-type HNF-1A (P = 1.03×10−5; eFigure 6E in the Supplement).
When comparing p.E508K carriers with noncarriers among the 3756 participants in our study, we did not observe statistically significant differences in the mean (SD) age of diabetes onset, 45.3 (11.2) years vs 47.5 (11.5) years, P = .49; BMI, 28.2 (5.5) vs 29.3 (5.3), P = .19; waist circumference in men, 92.9 (7.0) cm vs 99.3 (11.0) cm, P = .14, or women, 98.0 (13.9) cm vs 99.7 (13.9) cm, P = .64; or fasting glucose levels, 176.5 (84.6) mg/dL vs 165.7 (75.6) mg/dL, P = .43 (to convert fasting glucose from mg/dL to mmol/L, multiply by 0.0555; Table 2 and Figure 6).
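The unit conversion given in the parenthetical can be applied directly to the reported fasting glucose means. This is a small illustrative sketch; the constant and function names are hypothetical, and only the factor 0.0555 comes from the text.

```python
MGDL_TO_MMOLL = 0.0555  # conversion factor stated in the text


def glucose_mgdl_to_mmoll(value_mgdl: float) -> float:
    """Convert a fasting glucose value from mg/dL to mmol/L."""
    return value_mgdl * MGDL_TO_MMOLL


carriers_mmoll = glucose_mgdl_to_mmoll(176.5)     # mean in p.E508K carriers
noncarriers_mmoll = glucose_mgdl_to_mmoll(165.7)  # mean in noncarriers
print(f"carriers: {carriers_mmoll:.1f} mmol/L")
print(f"noncarriers: {noncarriers_mmoll:.1f} mmol/L")
```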
We performed whole-exome sequencing in 3756 individuals of Mexican and Mexican American ancestry and performed an exome-wide search for low-frequency and rare variants associated with type 2 diabetes. The only rare variant with a significant association with type 2 diabetes prevalence was the p.E508K variant in HNF1A, the gene responsible for MODY3. The effect size of the variant (OR, 4.96; 95% CI, 2.93-8.38) was the largest observed to date for any diabetes variant with a frequency more than 1 in 1000. This association was replicated in 2 independent cohorts of Latinos and Mexicans with an OR of similar magnitude. We also demonstrated, using transiently transfected cell models, reduced levels of transactivation activity for p.E508K compared with wild-type HNF-1A. As shown in binding assays, this reduction in activity was not driven by differences in DNA-binding affinity but may be attributable to reduced protein expression and altered nuclear localization of the mutant protein.
MODY is a monogenic cause of diabetes, which usually manifests at earlier ages (<25 years) and presents in nonobese patients.30 Each MODY family carries a rare coding mutation in 1 of 13 genes that has an autosomal dominant pattern of transmission.30 Mutations in the known MODY genes are thought to explain between 0.18% and 1.8% of all type 2 diabetes cases.31-34
The p.E508K variant has been reported in 2 published articles,35,36 both reporting on individuals with MODY. In 1 case, a family member had early-onset diabetes (age 17 years) and carried both HNF1A p.E508K and a mutation in HNF4A, p.R80Q. The father from whom p.E508K was inherited was diagnosed with type 2 diabetes at age 57 years.35,36 The finding of these variants in patients with MODY suggested that they might be high-penetrance alleles. Our study in large populations without ascertainment bias for early onset showed that p.E508K was associated with a 5-fold increase in prevalence, but incomplete penetrance. Moreover, in our study, carriers of p.E508K did not show early onset of type 2 diabetes, were indistinguishable from the wider type 2 diabetes population in adiposity or glycemia, and thus did not fulfill classical MODY3 diagnostic criteria (Table 2, Figure 6). These data are consistent with the possibility that p.E508K is a weaker allele than some other MODY3 mutations and that ascertainment bias may have led to overestimation of the effects of this and other MODY mutations, as suggested previously.28
A private mutation (G319S) in HNF1A has been found in Oji-Cree populations associated with early-onset type 2 diabetes.37 Also, a very rare frameshift deletion in HNF1A, 290fsdelC, was recently associated with MODY and type 2 diabetes in the Icelandic population.10,38
Our study surveyed variants across the majority of protein-coding exons in a sizable population, providing the highest-resolution scan to date of the contribution of protein-coding genetic variation to type 2 diabetes. Our study had 80% power to detect variants with the OR and carrier frequency of p.E508K (5-fold and 1% in the population). For variants of higher frequency, our power was sufficient to detect a smaller effect (80% power for variants with frequency >2% and OR>3.3). We performed both single-variant analysis and burden tests that combined rare variants in each gene. Only 1 rare coding variant and 1 gene showed significant association with type 2 diabetes prevalence. These data suggest that low-frequency variants in coding regions explain only a small fraction of the heritability of type 2 diabetes.
Our study has limitations. Current exome-capture methods are imperfect. Additional low-frequency variants associated with type 2 diabetes might have been missed due to incomplete coverage of all human exons, and, by design, this technology does not detect variants in the noncoding majority of the genome. Although a 2% frequency of p.E508K among type 2 diabetes cases could translate into more than 100 000 carriers in Mexico alone, this number is still far from explaining the expected overall genetic contribution to type 2 diabetes. Although our study represents the largest published exome-based survey of type 2 diabetes to date, larger sample sizes will be needed to perform an adequately powered survey of variants at frequencies lower than 1%.39,40
The current study and a recent publication reporting an association of common variants in SLC16A11 with type 2 diabetes in Latinos4 demonstrate the value of studying diverse populations. The HNF1A p.E508K variant has not been reported in other whole-exome sequencing or candidate gene association studies for type 2 diabetes of European9,10,41 and Asian42-45 ancestry. We surveyed a total of 25 663 exomes in this study, both from our own study and collaborating consortia. The p.E508K variant was identified only in individuals from Mexico or in Latinos from the southern United States, indicating that this variant is only found at appreciable frequency in a tightly restricted subset of human populations. Further studies will be required to characterize the fine-scale geographic distribution of p.E508K and its association with type 2 diabetes prevalence in other Latino populations. Our results emphasize that systematic discovery of the genetic determinants of complex disease, especially for rare variants, will require surveys across a wide range of human populations.
The association of the p.E508K variant with type 2 diabetes prevalence in the Latino population has potential clinical implications. First, approximately 4 in a thousand people in Latino populations carry p.E508K, and these individuals have a 5-fold increase in prevalence for type 2 diabetes (2.1% in cases, 0.36% in controls). Second, it is known that patients with MODY3 are sensitive to sulfonylureas,46 experiencing improved metabolic control on sulfonylurea therapy compared with insulin,47 in addition to improved quality of life due to reduced injections and capillary glucose measurements. Also, these patients have a 5-fold higher response to the sulfonylurea gliclazide than to metformin, which is the first-line drug of choice for the treatment of type 2 diabetes.48 If this were shown to be the case for carriers of p.E508K, it could motivate choice of sulfonylurea therapy for the estimated 2% of all Latino patients with type 2 diabetes who carry this variant. Because this response may be dependent on additional genetic or environmental factors, further studies are needed to determine whether metformin or a sulfonylurea should be the first line of treatment in these patients.
Using whole-exome sequencing, we identified a single low-frequency missense variant (p.E508K) in HNF1A, the gene responsible for a monogenic, early-onset form of diabetes (MODY3), that was associated with type 2 diabetes prevalence in general populations of Latinos. This rare variant was associated with a 5-fold increase in the prevalence of type 2 diabetes, but it was not associated with an early-onset form of diabetes, and, in our data, affected carriers were clinically indistinguishable from the wider type 2 diabetes population. In vitro, p.E508K negatively affected transcriptional activation, protein expression, and nuclear localization. Further research is warranted to evaluate the clinical relevance of these findings, including the benefits of selective population screening and the choice of genotype-guided therapeutic regimens.
Corresponding Author: Jose C. Florez, MD, PhD, Center for Human Genetic Research, Diabetes Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA 02114 (email@example.com).
Authors: The following investigators of the SIGMA Type 2 Diabetes Consortium take authorship responsibility for the study results: Karol Estrada, PhD; Ingvild Aukrust, PhD; Lise Bjørkhaug, PhD; Noël P. Burtt, PhD; Josep M. Mercader, PhD; Humberto García-Ortiz, PhD; Alicia Huerta-Chagoya, MSc; Hortensia Moreno-Macías, PhD; Geoffrey Walford, MD; Jason Flannick, PhD; Amy L. Williams, PhD; María J. Gómez-Vázquez, BSc; Juan C. Fernandez-Lopez, MSc; Angélica Martínez-Hernández, PhD; Silvia Jiménez-Morales, PhD; Federico Centeno-Cruz, PhD; Elvia Mendoza-Caamal, MD; Cristina Revilla-Monsalve, PhD; Sergio Islas-Andrade, MD, PhD; Emilio J. Córdova, PhD; Xavier Soberón, PhD; María E. González-Villalpando, MD; E. Henderson, MD; Lynne R. Wilkens, DrPH; Loic Le Marchand, MD, PhD; Olimpia Arellano-Campos, MD, PhD; Maria L. Ordóñez-Sánchez, BSc; Maribel Rodríguez-Torres, BSc; Rosario Rodríguez-Guillén, MSc; Laura Riba, MSc; Laeya A. Najmi, MSc; Suzanne B.R. Jacobs, PhD; Timothy Fennell, BSc; Stacey Gabriel, PhD; Pierre Fontanillas, PhD; Craig L. Hanis, PhD; Donna M. Lehman, PhD; Christopher P. Jenkinson, PhD; Hanna E. Abboud, MD; Graeme I. Bell, PhD; Maria L. Cortes, PhD; Michael Boehnke, PhD; Clicerio González-Villalpando, MD; Lorena Orozco, MD, PhD; Christopher A. Haiman, ScD; Teresa Tusié-Luna, MD, PhD; Carlos A. Aguilar-Salinas, MD, PhD; David Altshuler, MD, PhD; Pål R. Njølstad, MD, PhD; Jose C. Florez, MD, PhD; Daniel G. MacArthur, PhD.
Affiliations of Authors: Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts (Estrada, Burtt, Mercader, Flannick, Williams, Jacobs, Fontanillas, Altshuler, Florez, MacArthur); Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston (Estrada); Department of Medicine, Harvard Medical School, Boston, Massachusetts (Estrada, Walford, Altshuler, Florez, MacArthur); KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Bergen, Norway (Aukrust, Bjørkhaug, Najmi, Njølstad); Department of Pediatrics, Haukeland University Hospital, Bergen, Norway (Bjørkhaug, Njølstad); Department of Biomedicine, University of Bergen, Bergen, Norway (Aukrust); Center for Human Genetic Research and Diabetes Research Center (Diabetes Unit), Massachusetts General Hospital, Boston (Mercader, Walford, Altshuler, Florez); Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona Supercomputing Center, Barcelona, Spain (Mercader); Instituto Nacional de Medicina Genómica, Tlalpan, Mexico City, Mexico (García-Ortiz, Fernandez-Lopez, Martínez-Hernández, Jiménez-Morales, Centeno-Cruz, Mendoza-Caamal, Córdova, Soberón, Orozco); Instituto de Investigaciones Biomédicas, UNAM Unidad de Biología Molecular y Medicina Genómica, UNAM/INCMNSZ, Coyoacán, Mexico City, Mexico (Huerta-Chagoya, Riba, Tusié-Luna); Universidad Autónoma Metropolitana, Tlalpan, Mexico City, Mexico (Moreno-Macías); Centro de Estudios en Diabetes, Unidad de Investigacion en Diabetes y Riesgo Cardiovascular, Centro de Investigacion en Salud Poblacional, Instituto Nacional de Salud Publica, Mexico City, Mexico (M. E. González-Villalpando, C. González-Villalpando); Department of Molecular Biology, Harvard Medical School, Boston, Massachusetts (Flannick, Altshuler); Department of Biological Sciences, Columbia University, New York, New York (Williams); Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor (Boehnke); Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles (Henderson, Haiman); Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Sección XVI, Tlalpan, Mexico City, Mexico (Gómez-Vázquez, Arellano-Campos, Ordóñez-Sánchez, Rodríguez-Torres, Rodríguez-Guillén, Tusié-Luna, Aguilar-Salinas); Department of Genetics, Harvard Medical School, Boston, Massachusetts (Altshuler); Center for Human Genetic Research, Massachusetts General Hospital, Boston (Altshuler); Department of Biology, Massachusetts Institute of Technology, Cambridge (Altshuler); Unidad de Investigación Médica en Enfermedades Metabólicas, CMN SXXI, Instituto Mexicano del Seguro Social, Mexico City (Revilla-Monsalve, Islas-Andrade); Epidemiology Program, University of Hawaii Cancer Center, Honolulu (Wilkens, Le Marchand); Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway (Najmi); The Genomics Platform, The Broad Institute of Harvard and MIT, Cambridge, Massachusetts (Fennell, Gabriel); Human Genetics Center, University of Texas Health Science Center at Houston (Hanis); Department of Medicine, University of Texas Health Science Center at San Antonio (Lehman, Jenkinson, Abboud); Department of Human Genetics, University of Chicago, Chicago, Illinois (Bell); Department of Medicine, University of Chicago, Chicago, Illinois (Bell); Broad Institute of Harvard and MIT, Cambridge, Massachusetts (Cortes).
Author Contributions: Dr Estrada had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Study concept and design: Estrada, Aukrust, Bjørkhaug, Burtt, Orozco, Haiman, Tusié-Luna, Altshuler, Njølstad, MacArthur, Williams, Islas-Andrade, M. González-Villalpando, Hanis, Florez, Boehnke.
Acquisition, analysis, or interpretation of data: Estrada, Aukrust, Bjørkhaug, Burtt, Mercader, Garcia-Ortiz, Huerta-Chagoya, Moreno-Macías, C. González-Villalpando, Orozco, Salinas, Altshuler, Njølstad, MacArthur, Flannick, Cortes, Williams, Gómez-Vázquez, Fernandez-Lopez, Martínez-Hernández, Centeno-Cruz, Mendoza-Caamal, Revilla-Monsalve, Córdova, Soberón, Henderson, Wilkens, Marchand, Arellano-Campos, Ordóñez-Sánchez, Torres, Rodríguez-Guillén, Riba, Walford, Najmi, Jacobs, Fennell, Gabriel, Fontanillas, Jiménez-Morales, Hanis, Florez, Lehman, Jenkinson, Abboud, Bell, Boehnke.
Drafting of the manuscript: Estrada, Mercader, Garcia-Ortiz, Huerta-Chagoya, Moreno-Macías, Orozco, Altshuler, Njølstad, MacArthur, Cortes, Martínez-Hernández, Centeno-Cruz, Islas-Andrade, Córdova, Henderson, Arellano-Campos, Najmi, Gabriel, Jiménez-Morales.
Critical revision of the manuscript for important intellectual content: Estrada, Aukrust, Bjørkhaug, Burtt, Mercader, C. González-Villalpando, Orozco, Haiman, Tusié-Luna, Salinas, Altshuler, Njølstad, MacArthur, Flannick, Williams, Gómez-Vázquez, Fernandez-Lopez, Mendoza-Caamal, Revilla-Monsalve, Soberón, M. González-Villalpando, Wilkens, Marchand, Torres, Rodríguez-Guillén, Riba, Walford, Jacobs, Fennell, Gabriel, Fontanillas, Hanis, Florez, Lehman, Jenkinson, Abboud, Bell, Boehnke.
Statistical analysis: Estrada, Mercader, Garcia-Ortiz, Huerta-Chagoya, Moreno-Macías, Orozco, Haiman, Altshuler, MacArthur, Flannick, Williams, Gómez-Vázquez, Fernandez-Lopez, Walford, Najmi, Fennell, Fontanillas, Boehnke.
Obtained funding: Orozco, Tusié-Luna, Altshuler, Njølstad, Cortes, Soberón, Wilkens, Hanis, Florez, Lehman, Boehnke.
Administrative, technical, or material support: Aukrust, Bjørkhaug, Burtt, Orozco, Tusié-Luna, Salinas, Altshuler, Njølstad, MacArthur, Flannick, Cortes, Fernandez-Lopez, Martínez-Hernández, Centeno-Cruz, Mendoza-Caamal, Revilla-Monsalve, Islas-Andrade, Córdova, Ordóñez-Sánchez, Torres, Rodríguez-Guillén, Riba, Jiménez-Morales, Florez, Lehman, Jenkinson, Abboud, Bell.
Study supervision: Aukrust, Bjørkhaug, Burtt, C. González-Villalpando, Orozco, Tusié-Luna, Altshuler, Njølstad, MacArthur, M. González-Villalpando, Riba, Gabriel, Florez.
Conflict of Interest Disclosures: All authors have completed and submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest and none were reported.
Funding/Support: The work was conducted as part of the Slim Initiative for Genomic Medicine, a project funded by the Carlos Slim Health Institute in Mexico. The UNAM/INCMNSZ Diabetes Study was supported by Consejo Nacional de Ciencia y Tecnología grants 138826, 128877, CONACT-SALUD 2009-01-115250, and a grant from Dirección General de Asuntos del Personal Académico, UNAM, IT 214711. The Diabetes in Mexico Study was supported by Consejo Nacional de Ciencia y Tecnología grant 86867 and by Instituto Carlos Slim de la Salud, A.C. The Mexico City Diabetes Study was supported by National Institutes of Health (NIH) grant R01HL24799 and by the Consejo Nacional de Ciencia y Tecnología grants 2092, M9303, F677-M9407, 251M, and 2005-C01-14502, SALUD 2010-2-151165. The Multiethnic Cohort was supported by NIH grants CA164973, CA054281, and CA063464. The Singapore Chinese Health Study was funded by the National Medical Research Council of Singapore under its individual research grant scheme and by NIH grants R01 CA55069, R35 CA53890, R01 CA80205, and R01 CA144034. The Type 2 Diabetes Genetic Exploration by Next-generation sequencing in multi-Ethnic Samples (T2D-GENES) project was supported by NIH grants U01DK085526 and U01DK085501. The San Antonio Mexican American Family Studies (SAMAFS) were supported by R01 DK042273, R01 DK047482, R01DK053889, R01 DK057295, P01 HL045522, and a Veterans Administration Epidemiologic grant (R.A.D). The University of Bergen, Research Council of Norway, KG Jebsen Foundation, Helse Vest, and European Research Council funded the Norwegian team. Dr Mercader was supported by a Sara Borrell Fellowship from the Instituto Carlos III, Spain. Dr Estrada was supported by The Netherlands Organization for Scientific Research under the Rubicon fellowship 825.12.023.
Role of the Sponsors: The funding sources had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
The SIGMA Type 2 Diabetes Consortium: Writing Team: Karol Estrada, PhD, Ingvild Aukrust, PhD, Lise Bjørkhaug, PhD, Noël P. Burtt, PhD, Josep M. Mercader, PhD, Humberto García-Ortiz, PhD, Alicia Huerta-Chagoya, MSc, Hortensia Moreno-Macías, PhD, Geoffrey Walford, MD, Jason Flannick, PhD, Amy L. Williams, PhD, Michael Boehnke, PhD, Clicerio González-Villalpando, MD, Lorena Orozco, MD, PhD, Christopher A. Haiman, ScD, Teresa Tusié-Luna, MD, PhD, Carlos A. Aguilar-Salinas, MD, PhD, David Altshuler, MD, PhD, Pål R. Njølstad, MD, PhD, Jose C. Florez, MD, PhD, Daniel G. MacArthur, PhD.
Analysis Team: Karol Estrada, PhD, Alicia Huerta-Chagoya, MSc, Humberto García-Ortiz, PhD, Hortensia Moreno-Macías, PhD, Josep M. Mercader, PhD, Jason Flannick, PhD, Amy L. Williams, PhD, María J. Gómez-Vázquez, BSc, Juan C. Fernandez-Lopez, MSc, Noël P. Burtt, PhD, Carlos A. Aguilar-Salinas, MD, PhD, Lorena Orozco, MD, PhD, Teresa Tusié-Luna, MD, PhD, David Altshuler, MD, PhD, Jose C. Florez, MD, PhD, Daniel G. MacArthur, PhD; Whole-Exome Sequenced cohorts: Diabetes in Mexico Study: Humberto García-Ortiz, PhD, Angélica Martínez-Hernández, PhD, Federico Centeno-Cruz, PhD, Elvia Mendoza-Caamal, MD, Cristina Revilla-Monsalve, PhD, Sergio Islas-Andrade, MD, PhD, Emilio J. Córdova, PhD, Xavier Soberón, PhD, Lorena Orozco, MD, PhD. Mexico City diabetes study: Clicerio González-Villalpando, MD, María E. González-Villalpando, MD. Multiethnic cohort study: Christopher A. Haiman, ScD, Brian E. Henderson, MD, Lynne R. Wilkens, DrPH, Loic Le Marchand, MD, PhD. UNAM/INCMNSZ diabetes study: Olimpia Arellano-Campos, MD, PhD, Alicia Huerta-Chagoya, MSc, Maria L. Ordóñez-Sánchez, BSc, Maribel Rodríguez-Torres, BSc, Rosario Rodríguez-Guillén, MSc, Laura Riba, MSc, Teresa Tusié-Luna, MD, PhD, Carlos A. Aguilar-Salinas, MD, PhD.
Functional Studies: Laeya A. Najmi, MSc, Ingvild Aukrust, PhD, Lise Bjørkhaug, PhD, Suzanne B. R. Jacobs, PhD, Pål R. Njølstad, MD, PhD.
Whole-Exome Sequencing: Noël P. Burtt, PhD, Timothy Fennell, BSc, Broad Genomics Platform, Stacey Gabriel, PhD.
Replication Studies: T2D-GENES Consortium: Jason Flannick, PhD, Pierre Fontanillas, PhD, Craig L. Hanis, PhD, Donna M. Lehman, PhD, Christopher P. Jenkinson, PhD, Hanna E. Abboud, MD, Graeme I. Bell, PhD, Jose C. Florez, MD, PhD, David Altshuler, MD, PhD, Michael Boehnke, PhD. Diabetes in Mexico study 2: Humberto García-Ortiz, PhD, Angélica Martínez-Hernández, PhD, Emilio J. Córdova, PhD, Silvia Jiménez-Morales, PhD, Federico Centeno-Cruz, PhD, Elvia Mendoza-Caamal, MD, Cristina Revilla-Monsalve, PhD, Sergio Islas-Andrade, MD, PhD, Xavier Soberón, PhD, Lorena Orozco, MD, PhD.
Scientific and Project Management: Noël P. Burtt, PhD, Maria L. Cortes, PhD.
Steering Committee: David Altshuler, MD, PhD, Jose C. Florez, MD, PhD, Christopher A. Haiman, ScD, Carlos A. Aguilar-Salinas, MD, PhD, Clicerio González-Villalpando, MD, Lorena Orozco, MD, PhD, Teresa Tusié-Luna, MD, PhD.
Additional Information: The members of the SIGMA Type 2 Diabetes Consortium mourn the sudden passing of coauthor Laura Riba, a good friend, respected colleague, and lab manager who made outstanding contributions to research on type 2 diabetes in Mexico. We dedicate this article to her memory.
Additional Contribution: Researchers of the DMS2 study thank Olaf Iván Corro Labra and José Luis de Jesus García Ruíz from the “Comisión Nacional para el Desarrollo de los Pueblos Indígenas” for their support in sample collection, for which they were not compensated.
Correction: The authors added a tribute on August 20, 2014 to a colleague who had died unexpectedly and added the name of an author who was not included in the byline.