Regression of SLC1A1 expression on single-nucleotide polymorphisms using publicly accessible lymphoblastoid cell line data. Depicted here are 3 cis-acting expression quantitative trait loci that were nominally significant and had the same direction of effect in 2 independent data sets.24,25 The single-nucleotide polymorphism rs301430 has previously been shown to be associated with obsessive-compulsive disorder both as a single marker and as part of a haplotype.17,18 A, Stranger et al25 and Spielman et al24 data, P = .02. B, Stranger et al25 data, P = .02; Spielman et al24 data, P = .005. C, Stranger et al25 data, P = .02; Spielman et al24 data, P = .01.
Regression of SLC1A1 expression on single-nucleotide polymorphisms in brain tissue. A total of 97 samples of human postmortem dorsolateral prefrontal cortex were genotyped and quantitated for SLC1A1 messenger RNA levels by real-time reverse-transcriptase–polymerase chain reaction. Two of the 3 expression quantitative trait loci single-nucleotide polymorphisms that were analyzed were nominally significant and had the same direction of effect in brain tissue as in lymphoblastoid cell lines (Figure 1). A, nonsignificant. B, P = .02. C, P = .02.
Reporter gene analyses of putative expression quantitative trait loci. The 3 single-nucleotide polymorphisms identified in this study to be associated with SLC1A1 messenger RNA levels were tested in luciferase constructs. Six constructs were transfected 5 times in both human neuroblastoma (SH-SY5Y) and rat pheochromocytoma (PC12) cell lines. Error bars represent standard error of mean. One single-nucleotide polymorphism, rs301430, showed allele-specific differential luciferase expression that was highly significant in neuroblastoma (t 8 = 8.6, P < .001) and in PC12 cells (t8 = 6.1, P < .001). Note that the T allele–mediated reduction in luciferase expression for rs301430 is in line with the T allele–associated decrease in SLC1A1 messenger RNA levels in lymphoblastoid cell lines and brain tissue (Figure 1 and Figure 2). NS indicates nonsignificant.
Two linear regression analyses of rs3933331 for the hoarding subphenotype, both factor analysis–derived (A) and on the basis of the Saving Inventory–Revised (SIR) score (B), which were nominally significant. A, P = .005. B, P = .009.
Wendland JR, Moya PR, Timpano KR, Anavitarte AP, Kruse MR, Wheaton MG, Ren-Patterson RF, Murphy DL. A Haplotype Containing Quantitative Trait Loci for SLC1A1 Gene Expression and Its Association With Obsessive-Compulsive Disorder. Arch Gen Psychiatry. 2009;66(4):408-416. doi:10.1001/archgenpsychiatry.2009.6
Copyright 2009 American Medical Association. All Rights Reserved. Applicable FARS/DFARS Restrictions Apply to Government Use.2009
Recent evidence from linkage analyses and follow-up candidate gene studies supports the involvement of SLC1A1, which encodes the neuronal glutamate transporter, in the development of obsessive-compulsive disorder (OCD).
To determine the role of genetic variation of SLC1A1 in OCD in a large case-control study and to better understand how SLC1A1 variation affects functionality.
A case-control study.
Publicly accessible SLC1A1 expression and genotype data.
Three hundred twenty-five OCD probands and 662 ethnically and sex-matched controls.
Probands were assessed with the Structured Clinical Interview for DSM-IV, the Yale-Brown Obsessive Compulsive Scale, and the Saving Inventory–Revised. Six single-nucleotide polymorphisms (SNPs) were genotyped. Multiple testing corrections for single-marker and haplotype analyses were performed by permutation.
Gene expression of SLC1A1 is heritable in lymphoblastoid cell lines. We identified 3 SNPs in or near SLC1A1 that correlated with gene expression levels, 1 of which had previously been associated with OCD. Two of these SNPs also predicted expression levels in human brain tissue, and 1 SNP was further functional in reporter gene studies. Two haplotypes at 3 SNPs, rs3087879, rs301430, and rs7858819, were significantly associated with OCD after multiple-testing correction and contained 2 SNPs associated with expression levels. In addition, another SNP correlating with SLC1A1 gene expression, rs3933331, was associated with an OCD-hoarding subphenotype as assessed by 2 independent, validated scales.
Our case-control data corroborate previous smaller family-based studies that indicated that SLC1A1 is a susceptibility locus for OCD. The expression and genotype database–mining approach we used provides a potentially useful complementary approach to strengthen future candidate gene studies in neuropsychiatric and other disorders.
Dysfunctional glutamatergic neurotransmission in the pathogenesis of obsessive-compulsive disorder (OCD) was postulated on the basis of magnetic resonance imaging studies.1,2 This hypothesis was subsequently supported by animal models based on abnormal glutamate neurotransmission that contributed to OCD spectrum–like phenotypes,3,4 case reports of beneficial therapeutic effects of riluzole (an antiglutamatergic agent) on treatment-resistant OCD,5 and elevated glutamate levels in the cerebrospinal fluid of OCD probands.6 From a genetic perspective, investigation of glutamatergic genes in OCD therefore represents a plausible biological candidate approach given this disorder's high heritability and risk to relatives.7,8 Indeed, studies have shown positive associations of glutamate receptor genes with OCD.9,10
More recently, a glutamate transporter gene, SLC1A1 (GenBank NM_004170), has been proposed to be involved with OCD in particular, as this gene is also a chromosome 9p24 positional candidate from linkage analyses.11,12 This transporter (also known as EAAC1 and EAAT3) is widely expressed in neurons and also involved in cysteine transport.13,14 While an initial mutation screen of SLC1A1 and family association study did not detect biased transmission in OCD,15 3 subsequent studies with larger numbers of tested polymorphisms and somewhat larger proband samples yielded positive associations in both single-marker and haplotypic tests, though neither specific single-nucleotide polymorphisms (SNPs) nor haplotypes closely corresponded.16- 18 These 3 family-based studies motivated us to analyze several SLC1A1 polymorphisms in our large and phenotypically well-characterized collection of OCD probands using a case-control study design. Case-control studies, while susceptible to population stratification, provide greater statistical power than family-based designs. We focused on SLC1A1 SNPs that were significant in at least 1 previous study and we included additional SNPs that predicted gene expression levels as determined by analyzing deposited expression and genotype data from nonpatient sources. We present evidence for a susceptibility haplotype spanning large portions of SLC1A1 and also for a SNP located 100 kilobases (kb) upstream of SLC1A1 that was associated with an OCD subphenotype, compulsive hoarding, which has some genetic linkage and clinical differences from the more common OCD phenotype.19- 21
For heritability calculations, we extracted lymphoblastoid cell line SLC1A1 gene expression values (probe No. 213664_at) from the Gene Expression Omnibus data set GSE1485.22 All individuals had a “present” call for this probe; technical replicates were averaged and then log2-transformed. Family structures for the 14 three-generational families represented in this data set were obtained from the Centre d’Etude du Polymorphisme Humain database (http://www.cephb.fr/en/cephdb/browser.php). Pedigree and expression data were then analyzed with the Sequential Oligogenic Linkage Analysis Routines software package by performing a standard quantitative genetic analysis.23 The residual kurtosis of 0.125 was within the normal range.
For expression quantitative trait locus (eQTL) analyses, we retrieved lymphoblastoid cell line SLC1A1 expression values for the 60 HapMap Centre d’Etude du Polymorphisme Humain founders with northern and western European ancestry from 2 independent platforms: the Gene Expression Omnibus data set GSE5859 (Affymetrix platform, probe No. 213664_at; replicates were averaged and transformed as above)24 and from the Gene Expression Variation Web site hosted by the Wellcome Trust Sanger Institute (http://www.sanger.ac.uk/humgen/genevar; Illumina platform, probe No. GI_31543625-S).25 Genotypes for all 90 individuals from the Centre d'Etude du Polymorphisme Humain HapMap data were downloaded from the HapMap Web site (http://www.hapmap.org, release 22) for the entire SLC1A1 region plus 1 megabase (Mb) flanking both sides (corresponding to position 3 480 444 to 5 577 469 on chromosome 9 in the University of California–Santa Cruz Genome Browser Human, March 2006 assembly). Markers inconsistent with mendelian inheritance patterns, with minor allele frequencies of less than 1%, or with more than 10% missing genotypes were removed, resulting in a total of 2467 polymorphisms that were analyzed. These SNPs were then tested for quantitative association with SLC1A1 expression using PLINK,26 Microsoft Excel (Microsoft, Redmond, Washington), and Prism 4 (GraphPad Software, San Diego, California).
We obtained RNA and genomic DNA from 105 postmortem dorsolateral prefrontal cortex samples through the Stanley Medical Research Institute, Chevy Chase, Maryland. This collection is composed of 3 diagnostic groups (35 normal, 35 bipolar, and 35 schizophrenic donors). With the exception of 2, all samples were from white donors. Detailed demographics of this sample are available at http://www.stanleyresearch.org. Six of these 105 samples did not provide sufficient RNA for quantitation, and the 2 nonwhite samples were quantitated but not analyzed, resulting in a total of 97 samples, which are presented herein. All samples were successfully genotyped for rs3933331, rs7858819, and rs301430.
We reverse-transcribed 1 μg of total RNA with oligo-dT primers using the Transcriptor First Strand cDNA Synthesis kit according to the manufacturer's instructions (Roche Applied Science, Indianapolis, Indiana). Next, we amplified a total of 5 ng of reverse-transcribed RNA in triplicate reactions with 1 assay for SLC1A1 and 1 endogenous control assay for PGK1. The SLC1A1 assay was designed with the Universal Probe Library Assay Design Center, version 2.4 (Roche Applied Science) using primers GCT GTC GAC TGG CTC CTG and GGA GAG CTT TTC CAC AAT GC (Operon Biotechnologies, Huntsville, Alabama) and Universal Probe Library probe No. 9 (Roche Applied Science). The PGK1 assay was predesigned and purchased directly from the vendor (Universal Probe Library Human PGK1 Gene Assay, Roche Applied Science). Both assays span introns and do not contain known SNPs in primers or probes. Amplification was carried out in a 10-μL total reaction volume with 500 nM of each primer and 100 nM of fluorescent probe using PerfeCta qPCR FastMix (Quanta BioSciences, Gaithersburg, Maryland). Thermocycling consisted of 2 minutes at 45°C for uracil-N-glycosylase incubation, 30 seconds at 95°C for hot-start activation, and 40 amplification cycles of 2 seconds at 95°C and 20 seconds at 60°C followed by fluorescence acquisition in a LightCycler 480 instrument (Roche Applied Science). Crossing-point determination by second derivative maximum was carried out with LightCycler 480 software, release 1.3. Relative abundances of SLC1A1 messenger RNA were then calculated using the efficiency-corrected comparative threshold method and log2-transformed.27 Efficiency calculations were performed with 2-fold serially diluted pooled complementary DNA (range, 30-0.47 ng per reaction). Amplicon size and assay specificity were verified by agarose electrophoresis. No-template controls did not display detectable amplification (data not shown). All experiments were done after the specimen code was broken and they were thus unblinded. The entire data set has been uploaded to the Stanley Medical Research Institute data bank (http://www.stanleyresearch.org/brain).
We performed luciferase assay experiments for 3 SNPs within or near SLC1A1 as recently described.28 Briefly, genomic DNA was amplified with primers flanking rs3933331 (TTC TGC CCA AGA CAA TCA CA and TCT GGG TTT AGA CTG CCA CA), rs7858819 (TGC AAA TGG TTA TGG GAC AA and TCA AAT GGT CCT TGG CTA CC), and rs301430 (CAT GTC TTC AGG CAG GGA CT and CAC TGC GAC ATT CTT GGT GT; all primers were obtained through Operon Biotechnologies). Amplicons were then TA-cloned into pCR2.1 (Invitrogen, Carlsbad, California) and subcloned into the pGL3 reporter vector (Promega, Madison, Wisconsin). Equal orientation and sequence specificity, including absence of additional SNPs, were confirmed through sequencing of pGL constructs at the National Institute of Neurological Disorders and Stroke intramural DNA sequencing core facility. Allele-specific luminescence was determined with the Dual Luciferase Reporter Assay System (Promega). For each of the 6 reporter constructs, 5 independent transfections were performed in both human neuroblastoma (SH-SY5Y) and rat pheochromocytoma (PC12) cell lines grown under standard conditions. For each SNP, data were normalized to the geometric mean of 1 allele and tested against the other allele with a 2-sided, unpaired t test using Prism 4 (GraphPad Software).
Adult OCD probands (325 white subjects) were recruited through an ongoing outpatient OCD program at the National Institute of Mental Health, Bethesda, Maryland. All probands were aged at least 18 years at the time of inclusion (mean age at time of interview, 38 years; standard deviation [SD], 16 years) and gave written informed consent. The diagnosis was made on the basis of the Structured Clinical Interview for DSM-IV. Exclusion criteria were active psychosis or schizophrenia, severe mental retardation that would prohibit an evaluation of OCD, and OCD symptoms that occurred exclusively in the context of major depression.
The sample consisted of 186 women (57%) and 139 men (43%), with a mean age at OCD onset of 14 years (SD, 9.6 years), which was similar to that in the SLC1A1 OCD study by Arnold et al (14.4 years)16 and higher than those in the studies by Dickel et al (8 years)17 and Stewart et al (7.9 years).18 As reported previously in our sample,29 mood (81%) and anxiety (53%) disorders were most frequently represented among comorbid disorders, with alcohol abuse disorder (24%), substance abuse disorders (14%), and tics (6%) less common. Hoarding, as we will define, was present in 27% of this patient sample. Total mean Yale-Brown Obsessive Compulsive Severity Scale (Y-BOCS) lifetime severity scale score was 29.6 (SD, 15.4), which is approximately equally contributed to by obsession and compulsion scores.
Our control samples (662 white subjects [46% male and 54% female]) originated from 3 independent sources: (1) undergraduate students (270 self-declared healthy white students) from a large Southeastern university who participated in a separate study of genes and personality in return for partial course credit; (2) Coriell Cell Repository, Camden, New Jersey (200 self-declared healthy white Americans); and (3) European Collection of Cell Cultures (Sigma-Aldrich, St Louis, Missouri; 192 apparently healthy white blood donors in the United Kingdom). The male to female ratio did not significantly differ between cases and controls (χ21 = 0.72; P > .05). We cannot rule out the occurrence of OCD within the second and third control groups, as self-report and standard scales were used for evaluation of only the first group. Genotypic and allelic frequencies did not significantly differ between these 3 control groups (data not shown). All studies were conducted under protocols approved by the institutional review board at the National Institute of Mental Health Division of Intramural Research Programs and by the Human Subjects Committee at Florida State University. Additional details on proband and control samples have been described previously.29,30
Genomic DNA was extracted from whole blood obtained through venipuncture or from saliva samples (Oragene discs; DNA Genotek, Ottawa, Ontario, Canada). All SNPs were genotyped with 5′ exonuclease assays (Applied Biosystems TaqMan predesigned or custom assays; Applied Biosystems, Foster City, California) (assay identification numbers and primer/probe sequences are available from the corresponding author) under standard conditions described previously.31 The overall genotyping completion rate exceeded 98% for each assay; the per-individual genotyping completion rate was at least 5 of 6 SNPs. None of the SNPs deviated from Hardy-Weinberg equilibrium as determined by contingency-table statistics (nominal P > .05; data not shown). Duplicate samples (at least 10% of all samples, randomly chosen, for each of the 6 SNPs) and no-template controls consistently yielded expected results.
Genotype data from probands and controls were analyzed for single-locus and haplotypic associations with the software packages PLINK26 and WHAP.32 Conditional haplotype analyses were carried out as recently described after detection of a significant omnibus association.28 Calculation of intermarker linkage disequilibrium and correction for multiple testing with 100 000 permutations for the 6 individual markers as well as for the 3-locus haplotype was conducted using HaploView, version 4.0.33
Using the Genetic Power Calculator,34 we determined that our sample had 78% power at P = .05 with unscreened controls (risk allele frequency = 0.3; disease prevalence = 0.025; genotypic relative risk Aa = 1.4 and AA = 1.8; marker allele frequency = 0.3 at D′ = 0.9 with risk variant). When we selected screened controls and used otherwise identical parameters, power rose only slightly, to 80%. We conclude from these calculations that our partially unscreened control sample did not significantly affect our study results.
We examined 2 subphenotypes in this article: (1) we measured lifetime OCD symptom severity using Y-BOCS35; and (2) we further considered hoarding as a second subphenotype based on 2 separate, validated measures of the hoarding construct. We focused on hoarding as a subphenotype, since its familiality is stronger than other OCD subphenotypes,36 which is further reflected by 2 recent significant linkage analyses.19,20 The Y-BOCS symptom checklist was used to derive factor scores for the hoarding symptom dimension using principal component analysis and the Varimax procedure, as previously reported.37 The factor analysis generated a factor score for each subject representing the correlation of their symptom profile with the hoarding factor, factor IV, as reported previously.37 Because the hoarding factor was essentially independent (factor loading = 0.85) from the other 3 factors identified in our sample (factor loading on the hoarding factor by other factors, 0.01-0.16) and likewise by others,36- 38 the hoarding factor scores were standardized, with a mean of 0 and an SD of 1. Subjects were dichotomized on the basis of scores greater than 0 or less than 0 on the hoarding factor. Details of this procedure have been presented elsewhere.37,38 We also assessed the severity of hoarding symptoms with the Saving Inventory–Revised (SIR).39 This measure is a validated tool for examining the magnitude of hoarding symptoms and includes facets of this phenomenon not considered in the Y-BOCS symptom checklist.40 Both of these measures of hoarding have been previously characterized and evaluated in our clinical OCD sample.37,40
Statistical differences in these subphenotypes according to genotype were ascertained using linear regression and SPSS, version 14.0 (SPSS Inc, Chicago, Illinois), for Microsoft Windows XP (Microsoft). All tests were performed with 2-sided P < .05. Given missing data, the populations for each of the analyses varied slightly (SIR, n = 198; hoarding factor, n = 216).
Using publicly available gene expression data, we observed that SLC1A1 is robustly expressed in lymphoblastoid cell lines. This prompted us to investigate whether SLC1A1 expression constitutes a heritable trait. By analyzing 194 lymphoblastoid cell lines from 14 three-generational families,22 we determined an additive (narrow-sense or h2) heritability of 0.23 (P < .001) for SLC1A1 gene expression, with no significant sex effects when sex was included as a covariate. We then sought to identify eQTLs for SLC1A1 acting in cis within 1 Mb of the SLC1A1 genomic locus by performing a linear regression analysis of expression data on genotypes recoded as 0, 1, or 2 on the basis of minor allele occurrences. Of 2467 markers tested, 3 SNPs that were not in complete linkage disequilibrium scored a nominal P ≤ .02 and had the same direction of effect in both data sets (ie, positive or negative regression coefficient; r2 value, 9%-14%) (Figure 1). One of these SNPs, rs3933331, is located about 100 kb upstream of the SLC1A1 transcription initiation site, while the 2 other eQTL SNPs, rs7858819 and rs301430, are in the 3′ end of SLC1A1. Remarkably, rs301430 has recently been associated with early-onset OCD in a collection of 71 trios both individually and as part of a haplotype,17 and the haplotypic association has been replicated subsequently.18
As a next step, we analyzed whether the 3 eQTLs we identified were also associated with SLC1A1 gene expression in neuronal tissue. To this end, we determined genotypes and relative SLC1A1 messenger RNA levels by quantitative polymerase chain reaction in 97 postmortem brain samples from normal, bipolar, and schizophrenic donors, all derived from the dorsolateral prefrontal cortex. There were no significant differences in SLC1A1 expression levels among the 3 diagnosis groups as calculated by analysis of variance (data not shown). We then analyzed all 3 groups jointly for association with eQTLs to have increased power. As shown in Figure 2, rs7858819 and rs301430 were nominally significantly associated with SLC1A1 messenger RNA levels in human brain tissue (P = .02 each). Moreover, the direction of effect (increased expression with increasing number of T alleles at rs7858819 and decreased expression at rs301430) was identical to the data from lymphoblastoid cell line data (Figure 1 and Figure 2).
Following the replication of 2 of 3 eQTLs in neuronal tissue, we sought to directly assess functionality of these SNPs in vitro through the use of luciferase reporter gene assays. We generated constructs with both alleles at each of the 3 SNPs, rs3933331, rs7858819, and rs301430, and used human neuroblastoma (SH-SY5Y) and rat pheochromocytoma (PC12) cell lines for transfection. As depicted in Figure 3, the 2 alleles at rs301430 had a highly significant differential effect on reporter gene expression in both cell lines (P < .001 for both). In line with the T allele–associated decrease in SLC1A1 expression in lymphoblastoid cell lines and neuronal tissue, the rs301430 construct with the T allele showed about 50% decreased expression compared with the C allele (Figure 3).
We then genotyped the 3 eQTL SNPs we described and 3 additional SNPs within SLC1A1 that have been individually associated with OCD in prior family-based studies16,17 in our collection of 325 probands and 662 controls matched for ethnicity and sex. Table 1 details genomic location and linkage disequilibrium of the 6 analyzed markers. As presented in Table 1, none of these 6 SNPs were individually associated with OCD after permutation-based multiple-testing correction. We also performed these analyses separately for males and females and did not observe significant results (data not shown).
Haplotype analyses, in contrast, revealed a highly significant omnibus association of 3 of these markers, rs3087879, rs301430, and rs7858819, with OCD (Table 2). In haplotype-specific tests (ie, when tested individually against all other haplotypes, with df = 1), 3 of the 4 common haplotypes were nominally significant at P < .05. Two of these haplotypes remained significant after multiple-testing correction by permutation (Table 2). One haplotype, H4 in Table 2, was highly significant and almost twice as common in OCD than in controls (haplotype frequency, 9.4% vs 5.1%, respectively; odds ratio, 1.89; P = .001, family-wise permuted). We obtained similar—though not always as highly significant—results when we conducted these haplotype analyses stratified by sex (Table 2).
Lastly, we set out to determine whether or not SLC1A1 is also associated with differential OCD symptom presentation, given the growing appreciation of OCD as a clinically, and possibly etiologically, heterogeneous disorder.38,41- 43 We performed subphenotype analyses on the basis of Y-BOCS–ascertained OCD symptom severity and the hoarding phenotype evaluated by 2 measures: (1) a Y-BOCS factor analysis–derived hoarding symptom dimension,37 and (2) the magnitude of hoarding symptoms, as indexed by the SIR.39,44 We observed a nominally significant association of 1 eQTL SNP, rs3933331, with the hoarding phenotype. Specifically, this SNP was significantly associated with hoarding as assessed by both the Y-BOCS hoarding symptom dimension (β = .21, t216 = 2.9, P = .005) and total SIR scores (β = .19, t198 = 2.6, P = .009). That is, greater scores on the hoarding dimension, as well as more severe SIR symptom scores, were strongly correlated with the number of G alleles at rs3933331 (Figure 2). The association with SIR scores appears to be largely driven by the acquisitioning subscale (β = .22, P = .002), as rs3933331 was less strongly associated with the clutter (β = .18, P = .01) and difficulty discarding (β = .115, P > .05) subscales.
In the present study, we analyzed the candidate gene SLC1A1, which encodes the neuronal glutamate transporter, in a large collection of OCD probands and controls in light of increasing interest in glutamatergic neurotransmission in OCD development and its treatment. We identified an SLC1A1 haplotype that was strongly associated with OCD that consists of 3 markers between intron 2 and the 3′ untranslated region of SLC1A1, 2 of which we found to be SNPs that predict gene expression levels (ie, eQTLs) in lymphoblastoid cell lines. In addition, another eQTL polymorphism, located somewhat upstream from SLC1A1, was strongly correlated with an OCD subphenotype (hoarding), as evaluated by 2 validated scales. Our data corroborate the hypothesis of dysfunctional glutamatergic neurotransmission in OCD and hoarding, mediated in part through SLC1A1.
Three recently published, family-based association studies have suggested SLC1A1 to be a candidate gene for OCD.16- 18 However, no given SNP within SLC1A1 in these 3 studies was consistently positively associated with OCD across the 3 studies when tested individually, with the exception of rs3780412, which was considered significant in 2 studies: the overall and males-only study by Dickel and coworkers17 and in the males-only sample by Stewart and colleagues.18 However, our results do not support a specific involvement of this marker in OCD, nor did we observe any preferential or stronger association of single variants or haplotypes in males compared with females in our case-control sample (Table 1 and Table 2). Speculatively, this observation might be related to the increased power of our case-control design compared with the family-based studies published previously.16- 18 Clearly, additional studies are warranted to clarify the discrepancy for preferential association in males observed in prior family-based studies,16- 18 though the overall finding of involvement of SLC1A1 in OCD now appears more clearly replicable.
One of the most important common findings in these 3 previous SLC1A1 studies was a haplotypic association in the 3′ region of the gene.16- 18 Our results are in line with this observation, as 2 of the 3 SNPs of the associated haplotype we identified are located in exon 10 and the 3′ untranslated region. The third SNP of this haplotype, however, lies somewhat upstream (intron 2) of this region, and although all 3 SNPs (rs7858819, rs301430, and rs3087879) were in considerable linkage disequilibrium with each other (Table 1), we speculatively interpret this observation as evidence for more than 1 functional locus within SLC1A1, as has been suggested previously.17,18 Additional support for this tentative conclusion stems from another SNP, rs3933331, located farther upstream and clearly not in linkage disequilibrium with the previously mentioned haplotype markers that we found to be associated with gene expression functionality and that predicted an OCD hoarding subphenotype in our sample (Figure 1 and Figure 2). The possibility of 2 or more functional (possibly causal) variants of differential effect size provides 1 plausible explanation for the molecular discrepancies between present and published SLC1A1 genotyping studies in OCD, especially if these were found to be interacting from within different haplotype blocks. This possibility might be addressed by genotyping markers at higher density or by resequencing large genomic regions.
We have made extensive use of publicly accessible data with the aim of better understanding 1 facet of SLC1A1 functionality—gene expression. Some of the most extensive gene expression data sets currently available were obtained from lymphoblastoid cell lines, and we used these data to better understand how genetic SLC1A1 variation might affect its gene expression levels. To this end, it was first necessary to determine whether or not SLC1A1 is expressed in lymphoblastoid cell lines at sufficiently high levels for quantitative analyses and then to identify the degree to which (if at all) SLC1A1 gene expression in lymphoblastoid cell lines constitutes a heritable trait. The additive heritability (h2) value of 0.23 determined herein, while highly significant, is quite modest compared with some other genes expressed in lymphoblastoid cell lines, whose heritabilities ranged between 0.18 and 1, with a median of 0.34.45 In light of the relatively modest additive heritability for SLC1A1 expression now determined in our analysis, it is not surprising that the eQTL markers identified herein also had a small effect size (Figure 1). Nevertheless, our approach proved valuable, not only because it identified a previously OCD-associated SNP, rs301430, as an eQTL, but also because (1) the 3-marker haplotype that was strongly associated with OCD contained 2 eQTLs, and (2) another eQTL marker farther upstream correlated with an OCD hoarding subphenotype.
Given the modest eQTL significance values and effect sizes from our analyses in lymphoblastoid cell lines that would not withstand multiple testing correction (Figure 1), we conducted additional studies to further assess functionality. We were able to show that 2 of the 3 initial eQTLs, rs7858819 and rs301430, are also associated with SLC1A1 messenger RNA expression in the brain with similar significance levels and the same effect direction (Figure 1 and Figure 2). These results not only strengthen the relevance of these 2 markers for SLC1A1 genetics research, but also suggest that regulation of gene expression in lymphoblastoid cell lines resembles brain regulation for this gene. This would make this readily obtainable cell line suitable for future functional and population-based analyses of SLC1A1 in a manner that has been used extensively for a related molecule, the serotonin transporter (SLC6A4).46,47 It also calls for a systematic comparison of gene expression genetics between brain tissue and lymphoblastoid cell lines now that suitable brain data sets have become available.48
The second complementary study, ie, luciferase reporter gene constructs, was aimed at direct functional assessment of individual SNPs. While association analyses with tissue samples or cell lines have the advantage of more closely resembling in vivo conditions, the eQTLs do not necessarily have to directly confer functionality, as they may merely be in linkage disequilibrium with the actual functional polymorphism. We addressed this uncertainty by examining individual polymorphisms in vitro with reporter gene assays. Our luciferase data for rs301430 suggest that this variant is indeed directly functional. Additional studies are now needed to elucidate the molecular mechanisms through which rs301430 affects gene expression.
The H4 haplotype, as detailed in Table 2, which was almost twice as common in those with OCD as in controls (odds ratio, 1.89), represents the strongest haplotypic association of SLC1A1 with OCD published to date.16- 18 It illustrates the usefulness of our complementary expression–mining approach, because 2 of its markers were eQTLs, and conditional haplotypic association testing with the remaining markers failed to yield positive results (see the “Methods” section).28,32 The individual alleles in H4, however, are not fully compatible with the notion of increased or decreased SLC1A1 gene expression in OCD, as the C allele at rs7858819 was the lower-expressing allele in the regression analyses (Figure 1B), while the C allele at rs301430 in H4 correlated with increased expression (Table 2 and Figure 1). On the other hand, the mildly protective haplotype H2 in Table 2 (odds ratio, 0.78), which narrowly survived correction for multiple testing in individual haplotype analysis, contained the lower-expressing allele at both eQTLs. Here, too, it will be essential to conduct gene expression analyses in larger samples and with more densely spaced markers.
The major limitation of our study is the potential susceptibility of the case-control design to population stratification, especially because our controls originated from 3 different groups that did not closely correspond in their geographic origin to our proband sample. While we addressed this issue by matching self-reported ethnicities of probands and controls, spurious effects in case-control studies (which are regarded by some authors as relatively uncommon49) can be ruled out by genotyping multiple ancestry-informative markers, which was not feasible for this work. Dense genetic information on SLC1A1 markers from different ethnicities and geographic locations will increasingly become available from genome-wide association studies. Such data will allow a better judgment of how strongly (and if at all) population stratification is relevant to the SLC1A1 markers analyzed herein.
Another aspect of genetic studies such as ours with several investigated phenotypes that requires discussion is the multiple comparison problem. In these studies, the multiple statistical tests are clearly not independent of one another given linkage disequilibrium between variants (Table 1) and the high correlation between subphenotypes (hoarding in this case) and the general diagnosis of OCD. Therefore, methods that correct by the total number of tests (eg, the Bonferroni procedure) would be more conservative than appropriate in our view. We conducted empirical family-wise error correction for the categorical single-marker and haplotype analyses by permutation and determined that 2 haplotypes remain significant (H4 and H2) (Table 2). The significance values for our subphenotype analyses are nominally robust (for association of rs3933331 with Y-BOCS score, P = .005, and with SIR-assessed hoarding, P = .009) (Figure 4). They would, however, not withstand Bonferroni correction and therefore need to be interpreted with caution and observed in larger samples in future studies.
In summary, we have identified a strong association of OCD with a 3-marker haplotype composed of eQTLs for SLC1A1, which in turn were determined by exploiting expression and genotype data repositories and further validated with brain expression studies and reporter assays. Our complementary expression analysis approach might prove to be of use in future genetic studies for genes expressed at quantifiable levels in lymphoblastoid cell lines and other population-based tissue collections. Our haplotypic association results call for continued appreciation of glutamatergic neurotransmission abnormalities in OCD research.
Correspondence: Jens R. Wendland, MD, Medical Genetics Branch, National Human Genome Research Institute, National Institutes of Health, 35 Convent Dr, Bldg 35, Porter Bldg, Room 1B209, Bethesda, MD 20892-3717 (email@example.com).
Submitted for Publication: February 29, 2008; final revision received July 5, 2008; accepted August 23, 2008.
Author Contributions: Drs Wendland and Murphy had full access to all of the data in the study and take responsibility for the integrity of the data and accuracy of the data analysis.
Financial Disclosure: None reported.
Funding/Support: This research was supported by the Intramural Research Program of the National Institute of Mental Health, National Institutes of Health, and by a NARSAD 2007 Young Investigator Award and a National Institute of Mental Health Division of Intramural Research Programs Julius Axelrod Memorial Fellowship Training Award to Dr Wendland.
Additional Contributions: We are indebted to all the participants in this study. Diane Kazuba, BS, and Brenda Justement, RN, BSN, MA, conducted patient interviews; Teresa Tolliver, MS, provided technical assistance in DNA extraction; and James Nagle, MS, and Debbie Kauffman, BS, of the National Institute of Neurological Disorders and Stroke DNA sequencing facility provided technical assistance. Postmortem brain tissue was supplied by the Stanley Medical Research Institute.