Distribution of standardized neuroticism scores in the Australian samples collected in 1989 (AU89): all participants (A) and those selected to have a measure collected in 1999 (B), for those in the genome scan (GS) and not in the GS.
MERLIN-REGRESS linkage results of logarithm of odds (LOD) score (y-axis) for each chromosome (1-22 and X) based on a 5–Kosambi cM grid (x-axis) for mean neuroticism score of the Australian (AU), Dutch (NL), and combined AU and NL data sets. Empirical thresholds for suggestive linkage were 1.7 (red horizontal line) for AU and AU and NL combined and 1.9 (blue horizontal line) for NL.
Wray NR, Middeldorp CM, Birley AJ, Gordon SD, Sullivan PF, Visscher PM, Nyholt DR, Willemsen G, de Geus EJC, Slagboom PE, Montgomery GW, Martin NG, Boomsma DI. Genome-Wide Linkage Analysis of Multiple Measures of Neuroticism of 2 Large Cohorts From Australia and the Netherlands. Arch Gen Psychiatry. 2008;65(6):649-658. doi:10.1001/archpsyc.65.6.649
People meeting diagnostic criteria for anxiety or depressive disorders tend to score high on the personality scale of neuroticism. Studying this personality dimension can give insights into the etiology of these important psychiatric disorders.
To undertake a comprehensive genome-wide linkage study of neuroticism using large study samples that have been measured multiple times and to compare the results between countries for replication and across time within countries for consistency.
Genome-wide linkage scan.
Twin individuals and their family members from Australia and the Netherlands.
Nineteen thousand six hundred thirty-five sibling pairs completed self-report questionnaires for neuroticism up to 5 times over a period of up to 22 years. Five thousand sixty-nine sibling pairs were genotyped with microsatellite markers.
Nonparametric linkage analyses were conducted in MERLIN-REGRESS for the mean neuroticism scores averaged across time. Additional analyses were conducted for the time-specific measures of neuroticism from each country to investigate consistency of linkage results.
Three chromosomal regions exceeded empirically derived thresholds for suggestive linkage using mean neuroticism scores: 10p 5 Kosambi cM (cM) (Dutch study sample), 14q 103 cM (Dutch study sample), and 18q 117 cM (combined Australian and Dutch study sample), but only 14q retained significance after correction for multiple testing. These regions all showed evidence for linkage in individual time-specific measures of neuroticism and 1 (18q) showed some evidence for replication between countries. Linkage intervals for these regions all overlap with regions identified in other studies of neuroticism or related traits and/or in studies of anxiety in mice.
Our results demonstrate the value of the availability of multiple measures over time and add to the optimism reported in recent reviews for replication of linkage regions for neuroticism. These regions are likely to harbor causal variants for neuroticism and its related psychiatric disorders and can inform prioritization of results from genome-wide association studies.
The personality trait of neuroticism is defined as a tendency to experience psychological distress. Individuals with high neuroticism scores are characterized by emotional instability, low self-esteem, and feelings of anxiety, depression, and guilt.1 Neuroticism scores are found to be high in those with psychiatric disorders such as major depression and anxiety disorders2 and this association appears to be reciprocal. Prospective studies demonstrate that neuroticism or neuroticism-like traits predict future major depression3- 7 and not merely because of overlap with prodromal symptoms of major depression. Self-report questionnaires can be used to score neuroticism as a quantitative trait measurable on large population cohorts.8,9 Therefore, study of neuroticism in large populations is relatively easy and can give insights into the etiology of important psychiatric disorders.
Neuroticism scores have been found to be robust measures, with test-retest correlations of 0.7910 to more than 0.909,11 for scores measured up to 2 years apart and approximately 0.60 for scores measured up to 6 years12 or 19 years11 apart. It is well established that neuroticism is partially under genetic control,13,14 with heritability estimates of 30% to 54%.8,12,15,16 Twin studies have consistently shown no evidence for a shared common environmental component.12,15,17 Genetic correlations between neuroticism scores taken over a 6-year period were higher than 0.88 for all age groups.12 On average, women score higher for neuroticism than men, but heritability estimates are mostly consistent across sex.14- 16 However, opposite-sex sibling correlations16,17 and mother-son correlations15 have been reported as lower, suggesting that different genes may be important in men and women. Estimates of the genetic correlation between neuroticism and depression or anxiety range from 0.4 to 0.8.17- 20
Four previous linkage studies of neuroticism have been published10,16,21,22; 3 of these studies used a single measure of neuroticism and 1 study10 used an average of 2 measures taken 6 months apart. For 2 of the studies, the linkage analyses for neuroticism were secondary to the analyses of the ascertainment criteria of their study cohorts, namely alcohol22 or nicotine21 dependence. Recent reviews14,23 summarized the linkage analysis results from the 3 earliest published of these studies and from an additional 14 studies of psychiatric disorders considered to be genetically related to neuroticism and concluded that some consistency is starting to emerge across studies.
Examples of genetic linkage analysis of longitudinal data on any trait in adults are rare,24 despite recognition that use of multiple measures can increase power by reducing between-sib residual nonshared variance.25 Consistency in linkage regions across repeated measures cannot be considered a replication, as this requires identification of the same linkage region in independent data sets. Nonetheless, inconsistency in linkage regions identified from repeated measures might indicate type I error and biological implausibility of the putative region.
In this study, we report a linkage analysis of neuroticism from 2 large study samples of twin families from Australia and the Netherlands. Individuals in the Australian study have been measured up to 4 times over a 22-year period and on different scales. Individuals in the Dutch sample have been measured up to 5 times over an 11-year period using the same scale. These data sets are independent between countries and therefore provide an opportunity to investigate replication of linkage results. Within countries, there are partly overlapping samples of participants at each measurement occasion, providing an opportunity to investigate consistency of linkage results.
All participants were adult twins and they and their families were recruited through the Australian Twin Registry and were of North European ancestry. All provided written informed consent under study protocols approved by the Queensland Institute of Medical Research Human Research Ethics Committee. Participants completed 1 or more personality questionnaires: the 23-item Eysenck Personality Questionnaire–Revised (EPQ-R),26 the shortened 12-item subset (EPQ-RS), or the NEO Five-Factor Inventory personality questionnaire,27 which includes 12 items in the neuroticism domain and, compared with the EPQ-R, probes angry hostility, self-consciousness, impulsiveness, and vulnerability as well as anxiety and depression. Each individual could have up to 4 measures of neuroticism measured at 4 different times; these (or their transformations, discussed later) are referred to as AU80 (EPQ-R), AU89 (EPQ-RS), AU99 (EPQ-R), and AU02 (NEO), with these trait codes reflecting the approximate year in which the scores were collected. The participants contributing AU80, AU89, and AU99 measures are described in detail elsewhere.8 Briefly, participants contributing AU80 or AU89 scores were ascertained solely on the status of being a twin registered through the Australian Twin Registry or, in the case of AU89, being a family member of a registered twin. The participants contributing AU99 measures were ascertained as siblings pairs selected for discordance or concordance with respect to extreme neuroticism or anxiety or depression scores: one sibling in the top or bottom decile and the other sibling in the top or bottom quintile, excluding monozygotic twin pairs and allowing for selection of multiple siblings per family in an extreme discordant and concordant (EDAC) design.8,28 The EDAC design identifies the sibling pairs who are most informative for genetic studies.29 The participants in the 1999 study had the opportunity to complete the EPQ-R by telephone interview and/or by mail; approximately 80% completed both within 6 months, with a test-retest correlation of 0.9.8,11 The 2 scores were averaged for analysis in this study. The long-term stability of the AU80, AU89, and AU99 measures are reported in Birley et al11 (in which the 1980, 1989, and 1999 studies are named Canberra, alcohol cohorts [where “alcohol” does not refer to any ascertainment criteria], and anxiety studies). The participants contributing AU02 measures were ascertained as being extended twin families with a high incidence of smoking as part of an ongoing nicotine addiction genetics study.30 Where possible, blood (or buccal) samples were obtained from the study participants and their parents.
Families with adolescent and adult twins have been assessed roughly every 2 years since 1991 as part of an ongoing longitudinal survey study of the Netherlands Twin Register. Participants are of Dutch ancestry31 and were recruited under informed consent. Each survey, with the exception of the 1995 wave, collected information on personality and psychopathology31,32 and was conducted under protocols approved by the ethics committee of the Free University Hospital, Amsterdam, the Netherlands. Consequently, each individual could have up to 5 measures of neuroticism measured at 5 different times; these (or their transformations, discussed later) are referred to as NL91, NL93, NL97, NL99, and NL02, with subscript codes reflecting the approximate year in which the scores were collected (corresponding to waves 1, 2, and 4-6 of data collection32). Neuroticism was measured using the Amsterdamse Biografische Vragenlijst,33 a self-report questionnaire similar in content to the EPQ-R.34 The neuroticism scale comprises 30 questions with a 3-item response scale (no, don't know, yes). The neuroticism score is a weighted sum of the item responses.
Neuroticism scores are sum scores and such data typically deviate from normality by having heavy tails. The averaged angular transformation35 was used to normalize the distribution, as in other studies.8,11,16,36 The neuroticism scores used in the analysis were residuals from regression of the transformed neuroticism scores on age, sex, and age × sex (and age2 and age2 × sex for AU89), which were standardized separately for each sex. The mean AU89 score of those selected for measurement in the AU99 study sample was not significantly different from that of the full study group, but the variance was higher. Therefore, the AU99 measures were standardized using the variance of the AU89 cohort so that the higher variance of AU99 measures was maintained. Finally, an average neuroticism score was calculated for each person within each country, denoted by AU and NL. The number of measures contributing to each average was recorded and used as a weight in the repeated-measures linkage analysis. Descriptions of the phenotype (all those measured) and genome scan (only those used in the linkage analysis) data sets are given in Table 1 and Table 2.
The genotypic data available for the Australian study sample resulted from submission of DNA samples to 1 or more of 6 genotyping centers: Gemini P/L, Sequana Therapeutics Inc, Leiden University Medical Centre, the Center for Mammalian Genetics Mammalian Genotyping Service of the Marshfield Clinic Research Foundation, the Australian Genome Research Facility, and the University of Helsinki Finnish Genome Center. Descriptions of the Gemini P/L, Leiden University Medical Centre, Mammalian Genotyping Service, and Sequana Therapeutics Inc genotyping and the subsequent merging and cleaning of the marker data sets are provided in detail elsewhere.37,38 Since then, additional Mammalian Genotyping Service, Australian Genome Research Facility, and Finnish Genome Center genotypes have been merged using the same protocol. Family members were submitted to the same genotyping facility. Participants with measure AU99 were submitted preferentially for genotyping (Figure 1B), but this was not the sole criterion used to select families for genotyping, so the impact of the EDAC design was less marked for the AU89 measure, which was available on the largest subset of samples (Figure 1A). Data cleaning based on mendelian errors, unlikely genotypes, and consistency of pedigree and marker relationships was undertaken as described by Cornes et al.37
Dutch samples were genotyped by the Mammalian Genotyping Service or Leiden University Medical Centre laboratories. The genotype data from these screens were combined. Allele calling and binning were equalized between markers that were present in multiple scans, using approximately 30 control samples. Data cleaning based on mendelian errors, unlikely genotypes, and consistency of pedigree and marker relationships was undertaken as described by Middeldorp et al.39 The distributions of the neuroticism measures for those with and without genome scan data were similar.
Map positions of all genotyped markers were estimated in Kosambi cM (cM) by locally weighted linear regression (http://www.qimr.edu.au/davidD) from the NCBI build 35.1 physical map positions and published Decode and Marshfield genetic map positions.40 Identical markers genotyped at different genotyping facilities were all included, separated by 0.001 cM on the genetic map. Using markers genotyped in common, the proportion of the total variance attributed to differences between the Australians and Dutch samples (Fst) was estimated to be 0.30%, implying that these samples can be combined for joint genetic analysis.41 Individuals were required to have genotypes on more than 280 markers resulting in an average distance of 8.2 cM (Australian sample) and 11.0 cM (Dutch sample) between genotyped markers of sib pairs. Thirty-eight percent (Australian sample) and 51% (Dutch sample) of parents were genotyped.
Phenotypic (test-retest) correlations between the EPQ measures AU80, AU89, and AU99 ranged between 0.59 and 0.62.11,36 Test-retest correlations of these measures with AU02 were lower (range, 0.46-0.54),36 reflecting the different emphasis of some of the items included in the NEO Personality Inventory neuroticism domain. The average phenotypic correlation between the Dutch measures was 0.65 (range, 0.56-0.77), with higher correlations between consecutive measures. The highest sib-pair correlations (estimated in Sib-Pair42) (Table 1) were for the youngest cohorts, NL91 and NL93. The high sib correlation for AU99 is a reflection of the EDAC selection. The lowest sib correlation was for AU02, scored on the NEO scale. Analyses of subsets of the Australian11,15 and Dutch32 data have consistently shown no evidence for the influence of common environmental effects. Genetic correlations were estimated in ASReml43 and ranged between 0.91 and 0.95 between the EPQ measures (AU80, AU89, and AU99) and between 0.80 and 0.95 between these measures and the AU02 NEO measure. Formal testing showed that the measures can be considered repeated measures of the same trait.36 The genetic correlations between the 5 Dutch traits ranged from 0.84 to 0.95. Across all neuroticism measures, averaged estimates of heritability and phenotypic and genetic correlations were 0.32, 0.61, and 0.90, respectively. Preliminary linkage analyses conducted using a full multivariate model (not presented) suggested that there was little to be gained compared with the repeated-measures model with genetic correlations of this magnitude.
Genetic linkage analysis of the autosomes was conducted in MERLIN-REGRESS,44 which regresses estimated identity by descent between relative pairs on the squared sums and squared differences of trait values of the pairs. Investigation of the properties of the method by simulation44 showed it to be powerful and efficient even for selected samples (EDAC designs). It requires phenotypic measures to be standardized in the unselected population sample and uses the population parameters (mean, variance, heritability) derived from the full population sample rather than the selected or genotyped sample. The method is also appropriate for general pedigrees, including multiple sibs per family. However, simulation studies44 showed that, although large sibships can increase power, the distribution of the test statistic can become distorted if the contributions from families become highly skewed. For this reason, sibships were limited to a maximum of 5, selecting sibs who maximized either the discordance or concordance of each family. Mean neuroticism scores were analyzed in MERLIN-REGRESS options −mean 0 −var 1, with heritabilities entered as twice the sib correlations (Table 1), and −testretest, with a correlation of 0.61. Analyses were repeated using mean measures from only males and only females because other studies have reported sex-specific linkage regions.16 Analyses using scores of males or females only are denoted with subscripts in Table 3. Linkage analysis for the X chromosome was conducted in MERLIN MINX. In all analyses, multipoint logarithm of odds (LOD) scores for the presence of a quantitative trait locus (QTL) were estimated every 5 cM (a 1-cM grid was used to determine linkage region confidence intervals, as the region bounded by 1 LOD score less than the maximum observed). Using the 5-cM grid allowed the linkage statistic to be collected over all families even when families were genotyped for different markers. Option −singlepoint was used to identify the individual marker contributing most to regions showing evidence of linkage. Linkage analyses were repeated using individual measures of neuroticism to allow examination of consistency in linkage signal between time-specific measures for each country.
Autosomal genome-wide empirical significance thresholds were derived from 1000 gene-drop simulations as implemented in MERLIN −simulate, which uses the allele frequencies, marker positions, and missing genotype patterns of the real data set and simulates under a model that assumes random linkage between genotype and phenotypes. All phenotypes were analyzed using the same simulated data sets, which maintained the correlation structure between phenotypic measures. The maximum LOD scores from each chromosome of each simulation replicate were retained and were used to derive the empirical LOD score thresholds for Lander-Kruglyak53 suggestive linkage (1 LOD score exceeding the threshold per genome scan) and significant linkage (1 LOD score exceeding the threshold per 20 genome scans) for each neuroticism measure analyzed and for the 9 mean measures of neuroticism simultaneously to derive thresholds that accounted for multiple testing.
Within MERLIN-REGRESS, option −rankFamilies gives an ELOD20 score for each phenotypic measure. The ELOD20 score is the LOD score expected given the data of a QTL that accounts for 20% of the phenotypic variance, assuming fully informative markers. Observed marker informativeness (I) was estimated as the average information content of the 5-cM estimates across the autosomes. The ELOD20 scores corrected for observed marker informativeness were calculated as ELOD20(I) = ELOD20 × I. Both ELOD20(I) and ELOD10(I) scores were used to calculate the power of our study samples given the phenotypic and genotypic information to detect a QTL that accounted for 20% and 10% of the total variance at the empirical significant53 or suggestive53 thresholds for linkage using the Probability Function Calculator of the Genetic Power Calculator,55 where ELOD10(I) = ELOD20(I)/4.
Empirically derived suggestive and significant LOD score thresholds for samples with each level of neuroticism are listed in Table 1. The lowest thresholds are for the samples composed predominantly of a single sib pair per family: AU80, NL91, and NL93. The empirical thresholds for suggestive and significant linkage accounting for the multiple testing of the 9 mean measures of neuroticism were 2.5 and 4.1, respectively.
The means (SD) of the information content across the autosomes as calculated every 5 cM in MERLIN-REGRESS were 0.73 (0.08) (Australian samples) and 0.51 (0.10) (Dutch samples), the difference reflecting the average distance of genotyped markers between sib pairs. The ELOD20 scores are listed in Table 1. By accounting for the observed informativeness of the genotyped markers, we estimate that the Dutch, Australian, and combined study samples have 86%, 99%, and 100% power, respectively, to detect a QTL that accounts for 20% of the total variance at the significant threshold of linkage. These samples have powers of 9%, 27%, and 60% to detect a QTL that accounts for 10% of the total variance at the significant thresholds and 37%, 65%, and 89% at the suggestive thresholds. The power of sex-specific analyses is much lower, as expected from the number of same-sex sib pairs contributing to the analysis. The sex-specific Dutch, Australian, and combined study sample measures have for females 64%, 86%, and 99% and for males 24%, 40%, and 69% power to detect a QTL that accounts for 20% of the total variance at the suggestive threshold of linkage.
The genome-wide linkage plot for the Australian and Dutch samples and the joint analysis of the Australian and Dutch samples (Figure 2) show 3 regions that exceed the empirical threshold for suggestive linkage for their respective measures: 18q 117 cM for the combined Australian and Dutch sample, 14q 103 cM for the Dutch sample, and 10p 5 cM for the Dutch sample. An additional 2 regions just fail to reach the this threshold: 8q 134 cM for the Australian sample and 6cen 75 cM for the Dutch sample. The chromosomal position with the maximum LOD score based on a 1-cM grid scan, linkage intervals, corresponding cytogenetic band, and the marker with the maximum LOD score within the region based on a single-point analysis are listed in Table 3. To investigate these results, we looked for consistency in linkage signal in time-specific measures of neuroticism within the linkage intervals and found that for all 5 regions, at least 2 individual measures achieved a LOD score higher than 1 (Table 3). In contrast, an additional 10 chromosomal regions (NL97, 14q 22 cM; AU80, 16p 56 cM; AU02, 3q 14 cM; AU80, 6q 178 cM; NL02, 9q 149 cM; NL97, 16q 124 cM; AU99, 17p 14 cM; NL93, 18q 60 cM; AU80, 19q 104 cM; and AU89, 21p 21 cM) achieved LOD scores that exceeded the empirical suggestive threshold for significance for an individual measure of neuroticism, but only the first 2 listed achieved a LOD score higher than 1 for any other individual measure within the 1–LOD score drop confidence interval. Within country, some data sets were longitudinal (eg, > 90% of those included in the AU80 or AU99 analyses were also included in the AU89 analysis [Table 2]), while some data sets were largely independent (eg, < 10% of participants with AU80 or AU99 scores were also scored for AU02 [Table 2]). The most extreme example of inconsistency was for AU89, which achieved a LOD score of 2.7 for 21p 21 cM, yet no evidence for linkage was found with the AU99 measure (maximum LOD score within the region of 0.1), a difference that persisted when the analysis was limited to include only phenotypes of individuals who were measured in both studies. Examination of the sib-pair phenotypic scores and the identity by descent of marker alleles in families who contributed most to these linkage signals showed nothing that could not reasonably be attributed to stochastic variation.
Five regions that exceeded empirical suggestive thresholds of linkage are reported for analyses of single-sex average neuroticism scores (Table 3). For the linkage interval of these regions, analyses were conducted for the relevant sex for the 9 time-specific data sets. Of the 5 regions, 3 (2p, 5q, and 15cen) were supported by more than 1 sex-specific individual measure, with a LOD score higher than 1.0 (Table 3). Of the 8 regions listed in Table 3 that exceeded the empirical suggestive threshold for significance, only the 14q region exceeded the threshold that accounts for multiple testing of the 9 mean measures.
We performed a linkage analysis for neuroticism using 2 large independent study samples of North European descent. In total, 5069 sibling pairs contributed to the linkage analysis, which used mean neuroticism scores from both Australia and the Netherlands to maximize sample size and power. Linkage analyses of the mean neuroticism score for each country separately allowed us to look for replication between independent data sets. The mean neuroticism measure of each participant could comprise between 1 and 5 individual measures and we used individual neuroticism scores to look for consistency of linkage results. Although individuals with more than 1 measure of neuroticism age over time, the high genetic correlations between measures would not lead us to expect different genetic variants to be identified in the linkage analysis of different measures. Using mean neuroticism score, we identified 5 regions where the LOD score was higher than 1.5; for 3 of these, the LOD score exceeded the empirical threshold for significance. All 5 regions showed some consistency in linkage scores for individual time-specific measures within country, and 2 regions (8q 134 cM and 18q 117 cM) showed some evidence for replication between countries. Other studies that have reported linkage to these regions are listed in Table 3; we include studies reviewed by Fullerton14 plus a small number of additional, mostly subsequent, publications. Region 18q 117 cM overlaps the linkage intervals reported by 3 other studies: recurrent early-onset and major depression,46 73 cM; neuroticism in females,22 91 cM and 115 cM; and harm avoidance,47 109 cM. Region 14q 103 cM has previously been identified in a linkage analysis of the Dutch study samples for a broad anxiety phenotype39 but also in an independent study of extended families with a high occurrence of anxiety disorders.45 Region 10p 5 cM was estimated from the linkage graph for EPQ neuroticism presented by Fullerton et al16 to have exceeded the level of suggestive linkage. Only the confidence interval of the 18q region overlapped with a region considered to have “reasonable support for linkage” by Fullerton14 (10 regions were identified, representing about 9% of the genome). Also listed in Table 3 are human chromosomal regions homologous to linkage regions from studies of anxiety in mice as summarized by Smoller et al52; 11 homologous human chromosomal regions were identified, which totaled about 17% of the human genome. Linkage studies in mice are relevant because similar brain processes are likely to exist for anxiety in mice and neuroticism in humans13 and the powerful design of studies that are possible in mice can lead to highly significant linkage regions bounded by tight confidence intervals. Of the 5 regions, we identified (Table 3) 4 that overlapped with regions identified by Smoller et al,52 an overlap that exceeds chance expectations (binomial P = .003).
Five sex-specific linkage regions exceeded thresholds of suggestive linkage (Table 3), of which 2cen 112 cM (males) showed evidence for replication between countries and 5q 191 cM (males) showed evidence for consistency between the Dutch time-specific measures. Of these, region 8p has previously been identified in other linkage studies, including 2 male-specific reports (Table 3), and linkage with suicide and recurrent early major depression has been reported for 2p.50 Two of the 5 sex-specific regions overlapped with homologous regions identified by Smoller et al52 from mouse linkage studies. Analyses of male and female mean scores separately had much reduced power compared with the joint-sex analyses, particularly the male-specific analyses, and so we place less emphasis on the sex-specific results.
For a study of its kind, our sample size is large (Table 4), yet the number of linkage regions that we identified for the Dutch, Australian, and combined study samples were 3, 0, and 2 respectively, not very different from the 1 per linkage scan expected by chance. Of the other linkage studies for neuroticism (Table 4), only the study of Fullerton et al16 had more power to detect a QTL. Based on observed phenotypic and marker information, we had 100% (or 89%) power in the combined Australian and Dutch sample to detect a QTL that accounts for 20% (or 10%) of the total variance at the suggestive53 threshold for linkage. For a trait with a heritability of 30%, these are perhaps optimistic power calculations; nonetheless, the next largest neuroticism linkage study10 to date, assuming fully informative marker information, reports only 72% power for a QTL that accounts for 20% of the variance. Studies likely to have much less power to detect a QTL have identified more suggestive and significant linkage regions (Table 4). Theoretically, sample sizes of more than 50 sib pairs should not result in a biased number of linkage statistics exceeding suggestive or significant linkage thresholds under the null hypothesis.53 Although under the alternate hypothesis (when a QTL does exist), an inverse correlation between sample size and LOD score is expected.57 One conclusion is that there simply are no variants that explain 10% or more of the genetic variance. When do our suggestive linkage peaks represent false positives and when does their low significance reflect variants of a smaller effect size? It is not possible to answer this question, but by considering multiple measures of neuroticism, we reduce the impact of the environmental noise surrounding chance extreme concordance or discordance of measures and therefore reduce one cause of the occurrence of false-positive linkage signals. The examination of linkage analyses from the individual measures of neuroticism provides some evidence for the robustness of our results using mean score.
Limitations of our study include different measures of neuroticism, both between countries and within the Australian sample. The Dutch participants came from a younger cohort than the Australian participants. A recent study has suggested that subtle differences in the EPQ-R and NEO neuroticism instruments may be important for genetic studies.58 However, the high genetic correlations between different measures suggest the different measurement instruments are probing the same underlying trait, at least within country. As in other studies, we have undertaken some multiple testing (sex-dependent analyses, both between and within countries) that has not been accounted for in the empirical thresholds derived for each mean measure. The empirical threshold (LOD score, 2.5) derived to account for the multiple testing of the 9 mean neuroticism measures (including sex-specific means) is exceeded only for region 14q 103 cM. Nonetheless, the robustness of our results as measured by consistency in linkage score between time-specific measures, combined with the high rate of overlap with regions reported in other studies, adds to the optimism reported in recent reviews14,46 for replication of linkage regions, even though the true effect sizes of underlying variants are unlikely to be large. A recent genome-wide association study of neuroticism using DNA pooling59 failed to identify any loci that explained more than 1% of the variance. It is unlikely that the consensus in linkage signals across studies is driven by single variants of such a small magnitude but more likely implies allelic heterogeneity of causal variants within functionally important genes. Consistently identified regions from linkage analyses will be important in prioritizing results from genome-wide association studies. Time will tell if genome-wide association studies result in the identification of causal variants that account for most observed genetic variance. International collaborations compiling large family-based study samples for linkage analysis may well be necessary for identification of genes that contain multiple but rare causal variants.
Correspondence: Naomi R. Wray, PhD, Department of Genetic Epidemiology, Queensland Institute of Medical Research, 300 Herston Rd, Brisbane, Australia 4029 (firstname.lastname@example.org).
Submitted for Publication: June 11, 2007; final revision received October 22, 2007; accepted November 26, 2007.
Financial Disclosure: None reported.
Author Contributions: Drs Boomsma and Martin take responsibility for the integrity of the data and Dr Wray takes responsibility for the accuracy of the data analysis.
Funding/Support: The Dutch study was supported by grants 400-05-717, 911-03-016, 904-61-193, 985-10-002, and 575-25-006 from the Netherlands Organization for Scientific Research; grant R01 HL55976 from the National Institutes of Health; and the National Heart, Lung, and Blood Institute Mammalian Genotyping Service, Marshfield, Wisconsin. Dr Middeldorp was supported by grant 13F05(2).47 from the Hersenstichting Nederland. For the Australian study, phenotype collection was funded by grants 971232 and 339450 from the Australian National Health and Medical Research Council and by Gemini Genomics PLC (now defunct) (Dr Martin and Gavin Andrews, MB ChB, MD). Additional funding was provided by grant MH59160 from the US National Institute of Mental Health (Dr Sullivan). We acknowledge the role of Andrew Heath, DPhil, and National Institutes of Health grants AA07535 and AA07728 in earlier projects in which selection variables were collected. For the genome scans, we acknowledge and thank the Mammalian Genotyping Service (Director James Weber, PhD) for genotyping conducted under grants awarded to Daniel T. O’Connor, MD, David Duffy, MD, Dr Nyholt, and Dr Sullivan; Dr Slagboom (Leiden); Peter Reed, PhD (Gemini); Jeff Hall, PhD (Sequana); Aarno Palotie, PhD (Finnish Genome Center, Helsinki); and Sue Forrest, PhD (Australian Genome Research Facility).
Additional Contributions: Arthur Gilmour, PhD, helped with ASReml; David Smyth, BSc, Harry Beeby, BSc, and Olivia Zheng, BsC, performed data management and computer support; and our laboratory staff, especially Megan Campbell, BSc, Anjali Henders, BSc, and Leanne McNeil, BSc, conducted sample processing and preparation. We thank the twins and their relatives for their willing participation in the study, our interviewers, and clerical and administrative support staff supervised by Dixie Statham, MPsychClin.