American_Journal

PMC:2725236 JSON TXT 7 Projects

CMIP and ATP2C2 Modulate Phonological Short-Term Memory in Language Impairment Abstract Specific language impairment (SLI) is a common developmental disorder characterized by difficulties in language acquisition despite otherwise normal development and in the absence of any obvious explanatory factors. We performed a high-density screen of SLI1, a region of chromosome 16q that shows highly significant and consistent linkage to nonword repetition, a measure of phonological short-term memory that is commonly impaired in SLI. Using two independent language-impaired samples, one family-based (211 families) and another selected from a population cohort on the basis of extreme language measures (490 cases), we detected association to two genes in the SLI1 region: that encoding c-maf-inducing protein (CMIP, minP = 5.5 × 10−7 at rs6564903) and that encoding calcium-transporting ATPase, type2C, member2 (ATP2C2, minP = 2.0 × 10−5 at rs11860694). Regression modeling indicated that each of these loci exerts an independent effect upon nonword repetition ability. Despite the consistent findings in language-impaired samples, investigation in a large unselected cohort (n = 3612) did not detect association. We therefore propose that variants in CMIP and ATP2C2 act to modulate phonological short-term memory primarily in the context of language impairment. As such, this investigation supports the hypothesis that some causes of language impairment are distinct from factors that influence normal language variation. This work therefore implicates CMIP and ATP2C2 in the etiology of SLI and provides molecular evidence for the importance of phonological short-term memory in language acquisition. Main Text Developmental speech and language disorders are a heterogeneous group of childhood conditions with variable presentation and etiology. Together, they account for 40% of pediatric referrals1 and statements of educational need.2 The term specific language impairment (SLI) defines a category of speech and language disorders in which a profound language impairment represents the primary deficit.2 This disorder affects 5%–8% of preschool children2 and is highly heritable.3 Nonetheless, in contrast to other related developmental disabilities (e.g., dyslexia [MIM #127700] and attention deficit hyperactivity disorder [ADHD, MIM #143465]), relatively few genetic studies have been performed for SLI. SLI is a prototypical multifactorial disorder that is predicted to involve numerous genetic loci and environmental factors.3 Three primary sites of linkage have been described4,5, the most robust of which is on chromosome 16q (SLI1, MIM #606711). This region is of interest because the linkage is highly specific to a single psychometric measure (nonword repetition).4,6,7 The test for nonword repetition involves the repetition of nonsensical words of increasing length and complexity and is regarded as a measure of phonological (speech sound) processing and short-term memory.8 Individuals with SLI typically perform particularly poorly on nonword repetition, even when their language difficulties have apparently resolved, leading to the postulation that a short-term memory deficit causes susceptibility to SLI9 by impairing the retention of novel verbal information.10 This paper incorporates two contingent investigations: an association screen of the SLI1 region in a cohort of language-impaired families and a subsequent replication study of detected association effects in an independent sample selected from the Avon Longitudinal Study of Parents and Children (ALSPAC) general-population cohort.11,12 The association screen utilized 806 individuals from 211 families ascertained by the SLI Consortium (SLIC). This nuclear-family cohort was collected from five sites across the UK (The Newcomen Centre at Guy's Hospital, London; the Cambridge Language and Speech Project (CLASP)13; the Child Life and Health Department at the University of Edinburgh14; the Department of Child Health at the University of Aberdeen; and the Manchester Language Study15,16) and included the families in whom the SLI1 linkage was originally identified. Ethical permission for each collection was granted by local ethics committees. SLIC families were all selected on the basis of a single proband with receptive and/or expressive language skills more than 1.5 SD below the normative mean for his or her age. A more detailed description of these samples and the exclusionary criteria applied to the SLIC collection can be found in previous publications.4,6,7 Genotyping for the association screen was performed in two phases with a combination of Sequenom and Illumina technologies. We performed an initial high-density screen involving 1906 SNPs to tag all 58 genes (including introns, exons, and 5 Kb 5′ and 2 Kb 3′ of coding sequences) mapped to the 10.29 Mb SLI1 region of linkage (D16S3138–D16S413. Chromosome 16 position 76.16 Mb–86.45 Mb [B35]). Haplotype blocks were built within Haploview17 via the Gabriel method.18 Any between-block gap that was more than 15 Kb in size was tagged with the Tagger algorithm. Two genes that mapped to the region (CDH13 [MIM #601364] and WWOX [MIM #605131]) were found to be larger than 1 Mb in size. For these two genes, blocks were built to cover the exonic regions only. Any region containing a SNP that met our predefined significance threshold (p < 0.001 in any one analysis or p < 0.01 across both analyses) was then supplemented with additional markers in a follow-up panel that included 138 SNPs, eight of which had previously been genotyped. Both phases of genotyping were completed prior to the replication study and were subjected to consistent quality-control procedures. The total genotype mismatch rate was 0.73% for duplicated SNPs and 0.76% for duplicated samples. Across both phases, 261 (12.7%) of SNPs were excluded at the quality-control stage. These included SNPs with a genotype rate of <80%, a minor-allele frequency of <2.5%, SNPs with unusual Beadstudio cluster patterns (Illumina) or atypical peaks in MassArray TyperAnalyser (Sequenom), SNPs with a GenTrain score of <0.5 (Illumina), and markers that showed consistent bad inheritances (>10 errors after data clean up). Across the entire region, the merged data set consisted of, on average, one SNP every 6.4 Kb. Across the known genes, there was on average one SNP every 4.5 Kb, and the largest remaining gap between blocks was 19,579 bp. Details of SNP coverage can be found in Table S1. Q-Q plots can be found in Figure S1. Given the consistent linkage between SLI1 and nonword repetition, all association analyses were based upon this measure. Our principal analysis involved the variance-components modeling of 28-item nonword repetition scores8 within 211 SLIC families (ao option) as a quantitative trait and was performed within QTDT.19 In addition, we performed a categorical case-control allelic test of association within PLINK.20 In this case-control analysis, SLIC individuals with low nonword-repetition scores (>2 SD below population mean, n = 79) were chosen as cases, and family members with above-average performance (>0.5 SD above population mean, n = 71) were used as controls. To avoid interdependence, we selected only one case or control from each family unit. The initial screen involved 1678 SNPs, of which thirteen (0.77%) exceeded our significance threshold, highlighting two primary regions of association (Table 1 and Figure 1). The follow-up panel chiefly included SNPs in these two regions and supported the association seen in the screen while reducing the evidence for association at other loci (Table 2 and Figure 1). Of the 105 SNPs tested in the follow-up panel, five (4.8%) were found to be significantly associated (Table 2 and Figure 1). The first identified cluster of association lay across 26 Kb (exons 2–4) of the CMIP gene (MIM #610112; seven significant SNPs, minP = 5 × 10−7). This gene encodes an adaptor protein and has two isoforms, the shorter of which is involved in cell signaling pathways and is upregulated in minimal change nephrotic syndrome (MCNS), a childhood kidney disease.21 Little is known about the function of the longer transcript. Both isoforms are expressed in the brain.21 The second region of association was observed between exons 7 and 12 (10.8 Kb) of the ATP2C2 gene (six significant SNPs, minP = 2 × 10−5). This gene is one of two secretory-pathway Ca2+-ATPases (SPCAs) that move cytosolic calcium and manganese ions into the golgi.22 Its expression is limited to the brain, testis, gastrointestinal tract, and respiratory tissues and mammary, salivary, and thyroid glands.22 In the mammary gland, ATP2C2 expression facilitates the secretion of Ca2+ into casein micelles during lactation.23 Three lines of evidence indicate that the associations at CMIP and ATP2C2 represent separate effects. First, we did not see any indication of long-range linkage disequilibrium between the two loci (which lie almost 3 Mb apart) in the SLIC cohort or public data (Figure S2). Second, the inclusion of a CMIP covariate in the linkage or association model did not affect the level of linkage or association seen at ATP2C2 (or vice versa for ATP2C2 covariates) (Figure S3). Finally, in a stepwise regression model, the group mean for SLIC individuals carrying a double-risk genotype was found to be significantly lower than those who were homozygous for risk at a single locus (p = 3.7 × 10−6, Table 3). In this model, the group mean for double-risk individuals was 15.8 points (1.05 SD) below that of individuals carrying nonrisk variants at both loci (Table 3). We therefore propose that CMIP and ATP2C2 independently regulate nonword repetition performance and together underlie the linkage seen between SLI and chromosome 16. Our replication sample consisted of 490 cases selected from the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort.11,12 This is a general-population sample that follows the development of 14,062 live-born individuals born in the southwest of England. The ALSPAC group periodically performs an assessment of the development of consenting individuals, and these measurements include tests of language ability. Informed written consent was obtained from the parents at the time of enrolment. Ethical approval for the study was obtained from the ALSPAC Law and Ethics Committee and the Local Research Ethics Committees. Because the current study focuses upon language impairment, we selected individuals from the lower extreme of language-related phenotype distributions (Children's Communication Checklist (CCC)24 and Wechsler Objective Language Dimensions (WOLD)25) for our replication sample. This included 665 individuals (10.3%) with a CCC pragmatic composite 1–3 SD below the ALSPAC population mean (123 ≤ × ≤ 145) or a WOLD listening comprehension score ≥2 SD below the ALSPAC population mean (≤3). Of these individuals, 490 had completed a 12-item nonword repetition test. Because the genotyping in the replication sample was restricted to a single individual from each family, we performed a quantitative association analysis within PLINK20 by using nonword repetition in a linear-regression framework. In addition, we used PLINK20 to carry out a case-control analysis analogous to that described for SLIC. We selected cases and controls from the extremes of the nonword repetition performance distribution of the 490 selected individuals. As expected, given the extreme nature of the language impairment in the SLIC samples, the distribution of nonword repetition differed between the SLIC and ALSPAC cohorts. Therefore, in the replication cohort, the cut-offs used for cases and controls were less extreme than those applied for the association screen. Cases were selected from the identified replication sample to have nonword repetition scores ≥1 SD below the general-population mean (n = 112), and controls had nonword repetition scores ≥1 SD above the general-population mean (n = 72). Data were analyzed for three CMIP and three ATP2C2 SNPs (rs12927866, rs4265801, and rs16955705; and rs16973771, rs2875891, and rs8045507, respectively), and significant associations (p < 0.05) were seen for two CMIP and two ATP2C2 SNPs (Table 4 and Figure 2). Regression trends for ATP2C2 followed those seen in SLIC, replicating the previously described association. Association to CMIP was in an opposite direction from that described above (Table 4 and Figure 2). Although this result might represent a type I error, the consistency of significant association in light of the low number of SNPs tested supports a role for CMIP. Associations can occur in opposite directions if the relationship between the observed and causal variants differs between populations.26 This is particularly true if multiple risk loci interact in an additive or multiplicative fashion26, as is predicted for CMIP. Identification of the causal variant will enable the further characterization of the relationship between risk variants in different populations. Given the partial replication of association, we investigated whether the primary associated SNPs in ATP2C2 and CMIP had an effect upon additional language- and memory-related measures (Table S2). In SLIC, we found borderline association for ATP2C2 with measures of receptive language (oral directions27 [p = 0.006], word classes27 [p = 0.04], and comprehension28 [p = 0.03]), expressive language (formulating sentences27 [p = 0.04]), and vocabulary28 (p = 0.04). In the replication cohort, aside from nonword repetition, we only observed borderline association between ATP2C2 and counting span, a measure of working memory (p = 0.01). In the replication sample, nonword repetition performance had been scored according to the number of syllables the nonword contained. For both CMIP and ATP2C2, the majority of association came from the five-syllable nonwords (p = 0.016 and p = 6 × 10−4, respectively) (Table S2). In neither sample did we observe association to reading-related tasks, which have been reported to show linkage to SLI1.6 Nor did we find any association to digit span28 or recalling sentences,27 two measures that have a high memory load. This is consistent with the finding that nonword repetition correlates with SLI to a higher degree than other short-term memory tests (e.g., digit span). The sensitivity of nonword repetition to SLI could be because it places heavier demands on processing of speech sounds than other memory tests as a result of the child's having to perceive and produce an unfamiliar sequence.29 It is important to note that, although nonword repetition is a good marker for SLI, poor performance on nonword repetition is not a perfect correlate of this disorder.30 In our study, 50% of SLIC probands performed poorly (>1 SD below the expected population mean) on nonword repetition, but a significant number (27%) scored above the expected population mean. These findings support recent opinion that deficits across multiple domains are required to cause persistent language impairments.31 A recent genome-wide association study of ADHD listed a SNP (rs10514604; p = 8 × 10−7) in ATP2C2 within the top 30 significant associations.32 Despite distinct defining characteristics, ADHD and SLI show a high level of comorbidity both with each other32 and with disorders such as developmental coordination disorder, speech-sound disorder (SSD; MIM #608445), and dyslexia.33–35 For example, individuals with SLI, SSD, ADHD, or dyslexia often present with linguistic deficits and impairments in short-term memory.33 It has therefore been suggested that certain aspects of these disorders might share a common etiology. Given the high levels of co-occurrence, we did not exclude children affected by ADHD and dyslexia from our study samples. However, in some of our SLIC samples, data were available for the presence of hyperactivity, coordination, and reading problems. From this, we estimate that approximately one-third of our SLIC samples showed some evidence of ADHD or developmental coordination disorder and that approximately one-half of our probands had reading problems. In the entire ASLPAC sample, 1.3% of individuals met criteria for ADHD. In the selected ALSPAC replication sample, the rate of ADHD increased to 3.7%. Thus, as expected, it is clear that the rate of developmental disorders across our cohorts is elevated over that expected in a population sample. Nonetheless, the association detected in our samples shows a strong correlation to nonword-repetition ability which has repeatedly been shown to be a strong indicator of language impairment.9,10 Furthermore, in ADHD samples, performance on the nonword-repetition task is correlated with linguistic ability rather than the presence of hyperactivity.33,36 Thus, we conclude that variants in ATP2C2 might account for shared aspects of the linguistic deficit in SLI and ADHD. Given this possibility, we also postulate that ATP2C2 might contribute to phonological short-term memory in other developmental disorders. Finally, we investigated the effects of ATP2C2 and CMIP on nonword-repetition performance at the population level. Across the entire unselected ALSPAC population (n = 3612), there was no evidence for quantitative association between nonword-repetition ability and either locus (minP = 0.48). Moreover, there were no differences in allele frequency for ATP2C2 or CMIP SNPs between either SLIC or replication-sample individuals and unselected European population controls (data not shown). Taken together, these data indicate that ATP2C2 and CMIP do not modulate nonword-repetition performance across the entire population, nor, in isolation, do they cause a predisposition to SLI. Instead, we propose that when combined with additional, as-yet-unidentified, susceptibility factors (either genetic or environmental), variants in ATP2C2 and CMIP have a detrimental effect upon nonword repetition performance and thus heighten the risk of developmental language impairments. This situation demonstrates a fundamental principle often overlooked in the mapping of complex disorders: that genetic variants might have selective effects in specific populations depending upon the genetic and environmental background. The question as to whether SLI constitutes a qualitatively distinct disorder caused by abnormal development of language abilities or merely represents the tail end of normal linguistic development is a matter of recent debate.37 Although the absence of association in our population sample could reflect insufficient sample sizes or the insensitivity of psychometric tests to quantify variation beyond the lower extremes of the spectrum, it is obvious that the effects of ATP2C2 and CMIP upon nonword-repetition performance are particularly pertinent to individuals with language difficulties. As such, this investigation provides molecular evidence that, at least in terms of the effects described here, SLI represents a distinct disorder caused by genetic variants discrete from those that influence language ability in the general population. In summary, we have used a positional fine-mapping approach to demonstrate association between ATP2C2 and CMIP and nonword repetition performance across two independent language-impaired populations. We propose that variants in both loci combine to modulate nonword-repetition performance in language-impaired populations. Both genes are expressed in the brain and represent good candidates for language- and memory-related processes. ATP2C2 is involved in the translocation of cytosolic calcium and manganese ions to the golgi.22 Calcium homeostasis is important for the regulation of many neuronal processes, including working memory, synaptic plasticity, and neuronal motility38, and manganese dysregulation has been linked to Parkinsonism (MIM #168600), Alzheimer disease (MIM #104300), and disordered memory.39 The functional role of CMIP is less defined, but it is known to interact with filamin A (MIM #300017)40 and the NF-kappaB subunit RelA (MIM #164014).41 The filaminA protein is involved in the reorganization of the actin cytoskeleton, which is of importance in the formation of the dendritic spine.40 The NF-κB family of transcription factors plays a central role in many neuronal processes, including synaptic activity and memory formation, and members of this family have been implicated in neurodegenerative disorders.42 Further characterization of the observed associations has enabled us to infer that SLI represents a qualitatively distinct disorder caused by a combination of genetic variants that disrupt multiple pathways important to the development of language. It is anticipated that the functional characterization of ATP2C2 and CMIP will promote a better understanding of the molecular basis of language acquisition and aid in the diagnosis and treatment of individuals affected by language disorders. Supplemental Data Document S1. Three Figures and Two Tables Web Resources The URLs for data presented herein are as follows:Illumina, www.illumina.com/ Sequenom, http://www.sequenom.com/ GE Healthcare, http://www6.gelifesciences.com/ Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/Omim Tagger, http://www.broad.mit.edu/mpg/tagger/ Haploview, http://www.broad.mit.edu/mpg/haploview/ QTDT, http://www.sph.umich.edu/csg/abecasis/QTDT/ PLINK, http://pngu.mgh.harvard.edu/∼purcell/plink/ MERLIN, http://www.sph.umich.edu/csg/abecasis/Merlin/ PEDSTATS, http://www.sph.umich.edu/csg/abecasis/PedStats/ HAPMAP, http://www.hapmap.org/ R, http://www.r-project.org/ The Monaco Group at the Wellcome Trust Centre for Human Genetics (Neurogenetics), http://www.well.ox.ac.uk/monaco/ ALSPAC, http://www.bristol.ac.uk/alspac/ Manchester Language Study, http://www.manchesterlanguagestudy.co.uk/ Acknowledgments We thank all the families and professionals who participated in the study, Caroline Durrant and Jean-Baptiste Cazier for statistical advice, members of the Monaco lab for support, and Leila Jannoun, Jane Addison, Clare Craven, Deborah Jones, Tilly Storr, Til Utting-Brown, Margaret Main, Jane Steele, and Alan MacLean for assistance with data collection and management. We are extremely grateful to all the families who took part in the ALSPAC study, to the midwives for their help in recruiting them, and to the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists, and nurses. The UK Medical Research Council, the Wellcome Trust, and the University of Bristol provide core support for ALSPAC. This publication is the work of the authors, and D.F. Newbury and A.P. Monaco will serve as guarantors for this paper's contents. The Wellcome Trust specifically funded this research. All laboratory work and the collection of data from families ascertained by Guy's Hospital and the University of Manchester were funded by The Wellcome Trust. CLASP was funded by The Wellcome Trust, British Telecom, Isaac Newton Trust, National Health Service (NHS) Anglia & Oxford Regional R&D Strategic Investment Award, and an NHS Eastern Region R&D Training Fellowship Award. The Edinburgh group was supported by the Chief Scientist's Office, Scotland. The Aberdeen group was supported by Grampian Healthcare Trust and Grampian Primary Care NHS Trust. D.V.M. Bishop is a Wellcome Trust Principal Research Fellow, and S.E. Fisher is a Royal Society Research Fellow. Figure 1 Association in SLIC Cohort Association results for family-based quantitaive analysis and case-control analysis of nonword repetition across the SLI1 region. In the case-control analysis, cases and controls were selected on the basis of their nonword-repetition performance (see text). Gaps in data represent regions where there are no mapped genes. SNPS included in the screen genotype panel are shown as +, and SNPs included in the follow-up genotype panel are shown as x. Figure 2 Nonword-Repetition Means for CMIP and ATP2C2 in SLIC and Replication Cohorts (A) CMIP. (B) ATP2C2. All means are for age- and sex-adjusted nonword-repetition scores standardized with a mean of 0 and a SD of 1. The three CMIP SNPs (rs12927866, rs4265801, and rs16955705) show genotype trends in the opposite direction from SLIC (A), whereas the three ATP2C2 SNPs (rs16973771, rs2875891, and rs8045507) show genotype trends in the same direction as SLIC (B). Table 1 Significant Association in the SLIC Association Screen SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) A1 CEPH Frequency Typed Strand p Quant Effect Size Heritability p Emp QTDT p Case-Cont Frequency of A1 Cases Frequency of A1 Controls Odds ratio (95% CI) p Emp PLINK rs8051754 78,554,834 intergenic T/C∗ 0.46 − 0.0931 −0.28 ± 0.11 0.019 0.0892 0.0007∗ 0.64 0.85 3.1 (1.6–6.0) 0.0018∗ rs4417561 78,568,860 intergenic G∗/C 0.26 − 0.0244 −0.30 ± 0.11 0.022 0.0252 0.0004∗ 0.37 0.15 3.2 (1.7–6.3) 0.0011∗ rs2316184 79,204,885 CDYL2 G/A∗ 0.14 + 0.0032∗ -0.48 ± 0.12∗ 0.045 0.0034∗ 0.0096∗ 0.15 0.30 2.5 (1.2–4.9) 0.0126 rs12927866 80,209,823 CMIP A/G∗ 0.47 − 0.4104 −0.27 ± 0.10 0.019 0.3581 0.0003∗ 0.29 0.49 2.4 (1.5–3.9) 0.0004∗ rs4265801 80,222,553 CMIP T∗/G 0.43 + 0.3446 −0.09 ± 0.09 0.030 0.5065 4 × 10−5∗ 0.61 0.29 3.9 (2.0–7.6) 0.0393∗ rs7201632 80,234,949 CMIP C/T∗ 0.49 + 0.8966 −0.25 ± 0.09 0.017 0.7975 0.0004∗ 0.36 0.56 2.3 (1.4–3.7) 0.0004∗ rs3785054 82,918,978 WFDC1 C∗/T 0.36 − 0.0044∗ −0.29 ± 0.10∗ 0.019 0.0033∗ 0.0089∗ 0.34 0.20 2.0 (1.2–3.4) 0.0102 rs8053211 83,011,254 ATP2C2 A∗/G 0.46 + 5 × 10−5∗ −0.38 ± 0.09∗ 0.040 3 × 10−5∗ 0.0014∗ 0.61 0.43 2.1 (1.3–3.3) 0.0029∗ rs11860694 83,014,948 ATP2C2 C∗/G 0.54 − 2 × 10−5∗ −0.37 ± 0.09∗ 0.039 9 × 10−6∗ 0.0018∗ 0.61 0.43 2.1 (1.3–3.3) 0.0027∗ rs16973771 83,018,079 ATP2C2 G/A∗ 0.48 − 0.0003∗ −0.35 ± 0.09∗ 0.034 0.0006∗ 0.0025∗ 0.34 0.51 2.0 (1.3–3.2) 0.0036∗ rs2875891 83,021,410 ATP2C2 T/C∗ 0.44 + 0.0057∗ −0.34∗ ± 0.10∗ 0.031 0.0063∗ 0.0022∗ 0.30 0.47 2.1 (1.3–3.4) 0.0026∗ rs8045507 83,022,078 ATP2C2 T/C∗ 0.48 − 0.0017∗ −0.33 ± 0.09∗ 0.029 0.0020∗ 0.0022∗ 0.34 0.51 2.1 (1.3–3.3) 0.0028∗ Three significant SNPs fell within the CMIP gene, and five fell within ATP2C2. The remaining four significant SNPs were either intergenic or isolated signals of association. SNP alleles are given with the minor allele in the SLIC sample first. Putative risk alleles are marked with an asterisk. P Quant gives the p value for the quantitative, family-based analysis. p case-cont gives the p value for the case-control analysis. p values <0.01 are marked with an asterisk. The odds ratios indicate the ratio of case/control odds for each additional copy of the putative risk allele. Odds ratios were calculated within PLINK. The effect size is the estimated effect of each risk allele on the nonword repetition score (in SD ± SE). Effect sizes were calculated with MERLIN. Heritability gives the proportion of total variance explained by the SNP. Heritability estimates were calculated with MERLIN. The p Emp column gives empirical p values for the given SNP; these values were derived from permutations within QTDT or PLINK. Table 2 Significant Association in the SLIC Cohort with the Follow-up Panel SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) A1 CEPH Frequency Typed Strand P Quant Effect Size Heritability p Emp QTDT p Case-Cont Frequency of A1 Cases Frequency of A1 Controls Odds Ratio (95% CI) p Emp PLINK rs6564903 80,211,158 CMIP C∗/T 0.48 + 0.1279 −0.37 ± 0.10 0.038 0.1225 5 × 10−7∗ 0.79 0.38 3.5 (2.1–5.9) 1 × 10−6∗ rs3935802 80,219,068 CMIP G∗/C 0.46 − 0.2667 −0.31 ± 0.10 0.025 0.2486 0.0003∗ 0.71 0.49 2.5 (1.5–4.2) 0.0006∗ rs16955705 80,230,851 CMIP C/A∗ 0.50 + 0.3916 −0.25 ± 0.10 0.017 0.3627 0.0003∗ 0.31 0.54 2.6 (1.5–4.4) 0.0003∗ rs4243209 80,247,592 CMIP C/T∗ 0.22 + 0.0065∗ −0.42 ± 0.12 0.027 0.0043∗ 0.0007∗ 0.11 0.26 3.0 (1.6–5.8) 0.0012∗ rs12149426 83,022,607 ATP2C2 A/C∗ 0.26 + 0.0064∗ −0.31 ± 0.12 0.017 0.0082∗ 0.0082∗ 0.14 0.27 2.3 (1.2–4.2) 0.0039∗ Of the 105 SNPs analyzed in the follow-up panel, 16 lay in CMIP, 76 lay in the ATP2C2 gene, and the remaining 13 lay in other regions that had shown association in the screen (see Table 1). Eight SNPs were genotyped in both the screen and follow-up panels. All of these markers showed some evidence of association in the screen phase (p < 0.01) but had genotype success rates of <95%, and none lay within CMIP or ATP2C2. Each of the duplicated SNPs showed increased success rates and decreased association levels in the follow-up panel. SNP alleles are given with the minor allele in the SLIC sample first. Putative risk alleles are marked with an asterisk. p Quant gives the p value for the quantitative, family-based analysis. p case-cont gives the p value for the case-control analysis. p values <0.01 are shown in bold. The odds ratios indicate the ratio of case/control odds for each additional copy of the putative risk allele. Odds ratios were calculated within PLINK. The effect size is the estimated effect of each risk allele on the nonword-repetition score (in SD ± SE). Effect sizes were calculated with MERLIN. Heritability gives the proportion of total variance explained by the SNP. Heritability estimates were calculated with MERLIN. The p Emp column gives empirical p values for the given SNP; these values were derived from permutations within QTDT or PLINK. Table 3 Nonword-Repetition Group Means for CMIP and ATP2C2 Risk Variants Genotype (Number of Risk Alleles) Single SNP rs6564903 (CMIP) TT (0) CT (1) CC (2) Single SNP 96.62 92.57 86.30 rs11860694 (ATP2C2) GG (0) 96.54 99.14 99.85 89.65 CG (1) 91.77 99.40 93.10 85.84 CC (2) 87.03 88.44 88.33 83.32 The effects of CMIP (rs6564903) and ATP2C2 (rs11860694) on nonword-repetition performance were modeled as additive effects within a regression framework in the R package. This regression model included all available SLIC children with genotype and nonword-repetition data (n = 503). Group means were calculated for each SNP in isolation (“Single SNP” entries) and in combinations of genotypes (3 × 3 grid) across risk SNPs. Note that individuals carrying combinations of risk alleles performed significantly worse than those carrying risk variants at a single locus. Nonword-repetition scores are age adjusted and standardized against normal population controls with a mean of 100 and a SD of 15. Table 4 Association in the Replication Cohort SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) SLIC Risk Allele A1 CEPH Frequency Typed Strand p Quant Effect Size p Case-Cont Frequency of A1 Cases Frequency of A1 controls Odds Ratio (95% CI) rs12927866 80,209,823 CMIP T/C C 0.47 + 0.1623 −0.08 0.0955 0.39 0.30 1.5 (0.9-2.3) rs4265801 80,222,553 CMIP T/G∗ T 0.43 + 0.0182∗ −0.15 0.0214∗ 0.43 0.56 1.6 (1.1-2.5) rs16955705 80,230,851 CMIP C∗/A A 0.50 + 0.0238∗ −0.14 0.0257∗ 0.48 0.36 1.6 (1.1-2.5) rs16973771 83,018,079 ATP2C2 C/T∗ T 0.48 + 0.0079∗ −0.14 0.0135∗ 0.32 0.45 1.7 (1.1-2.7) rs2875891 83,021,410 ATP2C2 T/C C 0.44 + 0.0668 −0.06 0.0802 0.29 0.37 1.5 (1.0-2.3) rs8045507 83,022,078 ATP2C2 A/G∗ G 0.48 + 0.0058∗ −0.15 0.0110∗ 0.31 0.44 1.8 (1.1-2.7) SNP alleles are given with the minor allele first. Putative risk alleles in the replication cohort are marked with an asterisk. p Quant shows the p value for the quantitative analysis. p < 0.05 are highlighted in bold. The odds ratio indicates the ratio of case/control odds for each additional copy of the putative risk allele. The 95% confidence intervals for the odds ratios of all significantly associated SNPs exceeded 1.0. The effect size is the estimated effect of each risk allele on the nonword-repetition score (in SD).

Document structure show

article-title	CMIP and ATP2C2 Modulate Phonological Short-Term Memory in Language Impairment
abstract	Specific language impairment (SLI) is a common developmental disorder characterized by difficulties in language acquisition despite otherwise normal development and in the absence of any obvious explanatory factors. We performed a high-density screen of SLI1, a region of chromosome 16q that shows highly significant and consistent linkage to nonword repetition, a measure of phonological short-term memory that is commonly impaired in SLI. Using two independent language-impaired samples, one family-based (211 families) and another selected from a population cohort on the basis of extreme language measures (490 cases), we detected association to two genes in the SLI1 region: that encoding c-maf-inducing protein (CMIP, minP = 5.5 × 10−7 at rs6564903) and that encoding calcium-transporting ATPase, type2C, member2 (ATP2C2, minP = 2.0 × 10−5 at rs11860694). Regression modeling indicated that each of these loci exerts an independent effect upon nonword repetition ability. Despite the consistent findings in language-impaired samples, investigation in a large unselected cohort (n = 3612) did not detect association. We therefore propose that variants in CMIP and ATP2C2 act to modulate phonological short-term memory primarily in the context of language impairment. As such, this investigation supports the hypothesis that some causes of language impairment are distinct from factors that influence normal language variation. This work therefore implicates CMIP and ATP2C2 in the etiology of SLI and provides molecular evidence for the importance of phonological short-term memory in language acquisition.
p	Specific language impairment (SLI) is a common developmental disorder characterized by difficulties in language acquisition despite otherwise normal development and in the absence of any obvious explanatory factors. We performed a high-density screen of SLI1, a region of chromosome 16q that shows highly significant and consistent linkage to nonword repetition, a measure of phonological short-term memory that is commonly impaired in SLI. Using two independent language-impaired samples, one family-based (211 families) and another selected from a population cohort on the basis of extreme language measures (490 cases), we detected association to two genes in the SLI1 region: that encoding c-maf-inducing protein (CMIP, minP = 5.5 × 10−7 at rs6564903) and that encoding calcium-transporting ATPase, type2C, member2 (ATP2C2, minP = 2.0 × 10−5 at rs11860694). Regression modeling indicated that each of these loci exerts an independent effect upon nonword repetition ability. Despite the consistent findings in language-impaired samples, investigation in a large unselected cohort (n = 3612) did not detect association. We therefore propose that variants in CMIP and ATP2C2 act to modulate phonological short-term memory primarily in the context of language impairment. As such, this investigation supports the hypothesis that some causes of language impairment are distinct from factors that influence normal language variation. This work therefore implicates CMIP and ATP2C2 in the etiology of SLI and provides molecular evidence for the importance of phonological short-term memory in language acquisition.
body	Main Text Developmental speech and language disorders are a heterogeneous group of childhood conditions with variable presentation and etiology. Together, they account for 40% of pediatric referrals1 and statements of educational need.2 The term specific language impairment (SLI) defines a category of speech and language disorders in which a profound language impairment represents the primary deficit.2 This disorder affects 5%–8% of preschool children2 and is highly heritable.3 Nonetheless, in contrast to other related developmental disabilities (e.g., dyslexia [MIM #127700] and attention deficit hyperactivity disorder [ADHD, MIM #143465]), relatively few genetic studies have been performed for SLI. SLI is a prototypical multifactorial disorder that is predicted to involve numerous genetic loci and environmental factors.3 Three primary sites of linkage have been described4,5, the most robust of which is on chromosome 16q (SLI1, MIM #606711). This region is of interest because the linkage is highly specific to a single psychometric measure (nonword repetition).4,6,7 The test for nonword repetition involves the repetition of nonsensical words of increasing length and complexity and is regarded as a measure of phonological (speech sound) processing and short-term memory.8 Individuals with SLI typically perform particularly poorly on nonword repetition, even when their language difficulties have apparently resolved, leading to the postulation that a short-term memory deficit causes susceptibility to SLI9 by impairing the retention of novel verbal information.10 This paper incorporates two contingent investigations: an association screen of the SLI1 region in a cohort of language-impaired families and a subsequent replication study of detected association effects in an independent sample selected from the Avon Longitudinal Study of Parents and Children (ALSPAC) general-population cohort.11,12 The association screen utilized 806 individuals from 211 families ascertained by the SLI Consortium (SLIC). This nuclear-family cohort was collected from five sites across the UK (The Newcomen Centre at Guy's Hospital, London; the Cambridge Language and Speech Project (CLASP)13; the Child Life and Health Department at the University of Edinburgh14; the Department of Child Health at the University of Aberdeen; and the Manchester Language Study15,16) and included the families in whom the SLI1 linkage was originally identified. Ethical permission for each collection was granted by local ethics committees. SLIC families were all selected on the basis of a single proband with receptive and/or expressive language skills more than 1.5 SD below the normative mean for his or her age. A more detailed description of these samples and the exclusionary criteria applied to the SLIC collection can be found in previous publications.4,6,7 Genotyping for the association screen was performed in two phases with a combination of Sequenom and Illumina technologies. We performed an initial high-density screen involving 1906 SNPs to tag all 58 genes (including introns, exons, and 5 Kb 5′ and 2 Kb 3′ of coding sequences) mapped to the 10.29 Mb SLI1 region of linkage (D16S3138–D16S413. Chromosome 16 position 76.16 Mb–86.45 Mb [B35]). Haplotype blocks were built within Haploview17 via the Gabriel method.18 Any between-block gap that was more than 15 Kb in size was tagged with the Tagger algorithm. Two genes that mapped to the region (CDH13 [MIM #601364] and WWOX [MIM #605131]) were found to be larger than 1 Mb in size. For these two genes, blocks were built to cover the exonic regions only. Any region containing a SNP that met our predefined significance threshold (p < 0.001 in any one analysis or p < 0.01 across both analyses) was then supplemented with additional markers in a follow-up panel that included 138 SNPs, eight of which had previously been genotyped. Both phases of genotyping were completed prior to the replication study and were subjected to consistent quality-control procedures. The total genotype mismatch rate was 0.73% for duplicated SNPs and 0.76% for duplicated samples. Across both phases, 261 (12.7%) of SNPs were excluded at the quality-control stage. These included SNPs with a genotype rate of <80%, a minor-allele frequency of <2.5%, SNPs with unusual Beadstudio cluster patterns (Illumina) or atypical peaks in MassArray TyperAnalyser (Sequenom), SNPs with a GenTrain score of <0.5 (Illumina), and markers that showed consistent bad inheritances (>10 errors after data clean up). Across the entire region, the merged data set consisted of, on average, one SNP every 6.4 Kb. Across the known genes, there was on average one SNP every 4.5 Kb, and the largest remaining gap between blocks was 19,579 bp. Details of SNP coverage can be found in Table S1. Q-Q plots can be found in Figure S1. Given the consistent linkage between SLI1 and nonword repetition, all association analyses were based upon this measure. Our principal analysis involved the variance-components modeling of 28-item nonword repetition scores8 within 211 SLIC families (ao option) as a quantitative trait and was performed within QTDT.19 In addition, we performed a categorical case-control allelic test of association within PLINK.20 In this case-control analysis, SLIC individuals with low nonword-repetition scores (>2 SD below population mean, n = 79) were chosen as cases, and family members with above-average performance (>0.5 SD above population mean, n = 71) were used as controls. To avoid interdependence, we selected only one case or control from each family unit. The initial screen involved 1678 SNPs, of which thirteen (0.77%) exceeded our significance threshold, highlighting two primary regions of association (Table 1 and Figure 1). The follow-up panel chiefly included SNPs in these two regions and supported the association seen in the screen while reducing the evidence for association at other loci (Table 2 and Figure 1). Of the 105 SNPs tested in the follow-up panel, five (4.8%) were found to be significantly associated (Table 2 and Figure 1). The first identified cluster of association lay across 26 Kb (exons 2–4) of the CMIP gene (MIM #610112; seven significant SNPs, minP = 5 × 10−7). This gene encodes an adaptor protein and has two isoforms, the shorter of which is involved in cell signaling pathways and is upregulated in minimal change nephrotic syndrome (MCNS), a childhood kidney disease.21 Little is known about the function of the longer transcript. Both isoforms are expressed in the brain.21 The second region of association was observed between exons 7 and 12 (10.8 Kb) of the ATP2C2 gene (six significant SNPs, minP = 2 × 10−5). This gene is one of two secretory-pathway Ca2+-ATPases (SPCAs) that move cytosolic calcium and manganese ions into the golgi.22 Its expression is limited to the brain, testis, gastrointestinal tract, and respiratory tissues and mammary, salivary, and thyroid glands.22 In the mammary gland, ATP2C2 expression facilitates the secretion of Ca2+ into casein micelles during lactation.23 Three lines of evidence indicate that the associations at CMIP and ATP2C2 represent separate effects. First, we did not see any indication of long-range linkage disequilibrium between the two loci (which lie almost 3 Mb apart) in the SLIC cohort or public data (Figure S2). Second, the inclusion of a CMIP covariate in the linkage or association model did not affect the level of linkage or association seen at ATP2C2 (or vice versa for ATP2C2 covariates) (Figure S3). Finally, in a stepwise regression model, the group mean for SLIC individuals carrying a double-risk genotype was found to be significantly lower than those who were homozygous for risk at a single locus (p = 3.7 × 10−6, Table 3). In this model, the group mean for double-risk individuals was 15.8 points (1.05 SD) below that of individuals carrying nonrisk variants at both loci (Table 3). We therefore propose that CMIP and ATP2C2 independently regulate nonword repetition performance and together underlie the linkage seen between SLI and chromosome 16. Our replication sample consisted of 490 cases selected from the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort.11,12 This is a general-population sample that follows the development of 14,062 live-born individuals born in the southwest of England. The ALSPAC group periodically performs an assessment of the development of consenting individuals, and these measurements include tests of language ability. Informed written consent was obtained from the parents at the time of enrolment. Ethical approval for the study was obtained from the ALSPAC Law and Ethics Committee and the Local Research Ethics Committees. Because the current study focuses upon language impairment, we selected individuals from the lower extreme of language-related phenotype distributions (Children's Communication Checklist (CCC)24 and Wechsler Objective Language Dimensions (WOLD)25) for our replication sample. This included 665 individuals (10.3%) with a CCC pragmatic composite 1–3 SD below the ALSPAC population mean (123 ≤ × ≤ 145) or a WOLD listening comprehension score ≥2 SD below the ALSPAC population mean (≤3). Of these individuals, 490 had completed a 12-item nonword repetition test. Because the genotyping in the replication sample was restricted to a single individual from each family, we performed a quantitative association analysis within PLINK20 by using nonword repetition in a linear-regression framework. In addition, we used PLINK20 to carry out a case-control analysis analogous to that described for SLIC. We selected cases and controls from the extremes of the nonword repetition performance distribution of the 490 selected individuals. As expected, given the extreme nature of the language impairment in the SLIC samples, the distribution of nonword repetition differed between the SLIC and ALSPAC cohorts. Therefore, in the replication cohort, the cut-offs used for cases and controls were less extreme than those applied for the association screen. Cases were selected from the identified replication sample to have nonword repetition scores ≥1 SD below the general-population mean (n = 112), and controls had nonword repetition scores ≥1 SD above the general-population mean (n = 72). Data were analyzed for three CMIP and three ATP2C2 SNPs (rs12927866, rs4265801, and rs16955705; and rs16973771, rs2875891, and rs8045507, respectively), and significant associations (p < 0.05) were seen for two CMIP and two ATP2C2 SNPs (Table 4 and Figure 2). Regression trends for ATP2C2 followed those seen in SLIC, replicating the previously described association. Association to CMIP was in an opposite direction from that described above (Table 4 and Figure 2). Although this result might represent a type I error, the consistency of significant association in light of the low number of SNPs tested supports a role for CMIP. Associations can occur in opposite directions if the relationship between the observed and causal variants differs between populations.26 This is particularly true if multiple risk loci interact in an additive or multiplicative fashion26, as is predicted for CMIP. Identification of the causal variant will enable the further characterization of the relationship between risk variants in different populations. Given the partial replication of association, we investigated whether the primary associated SNPs in ATP2C2 and CMIP had an effect upon additional language- and memory-related measures (Table S2). In SLIC, we found borderline association for ATP2C2 with measures of receptive language (oral directions27 [p = 0.006], word classes27 [p = 0.04], and comprehension28 [p = 0.03]), expressive language (formulating sentences27 [p = 0.04]), and vocabulary28 (p = 0.04). In the replication cohort, aside from nonword repetition, we only observed borderline association between ATP2C2 and counting span, a measure of working memory (p = 0.01). In the replication sample, nonword repetition performance had been scored according to the number of syllables the nonword contained. For both CMIP and ATP2C2, the majority of association came from the five-syllable nonwords (p = 0.016 and p = 6 × 10−4, respectively) (Table S2). In neither sample did we observe association to reading-related tasks, which have been reported to show linkage to SLI1.6 Nor did we find any association to digit span28 or recalling sentences,27 two measures that have a high memory load. This is consistent with the finding that nonword repetition correlates with SLI to a higher degree than other short-term memory tests (e.g., digit span). The sensitivity of nonword repetition to SLI could be because it places heavier demands on processing of speech sounds than other memory tests as a result of the child's having to perceive and produce an unfamiliar sequence.29 It is important to note that, although nonword repetition is a good marker for SLI, poor performance on nonword repetition is not a perfect correlate of this disorder.30 In our study, 50% of SLIC probands performed poorly (>1 SD below the expected population mean) on nonword repetition, but a significant number (27%) scored above the expected population mean. These findings support recent opinion that deficits across multiple domains are required to cause persistent language impairments.31 A recent genome-wide association study of ADHD listed a SNP (rs10514604; p = 8 × 10−7) in ATP2C2 within the top 30 significant associations.32 Despite distinct defining characteristics, ADHD and SLI show a high level of comorbidity both with each other32 and with disorders such as developmental coordination disorder, speech-sound disorder (SSD; MIM #608445), and dyslexia.33–35 For example, individuals with SLI, SSD, ADHD, or dyslexia often present with linguistic deficits and impairments in short-term memory.33 It has therefore been suggested that certain aspects of these disorders might share a common etiology. Given the high levels of co-occurrence, we did not exclude children affected by ADHD and dyslexia from our study samples. However, in some of our SLIC samples, data were available for the presence of hyperactivity, coordination, and reading problems. From this, we estimate that approximately one-third of our SLIC samples showed some evidence of ADHD or developmental coordination disorder and that approximately one-half of our probands had reading problems. In the entire ASLPAC sample, 1.3% of individuals met criteria for ADHD. In the selected ALSPAC replication sample, the rate of ADHD increased to 3.7%. Thus, as expected, it is clear that the rate of developmental disorders across our cohorts is elevated over that expected in a population sample. Nonetheless, the association detected in our samples shows a strong correlation to nonword-repetition ability which has repeatedly been shown to be a strong indicator of language impairment.9,10 Furthermore, in ADHD samples, performance on the nonword-repetition task is correlated with linguistic ability rather than the presence of hyperactivity.33,36 Thus, we conclude that variants in ATP2C2 might account for shared aspects of the linguistic deficit in SLI and ADHD. Given this possibility, we also postulate that ATP2C2 might contribute to phonological short-term memory in other developmental disorders. Finally, we investigated the effects of ATP2C2 and CMIP on nonword-repetition performance at the population level. Across the entire unselected ALSPAC population (n = 3612), there was no evidence for quantitative association between nonword-repetition ability and either locus (minP = 0.48). Moreover, there were no differences in allele frequency for ATP2C2 or CMIP SNPs between either SLIC or replication-sample individuals and unselected European population controls (data not shown). Taken together, these data indicate that ATP2C2 and CMIP do not modulate nonword-repetition performance across the entire population, nor, in isolation, do they cause a predisposition to SLI. Instead, we propose that when combined with additional, as-yet-unidentified, susceptibility factors (either genetic or environmental), variants in ATP2C2 and CMIP have a detrimental effect upon nonword repetition performance and thus heighten the risk of developmental language impairments. This situation demonstrates a fundamental principle often overlooked in the mapping of complex disorders: that genetic variants might have selective effects in specific populations depending upon the genetic and environmental background. The question as to whether SLI constitutes a qualitatively distinct disorder caused by abnormal development of language abilities or merely represents the tail end of normal linguistic development is a matter of recent debate.37 Although the absence of association in our population sample could reflect insufficient sample sizes or the insensitivity of psychometric tests to quantify variation beyond the lower extremes of the spectrum, it is obvious that the effects of ATP2C2 and CMIP upon nonword-repetition performance are particularly pertinent to individuals with language difficulties. As such, this investigation provides molecular evidence that, at least in terms of the effects described here, SLI represents a distinct disorder caused by genetic variants discrete from those that influence language ability in the general population. In summary, we have used a positional fine-mapping approach to demonstrate association between ATP2C2 and CMIP and nonword repetition performance across two independent language-impaired populations. We propose that variants in both loci combine to modulate nonword-repetition performance in language-impaired populations. Both genes are expressed in the brain and represent good candidates for language- and memory-related processes. ATP2C2 is involved in the translocation of cytosolic calcium and manganese ions to the golgi.22 Calcium homeostasis is important for the regulation of many neuronal processes, including working memory, synaptic plasticity, and neuronal motility38, and manganese dysregulation has been linked to Parkinsonism (MIM #168600), Alzheimer disease (MIM #104300), and disordered memory.39 The functional role of CMIP is less defined, but it is known to interact with filamin A (MIM #300017)40 and the NF-kappaB subunit RelA (MIM #164014).41 The filaminA protein is involved in the reorganization of the actin cytoskeleton, which is of importance in the formation of the dendritic spine.40 The NF-κB family of transcription factors plays a central role in many neuronal processes, including synaptic activity and memory formation, and members of this family have been implicated in neurodegenerative disorders.42 Further characterization of the observed associations has enabled us to infer that SLI represents a qualitatively distinct disorder caused by a combination of genetic variants that disrupt multiple pathways important to the development of language. It is anticipated that the functional characterization of ATP2C2 and CMIP will promote a better understanding of the molecular basis of language acquisition and aid in the diagnosis and treatment of individuals affected by language disorders.
sec	Main Text Developmental speech and language disorders are a heterogeneous group of childhood conditions with variable presentation and etiology. Together, they account for 40% of pediatric referrals1 and statements of educational need.2 The term specific language impairment (SLI) defines a category of speech and language disorders in which a profound language impairment represents the primary deficit.2 This disorder affects 5%–8% of preschool children2 and is highly heritable.3 Nonetheless, in contrast to other related developmental disabilities (e.g., dyslexia [MIM #127700] and attention deficit hyperactivity disorder [ADHD, MIM #143465]), relatively few genetic studies have been performed for SLI. SLI is a prototypical multifactorial disorder that is predicted to involve numerous genetic loci and environmental factors.3 Three primary sites of linkage have been described4,5, the most robust of which is on chromosome 16q (SLI1, MIM #606711). This region is of interest because the linkage is highly specific to a single psychometric measure (nonword repetition).4,6,7 The test for nonword repetition involves the repetition of nonsensical words of increasing length and complexity and is regarded as a measure of phonological (speech sound) processing and short-term memory.8 Individuals with SLI typically perform particularly poorly on nonword repetition, even when their language difficulties have apparently resolved, leading to the postulation that a short-term memory deficit causes susceptibility to SLI9 by impairing the retention of novel verbal information.10 This paper incorporates two contingent investigations: an association screen of the SLI1 region in a cohort of language-impaired families and a subsequent replication study of detected association effects in an independent sample selected from the Avon Longitudinal Study of Parents and Children (ALSPAC) general-population cohort.11,12 The association screen utilized 806 individuals from 211 families ascertained by the SLI Consortium (SLIC). This nuclear-family cohort was collected from five sites across the UK (The Newcomen Centre at Guy's Hospital, London; the Cambridge Language and Speech Project (CLASP)13; the Child Life and Health Department at the University of Edinburgh14; the Department of Child Health at the University of Aberdeen; and the Manchester Language Study15,16) and included the families in whom the SLI1 linkage was originally identified. Ethical permission for each collection was granted by local ethics committees. SLIC families were all selected on the basis of a single proband with receptive and/or expressive language skills more than 1.5 SD below the normative mean for his or her age. A more detailed description of these samples and the exclusionary criteria applied to the SLIC collection can be found in previous publications.4,6,7 Genotyping for the association screen was performed in two phases with a combination of Sequenom and Illumina technologies. We performed an initial high-density screen involving 1906 SNPs to tag all 58 genes (including introns, exons, and 5 Kb 5′ and 2 Kb 3′ of coding sequences) mapped to the 10.29 Mb SLI1 region of linkage (D16S3138–D16S413. Chromosome 16 position 76.16 Mb–86.45 Mb [B35]). Haplotype blocks were built within Haploview17 via the Gabriel method.18 Any between-block gap that was more than 15 Kb in size was tagged with the Tagger algorithm. Two genes that mapped to the region (CDH13 [MIM #601364] and WWOX [MIM #605131]) were found to be larger than 1 Mb in size. For these two genes, blocks were built to cover the exonic regions only. Any region containing a SNP that met our predefined significance threshold (p < 0.001 in any one analysis or p < 0.01 across both analyses) was then supplemented with additional markers in a follow-up panel that included 138 SNPs, eight of which had previously been genotyped. Both phases of genotyping were completed prior to the replication study and were subjected to consistent quality-control procedures. The total genotype mismatch rate was 0.73% for duplicated SNPs and 0.76% for duplicated samples. Across both phases, 261 (12.7%) of SNPs were excluded at the quality-control stage. These included SNPs with a genotype rate of <80%, a minor-allele frequency of <2.5%, SNPs with unusual Beadstudio cluster patterns (Illumina) or atypical peaks in MassArray TyperAnalyser (Sequenom), SNPs with a GenTrain score of <0.5 (Illumina), and markers that showed consistent bad inheritances (>10 errors after data clean up). Across the entire region, the merged data set consisted of, on average, one SNP every 6.4 Kb. Across the known genes, there was on average one SNP every 4.5 Kb, and the largest remaining gap between blocks was 19,579 bp. Details of SNP coverage can be found in Table S1. Q-Q plots can be found in Figure S1. Given the consistent linkage between SLI1 and nonword repetition, all association analyses were based upon this measure. Our principal analysis involved the variance-components modeling of 28-item nonword repetition scores8 within 211 SLIC families (ao option) as a quantitative trait and was performed within QTDT.19 In addition, we performed a categorical case-control allelic test of association within PLINK.20 In this case-control analysis, SLIC individuals with low nonword-repetition scores (>2 SD below population mean, n = 79) were chosen as cases, and family members with above-average performance (>0.5 SD above population mean, n = 71) were used as controls. To avoid interdependence, we selected only one case or control from each family unit. The initial screen involved 1678 SNPs, of which thirteen (0.77%) exceeded our significance threshold, highlighting two primary regions of association (Table 1 and Figure 1). The follow-up panel chiefly included SNPs in these two regions and supported the association seen in the screen while reducing the evidence for association at other loci (Table 2 and Figure 1). Of the 105 SNPs tested in the follow-up panel, five (4.8%) were found to be significantly associated (Table 2 and Figure 1). The first identified cluster of association lay across 26 Kb (exons 2–4) of the CMIP gene (MIM #610112; seven significant SNPs, minP = 5 × 10−7). This gene encodes an adaptor protein and has two isoforms, the shorter of which is involved in cell signaling pathways and is upregulated in minimal change nephrotic syndrome (MCNS), a childhood kidney disease.21 Little is known about the function of the longer transcript. Both isoforms are expressed in the brain.21 The second region of association was observed between exons 7 and 12 (10.8 Kb) of the ATP2C2 gene (six significant SNPs, minP = 2 × 10−5). This gene is one of two secretory-pathway Ca2+-ATPases (SPCAs) that move cytosolic calcium and manganese ions into the golgi.22 Its expression is limited to the brain, testis, gastrointestinal tract, and respiratory tissues and mammary, salivary, and thyroid glands.22 In the mammary gland, ATP2C2 expression facilitates the secretion of Ca2+ into casein micelles during lactation.23 Three lines of evidence indicate that the associations at CMIP and ATP2C2 represent separate effects. First, we did not see any indication of long-range linkage disequilibrium between the two loci (which lie almost 3 Mb apart) in the SLIC cohort or public data (Figure S2). Second, the inclusion of a CMIP covariate in the linkage or association model did not affect the level of linkage or association seen at ATP2C2 (or vice versa for ATP2C2 covariates) (Figure S3). Finally, in a stepwise regression model, the group mean for SLIC individuals carrying a double-risk genotype was found to be significantly lower than those who were homozygous for risk at a single locus (p = 3.7 × 10−6, Table 3). In this model, the group mean for double-risk individuals was 15.8 points (1.05 SD) below that of individuals carrying nonrisk variants at both loci (Table 3). We therefore propose that CMIP and ATP2C2 independently regulate nonword repetition performance and together underlie the linkage seen between SLI and chromosome 16. Our replication sample consisted of 490 cases selected from the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort.11,12 This is a general-population sample that follows the development of 14,062 live-born individuals born in the southwest of England. The ALSPAC group periodically performs an assessment of the development of consenting individuals, and these measurements include tests of language ability. Informed written consent was obtained from the parents at the time of enrolment. Ethical approval for the study was obtained from the ALSPAC Law and Ethics Committee and the Local Research Ethics Committees. Because the current study focuses upon language impairment, we selected individuals from the lower extreme of language-related phenotype distributions (Children's Communication Checklist (CCC)24 and Wechsler Objective Language Dimensions (WOLD)25) for our replication sample. This included 665 individuals (10.3%) with a CCC pragmatic composite 1–3 SD below the ALSPAC population mean (123 ≤ × ≤ 145) or a WOLD listening comprehension score ≥2 SD below the ALSPAC population mean (≤3). Of these individuals, 490 had completed a 12-item nonword repetition test. Because the genotyping in the replication sample was restricted to a single individual from each family, we performed a quantitative association analysis within PLINK20 by using nonword repetition in a linear-regression framework. In addition, we used PLINK20 to carry out a case-control analysis analogous to that described for SLIC. We selected cases and controls from the extremes of the nonword repetition performance distribution of the 490 selected individuals. As expected, given the extreme nature of the language impairment in the SLIC samples, the distribution of nonword repetition differed between the SLIC and ALSPAC cohorts. Therefore, in the replication cohort, the cut-offs used for cases and controls were less extreme than those applied for the association screen. Cases were selected from the identified replication sample to have nonword repetition scores ≥1 SD below the general-population mean (n = 112), and controls had nonword repetition scores ≥1 SD above the general-population mean (n = 72). Data were analyzed for three CMIP and three ATP2C2 SNPs (rs12927866, rs4265801, and rs16955705; and rs16973771, rs2875891, and rs8045507, respectively), and significant associations (p < 0.05) were seen for two CMIP and two ATP2C2 SNPs (Table 4 and Figure 2). Regression trends for ATP2C2 followed those seen in SLIC, replicating the previously described association. Association to CMIP was in an opposite direction from that described above (Table 4 and Figure 2). Although this result might represent a type I error, the consistency of significant association in light of the low number of SNPs tested supports a role for CMIP. Associations can occur in opposite directions if the relationship between the observed and causal variants differs between populations.26 This is particularly true if multiple risk loci interact in an additive or multiplicative fashion26, as is predicted for CMIP. Identification of the causal variant will enable the further characterization of the relationship between risk variants in different populations. Given the partial replication of association, we investigated whether the primary associated SNPs in ATP2C2 and CMIP had an effect upon additional language- and memory-related measures (Table S2). In SLIC, we found borderline association for ATP2C2 with measures of receptive language (oral directions27 [p = 0.006], word classes27 [p = 0.04], and comprehension28 [p = 0.03]), expressive language (formulating sentences27 [p = 0.04]), and vocabulary28 (p = 0.04). In the replication cohort, aside from nonword repetition, we only observed borderline association between ATP2C2 and counting span, a measure of working memory (p = 0.01). In the replication sample, nonword repetition performance had been scored according to the number of syllables the nonword contained. For both CMIP and ATP2C2, the majority of association came from the five-syllable nonwords (p = 0.016 and p = 6 × 10−4, respectively) (Table S2). In neither sample did we observe association to reading-related tasks, which have been reported to show linkage to SLI1.6 Nor did we find any association to digit span28 or recalling sentences,27 two measures that have a high memory load. This is consistent with the finding that nonword repetition correlates with SLI to a higher degree than other short-term memory tests (e.g., digit span). The sensitivity of nonword repetition to SLI could be because it places heavier demands on processing of speech sounds than other memory tests as a result of the child's having to perceive and produce an unfamiliar sequence.29 It is important to note that, although nonword repetition is a good marker for SLI, poor performance on nonword repetition is not a perfect correlate of this disorder.30 In our study, 50% of SLIC probands performed poorly (>1 SD below the expected population mean) on nonword repetition, but a significant number (27%) scored above the expected population mean. These findings support recent opinion that deficits across multiple domains are required to cause persistent language impairments.31 A recent genome-wide association study of ADHD listed a SNP (rs10514604; p = 8 × 10−7) in ATP2C2 within the top 30 significant associations.32 Despite distinct defining characteristics, ADHD and SLI show a high level of comorbidity both with each other32 and with disorders such as developmental coordination disorder, speech-sound disorder (SSD; MIM #608445), and dyslexia.33–35 For example, individuals with SLI, SSD, ADHD, or dyslexia often present with linguistic deficits and impairments in short-term memory.33 It has therefore been suggested that certain aspects of these disorders might share a common etiology. Given the high levels of co-occurrence, we did not exclude children affected by ADHD and dyslexia from our study samples. However, in some of our SLIC samples, data were available for the presence of hyperactivity, coordination, and reading problems. From this, we estimate that approximately one-third of our SLIC samples showed some evidence of ADHD or developmental coordination disorder and that approximately one-half of our probands had reading problems. In the entire ASLPAC sample, 1.3% of individuals met criteria for ADHD. In the selected ALSPAC replication sample, the rate of ADHD increased to 3.7%. Thus, as expected, it is clear that the rate of developmental disorders across our cohorts is elevated over that expected in a population sample. Nonetheless, the association detected in our samples shows a strong correlation to nonword-repetition ability which has repeatedly been shown to be a strong indicator of language impairment.9,10 Furthermore, in ADHD samples, performance on the nonword-repetition task is correlated with linguistic ability rather than the presence of hyperactivity.33,36 Thus, we conclude that variants in ATP2C2 might account for shared aspects of the linguistic deficit in SLI and ADHD. Given this possibility, we also postulate that ATP2C2 might contribute to phonological short-term memory in other developmental disorders. Finally, we investigated the effects of ATP2C2 and CMIP on nonword-repetition performance at the population level. Across the entire unselected ALSPAC population (n = 3612), there was no evidence for quantitative association between nonword-repetition ability and either locus (minP = 0.48). Moreover, there were no differences in allele frequency for ATP2C2 or CMIP SNPs between either SLIC or replication-sample individuals and unselected European population controls (data not shown). Taken together, these data indicate that ATP2C2 and CMIP do not modulate nonword-repetition performance across the entire population, nor, in isolation, do they cause a predisposition to SLI. Instead, we propose that when combined with additional, as-yet-unidentified, susceptibility factors (either genetic or environmental), variants in ATP2C2 and CMIP have a detrimental effect upon nonword repetition performance and thus heighten the risk of developmental language impairments. This situation demonstrates a fundamental principle often overlooked in the mapping of complex disorders: that genetic variants might have selective effects in specific populations depending upon the genetic and environmental background. The question as to whether SLI constitutes a qualitatively distinct disorder caused by abnormal development of language abilities or merely represents the tail end of normal linguistic development is a matter of recent debate.37 Although the absence of association in our population sample could reflect insufficient sample sizes or the insensitivity of psychometric tests to quantify variation beyond the lower extremes of the spectrum, it is obvious that the effects of ATP2C2 and CMIP upon nonword-repetition performance are particularly pertinent to individuals with language difficulties. As such, this investigation provides molecular evidence that, at least in terms of the effects described here, SLI represents a distinct disorder caused by genetic variants discrete from those that influence language ability in the general population. In summary, we have used a positional fine-mapping approach to demonstrate association between ATP2C2 and CMIP and nonword repetition performance across two independent language-impaired populations. We propose that variants in both loci combine to modulate nonword-repetition performance in language-impaired populations. Both genes are expressed in the brain and represent good candidates for language- and memory-related processes. ATP2C2 is involved in the translocation of cytosolic calcium and manganese ions to the golgi.22 Calcium homeostasis is important for the regulation of many neuronal processes, including working memory, synaptic plasticity, and neuronal motility38, and manganese dysregulation has been linked to Parkinsonism (MIM #168600), Alzheimer disease (MIM #104300), and disordered memory.39 The functional role of CMIP is less defined, but it is known to interact with filamin A (MIM #300017)40 and the NF-kappaB subunit RelA (MIM #164014).41 The filaminA protein is involved in the reorganization of the actin cytoskeleton, which is of importance in the formation of the dendritic spine.40 The NF-κB family of transcription factors plays a central role in many neuronal processes, including synaptic activity and memory formation, and members of this family have been implicated in neurodegenerative disorders.42 Further characterization of the observed associations has enabled us to infer that SLI represents a qualitatively distinct disorder caused by a combination of genetic variants that disrupt multiple pathways important to the development of language. It is anticipated that the functional characterization of ATP2C2 and CMIP will promote a better understanding of the molecular basis of language acquisition and aid in the diagnosis and treatment of individuals affected by language disorders.
title	Main Text
p	Developmental speech and language disorders are a heterogeneous group of childhood conditions with variable presentation and etiology. Together, they account for 40% of pediatric referrals1 and statements of educational need.2 The term specific language impairment (SLI) defines a category of speech and language disorders in which a profound language impairment represents the primary deficit.2 This disorder affects 5%–8% of preschool children2 and is highly heritable.3 Nonetheless, in contrast to other related developmental disabilities (e.g., dyslexia [MIM #127700] and attention deficit hyperactivity disorder [ADHD, MIM #143465]), relatively few genetic studies have been performed for SLI. SLI is a prototypical multifactorial disorder that is predicted to involve numerous genetic loci and environmental factors.3 Three primary sites of linkage have been described4,5, the most robust of which is on chromosome 16q (SLI1, MIM #606711). This region is of interest because the linkage is highly specific to a single psychometric measure (nonword repetition).4,6,7 The test for nonword repetition involves the repetition of nonsensical words of increasing length and complexity and is regarded as a measure of phonological (speech sound) processing and short-term memory.8 Individuals with SLI typically perform particularly poorly on nonword repetition, even when their language difficulties have apparently resolved, leading to the postulation that a short-term memory deficit causes susceptibility to SLI9 by impairing the retention of novel verbal information.10 This paper incorporates two contingent investigations: an association screen of the SLI1 region in a cohort of language-impaired families and a subsequent replication study of detected association effects in an independent sample selected from the Avon Longitudinal Study of Parents and Children (ALSPAC) general-population cohort.11,12
p	The association screen utilized 806 individuals from 211 families ascertained by the SLI Consortium (SLIC). This nuclear-family cohort was collected from five sites across the UK (The Newcomen Centre at Guy's Hospital, London; the Cambridge Language and Speech Project (CLASP)13; the Child Life and Health Department at the University of Edinburgh14; the Department of Child Health at the University of Aberdeen; and the Manchester Language Study15,16) and included the families in whom the SLI1 linkage was originally identified. Ethical permission for each collection was granted by local ethics committees. SLIC families were all selected on the basis of a single proband with receptive and/or expressive language skills more than 1.5 SD below the normative mean for his or her age. A more detailed description of these samples and the exclusionary criteria applied to the SLIC collection can be found in previous publications.4,6,7
p	Genotyping for the association screen was performed in two phases with a combination of Sequenom and Illumina technologies. We performed an initial high-density screen involving 1906 SNPs to tag all 58 genes (including introns, exons, and 5 Kb 5′ and 2 Kb 3′ of coding sequences) mapped to the 10.29 Mb SLI1 region of linkage (D16S3138–D16S413. Chromosome 16 position 76.16 Mb–86.45 Mb [B35]). Haplotype blocks were built within Haploview17 via the Gabriel method.18 Any between-block gap that was more than 15 Kb in size was tagged with the Tagger algorithm. Two genes that mapped to the region (CDH13 [MIM #601364] and WWOX [MIM #605131]) were found to be larger than 1 Mb in size. For these two genes, blocks were built to cover the exonic regions only. Any region containing a SNP that met our predefined significance threshold (p < 0.001 in any one analysis or p < 0.01 across both analyses) was then supplemented with additional markers in a follow-up panel that included 138 SNPs, eight of which had previously been genotyped. Both phases of genotyping were completed prior to the replication study and were subjected to consistent quality-control procedures. The total genotype mismatch rate was 0.73% for duplicated SNPs and 0.76% for duplicated samples. Across both phases, 261 (12.7%) of SNPs were excluded at the quality-control stage. These included SNPs with a genotype rate of <80%, a minor-allele frequency of <2.5%, SNPs with unusual Beadstudio cluster patterns (Illumina) or atypical peaks in MassArray TyperAnalyser (Sequenom), SNPs with a GenTrain score of <0.5 (Illumina), and markers that showed consistent bad inheritances (>10 errors after data clean up). Across the entire region, the merged data set consisted of, on average, one SNP every 6.4 Kb. Across the known genes, there was on average one SNP every 4.5 Kb, and the largest remaining gap between blocks was 19,579 bp. Details of SNP coverage can be found in Table S1. Q-Q plots can be found in Figure S1. Given the consistent linkage between SLI1 and nonword repetition, all association analyses were based upon this measure. Our principal analysis involved the variance-components modeling of 28-item nonword repetition scores8 within 211 SLIC families (ao option) as a quantitative trait and was performed within QTDT.19 In addition, we performed a categorical case-control allelic test of association within PLINK.20 In this case-control analysis, SLIC individuals with low nonword-repetition scores (>2 SD below population mean, n = 79) were chosen as cases, and family members with above-average performance (>0.5 SD above population mean, n = 71) were used as controls. To avoid interdependence, we selected only one case or control from each family unit.
p	The initial screen involved 1678 SNPs, of which thirteen (0.77%) exceeded our significance threshold, highlighting two primary regions of association (Table 1 and Figure 1). The follow-up panel chiefly included SNPs in these two regions and supported the association seen in the screen while reducing the evidence for association at other loci (Table 2 and Figure 1). Of the 105 SNPs tested in the follow-up panel, five (4.8%) were found to be significantly associated (Table 2 and Figure 1). The first identified cluster of association lay across 26 Kb (exons 2–4) of the CMIP gene (MIM #610112; seven significant SNPs, minP = 5 × 10−7). This gene encodes an adaptor protein and has two isoforms, the shorter of which is involved in cell signaling pathways and is upregulated in minimal change nephrotic syndrome (MCNS), a childhood kidney disease.21 Little is known about the function of the longer transcript. Both isoforms are expressed in the brain.21 The second region of association was observed between exons 7 and 12 (10.8 Kb) of the ATP2C2 gene (six significant SNPs, minP = 2 × 10−5). This gene is one of two secretory-pathway Ca2+-ATPases (SPCAs) that move cytosolic calcium and manganese ions into the golgi.22 Its expression is limited to the brain, testis, gastrointestinal tract, and respiratory tissues and mammary, salivary, and thyroid glands.22 In the mammary gland, ATP2C2 expression facilitates the secretion of Ca2+ into casein micelles during lactation.23
p	Three lines of evidence indicate that the associations at CMIP and ATP2C2 represent separate effects. First, we did not see any indication of long-range linkage disequilibrium between the two loci (which lie almost 3 Mb apart) in the SLIC cohort or public data (Figure S2). Second, the inclusion of a CMIP covariate in the linkage or association model did not affect the level of linkage or association seen at ATP2C2 (or vice versa for ATP2C2 covariates) (Figure S3). Finally, in a stepwise regression model, the group mean for SLIC individuals carrying a double-risk genotype was found to be significantly lower than those who were homozygous for risk at a single locus (p = 3.7 × 10−6, Table 3). In this model, the group mean for double-risk individuals was 15.8 points (1.05 SD) below that of individuals carrying nonrisk variants at both loci (Table 3). We therefore propose that CMIP and ATP2C2 independently regulate nonword repetition performance and together underlie the linkage seen between SLI and chromosome 16.
p	Our replication sample consisted of 490 cases selected from the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort.11,12 This is a general-population sample that follows the development of 14,062 live-born individuals born in the southwest of England. The ALSPAC group periodically performs an assessment of the development of consenting individuals, and these measurements include tests of language ability. Informed written consent was obtained from the parents at the time of enrolment. Ethical approval for the study was obtained from the ALSPAC Law and Ethics Committee and the Local Research Ethics Committees. Because the current study focuses upon language impairment, we selected individuals from the lower extreme of language-related phenotype distributions (Children's Communication Checklist (CCC)24 and Wechsler Objective Language Dimensions (WOLD)25) for our replication sample. This included 665 individuals (10.3%) with a CCC pragmatic composite 1–3 SD below the ALSPAC population mean (123 ≤ × ≤ 145) or a WOLD listening comprehension score ≥2 SD below the ALSPAC population mean (≤3). Of these individuals, 490 had completed a 12-item nonword repetition test. Because the genotyping in the replication sample was restricted to a single individual from each family, we performed a quantitative association analysis within PLINK20 by using nonword repetition in a linear-regression framework. In addition, we used PLINK20 to carry out a case-control analysis analogous to that described for SLIC. We selected cases and controls from the extremes of the nonword repetition performance distribution of the 490 selected individuals. As expected, given the extreme nature of the language impairment in the SLIC samples, the distribution of nonword repetition differed between the SLIC and ALSPAC cohorts. Therefore, in the replication cohort, the cut-offs used for cases and controls were less extreme than those applied for the association screen. Cases were selected from the identified replication sample to have nonword repetition scores ≥1 SD below the general-population mean (n = 112), and controls had nonword repetition scores ≥1 SD above the general-population mean (n = 72). Data were analyzed for three CMIP and three ATP2C2 SNPs (rs12927866, rs4265801, and rs16955705; and rs16973771, rs2875891, and rs8045507, respectively), and significant associations (p < 0.05) were seen for two CMIP and two ATP2C2 SNPs (Table 4 and Figure 2). Regression trends for ATP2C2 followed those seen in SLIC, replicating the previously described association. Association to CMIP was in an opposite direction from that described above (Table 4 and Figure 2). Although this result might represent a type I error, the consistency of significant association in light of the low number of SNPs tested supports a role for CMIP. Associations can occur in opposite directions if the relationship between the observed and causal variants differs between populations.26 This is particularly true if multiple risk loci interact in an additive or multiplicative fashion26, as is predicted for CMIP. Identification of the causal variant will enable the further characterization of the relationship between risk variants in different populations.
p	Given the partial replication of association, we investigated whether the primary associated SNPs in ATP2C2 and CMIP had an effect upon additional language- and memory-related measures (Table S2). In SLIC, we found borderline association for ATP2C2 with measures of receptive language (oral directions27 [p = 0.006], word classes27 [p = 0.04], and comprehension28 [p = 0.03]), expressive language (formulating sentences27 [p = 0.04]), and vocabulary28 (p = 0.04). In the replication cohort, aside from nonword repetition, we only observed borderline association between ATP2C2 and counting span, a measure of working memory (p = 0.01). In the replication sample, nonword repetition performance had been scored according to the number of syllables the nonword contained. For both CMIP and ATP2C2, the majority of association came from the five-syllable nonwords (p = 0.016 and p = 6 × 10−4, respectively) (Table S2). In neither sample did we observe association to reading-related tasks, which have been reported to show linkage to SLI1.6 Nor did we find any association to digit span28 or recalling sentences,27 two measures that have a high memory load. This is consistent with the finding that nonword repetition correlates with SLI to a higher degree than other short-term memory tests (e.g., digit span). The sensitivity of nonword repetition to SLI could be because it places heavier demands on processing of speech sounds than other memory tests as a result of the child's having to perceive and produce an unfamiliar sequence.29 It is important to note that, although nonword repetition is a good marker for SLI, poor performance on nonword repetition is not a perfect correlate of this disorder.30 In our study, 50% of SLIC probands performed poorly (>1 SD below the expected population mean) on nonword repetition, but a significant number (27%) scored above the expected population mean. These findings support recent opinion that deficits across multiple domains are required to cause persistent language impairments.31
p	A recent genome-wide association study of ADHD listed a SNP (rs10514604; p = 8 × 10−7) in ATP2C2 within the top 30 significant associations.32 Despite distinct defining characteristics, ADHD and SLI show a high level of comorbidity both with each other32 and with disorders such as developmental coordination disorder, speech-sound disorder (SSD; MIM #608445), and dyslexia.33–35 For example, individuals with SLI, SSD, ADHD, or dyslexia often present with linguistic deficits and impairments in short-term memory.33 It has therefore been suggested that certain aspects of these disorders might share a common etiology. Given the high levels of co-occurrence, we did not exclude children affected by ADHD and dyslexia from our study samples. However, in some of our SLIC samples, data were available for the presence of hyperactivity, coordination, and reading problems. From this, we estimate that approximately one-third of our SLIC samples showed some evidence of ADHD or developmental coordination disorder and that approximately one-half of our probands had reading problems. In the entire ASLPAC sample, 1.3% of individuals met criteria for ADHD. In the selected ALSPAC replication sample, the rate of ADHD increased to 3.7%. Thus, as expected, it is clear that the rate of developmental disorders across our cohorts is elevated over that expected in a population sample. Nonetheless, the association detected in our samples shows a strong correlation to nonword-repetition ability which has repeatedly been shown to be a strong indicator of language impairment.9,10 Furthermore, in ADHD samples, performance on the nonword-repetition task is correlated with linguistic ability rather than the presence of hyperactivity.33,36 Thus, we conclude that variants in ATP2C2 might account for shared aspects of the linguistic deficit in SLI and ADHD. Given this possibility, we also postulate that ATP2C2 might contribute to phonological short-term memory in other developmental disorders.
p	Finally, we investigated the effects of ATP2C2 and CMIP on nonword-repetition performance at the population level. Across the entire unselected ALSPAC population (n = 3612), there was no evidence for quantitative association between nonword-repetition ability and either locus (minP = 0.48). Moreover, there were no differences in allele frequency for ATP2C2 or CMIP SNPs between either SLIC or replication-sample individuals and unselected European population controls (data not shown). Taken together, these data indicate that ATP2C2 and CMIP do not modulate nonword-repetition performance across the entire population, nor, in isolation, do they cause a predisposition to SLI. Instead, we propose that when combined with additional, as-yet-unidentified, susceptibility factors (either genetic or environmental), variants in ATP2C2 and CMIP have a detrimental effect upon nonword repetition performance and thus heighten the risk of developmental language impairments. This situation demonstrates a fundamental principle often overlooked in the mapping of complex disorders: that genetic variants might have selective effects in specific populations depending upon the genetic and environmental background. The question as to whether SLI constitutes a qualitatively distinct disorder caused by abnormal development of language abilities or merely represents the tail end of normal linguistic development is a matter of recent debate.37 Although the absence of association in our population sample could reflect insufficient sample sizes or the insensitivity of psychometric tests to quantify variation beyond the lower extremes of the spectrum, it is obvious that the effects of ATP2C2 and CMIP upon nonword-repetition performance are particularly pertinent to individuals with language difficulties. As such, this investigation provides molecular evidence that, at least in terms of the effects described here, SLI represents a distinct disorder caused by genetic variants discrete from those that influence language ability in the general population.
p	In summary, we have used a positional fine-mapping approach to demonstrate association between ATP2C2 and CMIP and nonword repetition performance across two independent language-impaired populations. We propose that variants in both loci combine to modulate nonword-repetition performance in language-impaired populations. Both genes are expressed in the brain and represent good candidates for language- and memory-related processes. ATP2C2 is involved in the translocation of cytosolic calcium and manganese ions to the golgi.22 Calcium homeostasis is important for the regulation of many neuronal processes, including working memory, synaptic plasticity, and neuronal motility38, and manganese dysregulation has been linked to Parkinsonism (MIM #168600), Alzheimer disease (MIM #104300), and disordered memory.39 The functional role of CMIP is less defined, but it is known to interact with filamin A (MIM #300017)40 and the NF-kappaB subunit RelA (MIM #164014).41 The filaminA protein is involved in the reorganization of the actin cytoskeleton, which is of importance in the formation of the dendritic spine.40 The NF-κB family of transcription factors plays a central role in many neuronal processes, including synaptic activity and memory formation, and members of this family have been implicated in neurodegenerative disorders.42 Further characterization of the observed associations has enabled us to infer that SLI represents a qualitatively distinct disorder caused by a combination of genetic variants that disrupt multiple pathways important to the development of language. It is anticipated that the functional characterization of ATP2C2 and CMIP will promote a better understanding of the molecular basis of language acquisition and aid in the diagnosis and treatment of individuals affected by language disorders.
back	Supplemental Data Document S1. Three Figures and Two Tables Web Resources The URLs for data presented herein are as follows:Illumina, www.illumina.com/ Sequenom, http://www.sequenom.com/ GE Healthcare, http://www6.gelifesciences.com/ Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/Omim Tagger, http://www.broad.mit.edu/mpg/tagger/ Haploview, http://www.broad.mit.edu/mpg/haploview/ QTDT, http://www.sph.umich.edu/csg/abecasis/QTDT/ PLINK, http://pngu.mgh.harvard.edu/∼purcell/plink/ MERLIN, http://www.sph.umich.edu/csg/abecasis/Merlin/ PEDSTATS, http://www.sph.umich.edu/csg/abecasis/PedStats/ HAPMAP, http://www.hapmap.org/ R, http://www.r-project.org/ The Monaco Group at the Wellcome Trust Centre for Human Genetics (Neurogenetics), http://www.well.ox.ac.uk/monaco/ ALSPAC, http://www.bristol.ac.uk/alspac/ Manchester Language Study, http://www.manchesterlanguagestudy.co.uk/ Acknowledgments We thank all the families and professionals who participated in the study, Caroline Durrant and Jean-Baptiste Cazier for statistical advice, members of the Monaco lab for support, and Leila Jannoun, Jane Addison, Clare Craven, Deborah Jones, Tilly Storr, Til Utting-Brown, Margaret Main, Jane Steele, and Alan MacLean for assistance with data collection and management. We are extremely grateful to all the families who took part in the ALSPAC study, to the midwives for their help in recruiting them, and to the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists, and nurses. The UK Medical Research Council, the Wellcome Trust, and the University of Bristol provide core support for ALSPAC. This publication is the work of the authors, and D.F. Newbury and A.P. Monaco will serve as guarantors for this paper's contents. The Wellcome Trust specifically funded this research. All laboratory work and the collection of data from families ascertained by Guy's Hospital and the University of Manchester were funded by The Wellcome Trust. CLASP was funded by The Wellcome Trust, British Telecom, Isaac Newton Trust, National Health Service (NHS) Anglia & Oxford Regional R&D Strategic Investment Award, and an NHS Eastern Region R&D Training Fellowship Award. The Edinburgh group was supported by the Chief Scientist's Office, Scotland. The Aberdeen group was supported by Grampian Healthcare Trust and Grampian Primary Care NHS Trust. D.V.M. Bishop is a Wellcome Trust Principal Research Fellow, and S.E. Fisher is a Royal Society Research Fellow.
sec	Supplemental Data Document S1. Three Figures and Two Tables
title	Supplemental Data
p	Document S1. Three Figures and Two Tables
caption	Document S1. Three Figures and Two Tables
title	Document S1. Three Figures and Two Tables
sec	Web Resources The URLs for data presented herein are as follows:Illumina, www.illumina.com/ Sequenom, http://www.sequenom.com/ GE Healthcare, http://www6.gelifesciences.com/ Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/Omim Tagger, http://www.broad.mit.edu/mpg/tagger/ Haploview, http://www.broad.mit.edu/mpg/haploview/ QTDT, http://www.sph.umich.edu/csg/abecasis/QTDT/ PLINK, http://pngu.mgh.harvard.edu/∼purcell/plink/ MERLIN, http://www.sph.umich.edu/csg/abecasis/Merlin/ PEDSTATS, http://www.sph.umich.edu/csg/abecasis/PedStats/ HAPMAP, http://www.hapmap.org/ R, http://www.r-project.org/ The Monaco Group at the Wellcome Trust Centre for Human Genetics (Neurogenetics), http://www.well.ox.ac.uk/monaco/ ALSPAC, http://www.bristol.ac.uk/alspac/ Manchester Language Study, http://www.manchesterlanguagestudy.co.uk/
title	Web Resources
p	The URLs for data presented herein are as follows:Illumina, www.illumina.com/ Sequenom, http://www.sequenom.com/ GE Healthcare, http://www6.gelifesciences.com/ Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/Omim Tagger, http://www.broad.mit.edu/mpg/tagger/ Haploview, http://www.broad.mit.edu/mpg/haploview/ QTDT, http://www.sph.umich.edu/csg/abecasis/QTDT/ PLINK, http://pngu.mgh.harvard.edu/∼purcell/plink/ MERLIN, http://www.sph.umich.edu/csg/abecasis/Merlin/ PEDSTATS, http://www.sph.umich.edu/csg/abecasis/PedStats/ HAPMAP, http://www.hapmap.org/ R, http://www.r-project.org/ The Monaco Group at the Wellcome Trust Centre for Human Genetics (Neurogenetics), http://www.well.ox.ac.uk/monaco/ ALSPAC, http://www.bristol.ac.uk/alspac/ Manchester Language Study, http://www.manchesterlanguagestudy.co.uk/
p	Illumina, www.illumina.com/
p	Sequenom, http://www.sequenom.com/
p	GE Healthcare, http://www6.gelifesciences.com/
p	Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/Omim
p	Tagger, http://www.broad.mit.edu/mpg/tagger/
p	Haploview, http://www.broad.mit.edu/mpg/haploview/
p	QTDT, http://www.sph.umich.edu/csg/abecasis/QTDT/
p	PLINK, http://pngu.mgh.harvard.edu/∼purcell/plink/
p	MERLIN, http://www.sph.umich.edu/csg/abecasis/Merlin/
p	PEDSTATS, http://www.sph.umich.edu/csg/abecasis/PedStats/
p	HAPMAP, http://www.hapmap.org/
p	R, http://www.r-project.org/
p	The Monaco Group at the Wellcome Trust Centre for Human Genetics (Neurogenetics), http://www.well.ox.ac.uk/monaco/
p	ALSPAC, http://www.bristol.ac.uk/alspac/
p	Manchester Language Study, http://www.manchesterlanguagestudy.co.uk/
ack	Acknowledgments We thank all the families and professionals who participated in the study, Caroline Durrant and Jean-Baptiste Cazier for statistical advice, members of the Monaco lab for support, and Leila Jannoun, Jane Addison, Clare Craven, Deborah Jones, Tilly Storr, Til Utting-Brown, Margaret Main, Jane Steele, and Alan MacLean for assistance with data collection and management. We are extremely grateful to all the families who took part in the ALSPAC study, to the midwives for their help in recruiting them, and to the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists, and nurses. The UK Medical Research Council, the Wellcome Trust, and the University of Bristol provide core support for ALSPAC. This publication is the work of the authors, and D.F. Newbury and A.P. Monaco will serve as guarantors for this paper's contents. The Wellcome Trust specifically funded this research. All laboratory work and the collection of data from families ascertained by Guy's Hospital and the University of Manchester were funded by The Wellcome Trust. CLASP was funded by The Wellcome Trust, British Telecom, Isaac Newton Trust, National Health Service (NHS) Anglia & Oxford Regional R&D Strategic Investment Award, and an NHS Eastern Region R&D Training Fellowship Award. The Edinburgh group was supported by the Chief Scientist's Office, Scotland. The Aberdeen group was supported by Grampian Healthcare Trust and Grampian Primary Care NHS Trust. D.V.M. Bishop is a Wellcome Trust Principal Research Fellow, and S.E. Fisher is a Royal Society Research Fellow.
title	Acknowledgments
p	We thank all the families and professionals who participated in the study, Caroline Durrant and Jean-Baptiste Cazier for statistical advice, members of the Monaco lab for support, and Leila Jannoun, Jane Addison, Clare Craven, Deborah Jones, Tilly Storr, Til Utting-Brown, Margaret Main, Jane Steele, and Alan MacLean for assistance with data collection and management. We are extremely grateful to all the families who took part in the ALSPAC study, to the midwives for their help in recruiting them, and to the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists, and nurses. The UK Medical Research Council, the Wellcome Trust, and the University of Bristol provide core support for ALSPAC. This publication is the work of the authors, and D.F. Newbury and A.P. Monaco will serve as guarantors for this paper's contents. The Wellcome Trust specifically funded this research. All laboratory work and the collection of data from families ascertained by Guy's Hospital and the University of Manchester were funded by The Wellcome Trust. CLASP was funded by The Wellcome Trust, British Telecom, Isaac Newton Trust, National Health Service (NHS) Anglia & Oxford Regional R&D Strategic Investment Award, and an NHS Eastern Region R&D Training Fellowship Award. The Edinburgh group was supported by the Chief Scientist's Office, Scotland. The Aberdeen group was supported by Grampian Healthcare Trust and Grampian Primary Care NHS Trust. D.V.M. Bishop is a Wellcome Trust Principal Research Fellow, and S.E. Fisher is a Royal Society Research Fellow.
figure	Figure 1 Association in SLIC Cohort Association results for family-based quantitaive analysis and case-control analysis of nonword repetition across the SLI1 region. In the case-control analysis, cases and controls were selected on the basis of their nonword-repetition performance (see text). Gaps in data represent regions where there are no mapped genes. SNPS included in the screen genotype panel are shown as +, and SNPs included in the follow-up genotype panel are shown as x.
label	Figure 1
caption	Association in SLIC Cohort Association results for family-based quantitaive analysis and case-control analysis of nonword repetition across the SLI1 region. In the case-control analysis, cases and controls were selected on the basis of their nonword-repetition performance (see text). Gaps in data represent regions where there are no mapped genes. SNPS included in the screen genotype panel are shown as +, and SNPs included in the follow-up genotype panel are shown as x.
p	Association in SLIC Cohort
p	Association results for family-based quantitaive analysis and case-control analysis of nonword repetition across the SLI1 region. In the case-control analysis, cases and controls were selected on the basis of their nonword-repetition performance (see text). Gaps in data represent regions where there are no mapped genes. SNPS included in the screen genotype panel are shown as +, and SNPs included in the follow-up genotype panel are shown as x.
figure	Figure 2 Nonword-Repetition Means for CMIP and ATP2C2 in SLIC and Replication Cohorts (A) CMIP. (B) ATP2C2. All means are for age- and sex-adjusted nonword-repetition scores standardized with a mean of 0 and a SD of 1. The three CMIP SNPs (rs12927866, rs4265801, and rs16955705) show genotype trends in the opposite direction from SLIC (A), whereas the three ATP2C2 SNPs (rs16973771, rs2875891, and rs8045507) show genotype trends in the same direction as SLIC (B).
label	Figure 2
caption	Nonword-Repetition Means for CMIP and ATP2C2 in SLIC and Replication Cohorts (A) CMIP. (B) ATP2C2. All means are for age- and sex-adjusted nonword-repetition scores standardized with a mean of 0 and a SD of 1. The three CMIP SNPs (rs12927866, rs4265801, and rs16955705) show genotype trends in the opposite direction from SLIC (A), whereas the three ATP2C2 SNPs (rs16973771, rs2875891, and rs8045507) show genotype trends in the same direction as SLIC (B).
p	Nonword-Repetition Means for CMIP and ATP2C2 in SLIC and Replication Cohorts
p	(A) CMIP.
p	(B) ATP2C2.
p	All means are for age- and sex-adjusted nonword-repetition scores standardized with a mean of 0 and a SD of 1. The three CMIP SNPs (rs12927866, rs4265801, and rs16955705) show genotype trends in the opposite direction from SLIC (A), whereas the three ATP2C2 SNPs (rs16973771, rs2875891, and rs8045507) show genotype trends in the same direction as SLIC (B).
table-wrap	Table 1 Significant Association in the SLIC Association Screen SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) A1 CEPH Frequency Typed Strand p Quant Effect Size Heritability p Emp QTDT p Case-Cont Frequency of A1 Cases Frequency of A1 Controls Odds ratio (95% CI) p Emp PLINK rs8051754 78,554,834 intergenic T/C∗ 0.46 − 0.0931 −0.28 ± 0.11 0.019 0.0892 0.0007∗ 0.64 0.85 3.1 (1.6–6.0) 0.0018∗ rs4417561 78,568,860 intergenic G∗/C 0.26 − 0.0244 −0.30 ± 0.11 0.022 0.0252 0.0004∗ 0.37 0.15 3.2 (1.7–6.3) 0.0011∗ rs2316184 79,204,885 CDYL2 G/A∗ 0.14 + 0.0032∗ -0.48 ± 0.12∗ 0.045 0.0034∗ 0.0096∗ 0.15 0.30 2.5 (1.2–4.9) 0.0126 rs12927866 80,209,823 CMIP A/G∗ 0.47 − 0.4104 −0.27 ± 0.10 0.019 0.3581 0.0003∗ 0.29 0.49 2.4 (1.5–3.9) 0.0004∗ rs4265801 80,222,553 CMIP T∗/G 0.43 + 0.3446 −0.09 ± 0.09 0.030 0.5065 4 × 10−5∗ 0.61 0.29 3.9 (2.0–7.6) 0.0393∗ rs7201632 80,234,949 CMIP C/T∗ 0.49 + 0.8966 −0.25 ± 0.09 0.017 0.7975 0.0004∗ 0.36 0.56 2.3 (1.4–3.7) 0.0004∗ rs3785054 82,918,978 WFDC1 C∗/T 0.36 − 0.0044∗ −0.29 ± 0.10∗ 0.019 0.0033∗ 0.0089∗ 0.34 0.20 2.0 (1.2–3.4) 0.0102 rs8053211 83,011,254 ATP2C2 A∗/G 0.46 + 5 × 10−5∗ −0.38 ± 0.09∗ 0.040 3 × 10−5∗ 0.0014∗ 0.61 0.43 2.1 (1.3–3.3) 0.0029∗ rs11860694 83,014,948 ATP2C2 C∗/G 0.54 − 2 × 10−5∗ −0.37 ± 0.09∗ 0.039 9 × 10−6∗ 0.0018∗ 0.61 0.43 2.1 (1.3–3.3) 0.0027∗ rs16973771 83,018,079 ATP2C2 G/A∗ 0.48 − 0.0003∗ −0.35 ± 0.09∗ 0.034 0.0006∗ 0.0025∗ 0.34 0.51 2.0 (1.3–3.2) 0.0036∗ rs2875891 83,021,410 ATP2C2 T/C∗ 0.44 + 0.0057∗ −0.34∗ ± 0.10∗ 0.031 0.0063∗ 0.0022∗ 0.30 0.47 2.1 (1.3–3.4) 0.0026∗ rs8045507 83,022,078 ATP2C2 T/C∗ 0.48 − 0.0017∗ −0.33 ± 0.09∗ 0.029 0.0020∗ 0.0022∗ 0.34 0.51 2.1 (1.3–3.3) 0.0028∗ Three significant SNPs fell within the CMIP gene, and five fell within ATP2C2. The remaining four significant SNPs were either intergenic or isolated signals of association. SNP alleles are given with the minor allele in the SLIC sample first. Putative risk alleles are marked with an asterisk. P Quant gives the p value for the quantitative, family-based analysis. p case-cont gives the p value for the case-control analysis. p values <0.01 are marked with an asterisk. The odds ratios indicate the ratio of case/control odds for each additional copy of the putative risk allele. Odds ratios were calculated within PLINK. The effect size is the estimated effect of each risk allele on the nonword repetition score (in SD ± SE). Effect sizes were calculated with MERLIN. Heritability gives the proportion of total variance explained by the SNP. Heritability estimates were calculated with MERLIN. The p Emp column gives empirical p values for the given SNP; these values were derived from permutations within QTDT or PLINK.
label	Table 1
caption	Significant Association in the SLIC Association Screen
p	Significant Association in the SLIC Association Screen
table	SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) A1 CEPH Frequency Typed Strand p Quant Effect Size Heritability p Emp QTDT p Case-Cont Frequency of A1 Cases Frequency of A1 Controls Odds ratio (95% CI) p Emp PLINK rs8051754 78,554,834 intergenic T/C∗ 0.46 − 0.0931 −0.28 ± 0.11 0.019 0.0892 0.0007∗ 0.64 0.85 3.1 (1.6–6.0) 0.0018∗ rs4417561 78,568,860 intergenic G∗/C 0.26 − 0.0244 −0.30 ± 0.11 0.022 0.0252 0.0004∗ 0.37 0.15 3.2 (1.7–6.3) 0.0011∗ rs2316184 79,204,885 CDYL2 G/A∗ 0.14 + 0.0032∗ -0.48 ± 0.12∗ 0.045 0.0034∗ 0.0096∗ 0.15 0.30 2.5 (1.2–4.9) 0.0126 rs12927866 80,209,823 CMIP A/G∗ 0.47 − 0.4104 −0.27 ± 0.10 0.019 0.3581 0.0003∗ 0.29 0.49 2.4 (1.5–3.9) 0.0004∗ rs4265801 80,222,553 CMIP T∗/G 0.43 + 0.3446 −0.09 ± 0.09 0.030 0.5065 4 × 10−5∗ 0.61 0.29 3.9 (2.0–7.6) 0.0393∗ rs7201632 80,234,949 CMIP C/T∗ 0.49 + 0.8966 −0.25 ± 0.09 0.017 0.7975 0.0004∗ 0.36 0.56 2.3 (1.4–3.7) 0.0004∗ rs3785054 82,918,978 WFDC1 C∗/T 0.36 − 0.0044∗ −0.29 ± 0.10∗ 0.019 0.0033∗ 0.0089∗ 0.34 0.20 2.0 (1.2–3.4) 0.0102 rs8053211 83,011,254 ATP2C2 A∗/G 0.46 + 5 × 10−5∗ −0.38 ± 0.09∗ 0.040 3 × 10−5∗ 0.0014∗ 0.61 0.43 2.1 (1.3–3.3) 0.0029∗ rs11860694 83,014,948 ATP2C2 C∗/G 0.54 − 2 × 10−5∗ −0.37 ± 0.09∗ 0.039 9 × 10−6∗ 0.0018∗ 0.61 0.43 2.1 (1.3–3.3) 0.0027∗ rs16973771 83,018,079 ATP2C2 G/A∗ 0.48 − 0.0003∗ −0.35 ± 0.09∗ 0.034 0.0006∗ 0.0025∗ 0.34 0.51 2.0 (1.3–3.2) 0.0036∗ rs2875891 83,021,410 ATP2C2 T/C∗ 0.44 + 0.0057∗ −0.34∗ ± 0.10∗ 0.031 0.0063∗ 0.0022∗ 0.30 0.47 2.1 (1.3–3.4) 0.0026∗ rs8045507 83,022,078 ATP2C2 T/C∗ 0.48 − 0.0017∗ −0.33 ± 0.09∗ 0.029 0.0020∗ 0.0022∗ 0.34 0.51 2.1 (1.3–3.3) 0.0028∗
tr	SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) A1 CEPH Frequency Typed Strand p Quant Effect Size Heritability p Emp QTDT p Case-Cont Frequency of A1 Cases Frequency of A1 Controls Odds ratio (95% CI) p Emp PLINK
th	SNP
th	Chromosome Position (bp – B36)
th	Gene
th	Alleles (A1/A2)
th	A1 CEPH Frequency
th	Typed Strand
th	p Quant
th	Effect Size
th	Heritability
th	p Emp QTDT
th	p Case-Cont
th	Frequency of A1 Cases
th	Frequency of A1 Controls
th	Odds ratio (95% CI)
th	p Emp PLINK
tr	rs8051754 78,554,834 intergenic T/C∗ 0.46 − 0.0931 −0.28 ± 0.11 0.019 0.0892 0.0007∗ 0.64 0.85 3.1 (1.6–6.0) 0.0018∗
td	rs8051754
td	78,554,834
td	intergenic
td	T/C∗
td	0.46
td	−
td	0.0931
td	−0.28 ± 0.11
td	0.019
td	0.0892
td	0.0007∗
td	0.64
td	0.85
td	3.1 (1.6–6.0)
td	0.0018∗
tr	rs4417561 78,568,860 intergenic G∗/C 0.26 − 0.0244 −0.30 ± 0.11 0.022 0.0252 0.0004∗ 0.37 0.15 3.2 (1.7–6.3) 0.0011∗
td	rs4417561
td	78,568,860
td	intergenic
td	G∗/C
td	0.26
td	−
td	0.0244
td	−0.30 ± 0.11
td	0.022
td	0.0252
td	0.0004∗
td	0.37
td	0.15
td	3.2 (1.7–6.3)
td	0.0011∗
tr	rs2316184 79,204,885 CDYL2 G/A∗ 0.14 + 0.0032∗ -0.48 ± 0.12∗ 0.045 0.0034∗ 0.0096∗ 0.15 0.30 2.5 (1.2–4.9) 0.0126
td	rs2316184
td	79,204,885
td	CDYL2
td	G/A∗
td	0.14
td	+
td	0.0032∗
td	-0.48 ± 0.12∗
td	0.045
td	0.0034∗
td	0.0096∗
td	0.15
td	0.30
td	2.5 (1.2–4.9)
td	0.0126
tr	rs12927866 80,209,823 CMIP A/G∗ 0.47 − 0.4104 −0.27 ± 0.10 0.019 0.3581 0.0003∗ 0.29 0.49 2.4 (1.5–3.9) 0.0004∗
td	rs12927866
td	80,209,823
td	CMIP
td	A/G∗
td	0.47
td	−
td	0.4104
td	−0.27 ± 0.10
td	0.019
td	0.3581
td	0.0003∗
td	0.29
td	0.49
td	2.4 (1.5–3.9)
td	0.0004∗
tr	rs4265801 80,222,553 CMIP T∗/G 0.43 + 0.3446 −0.09 ± 0.09 0.030 0.5065 4 × 10−5∗ 0.61 0.29 3.9 (2.0–7.6) 0.0393∗
td	rs4265801
td	80,222,553
td	CMIP
td	T∗/G
td	0.43
td	+
td	0.3446
td	−0.09 ± 0.09
td	0.030
td	0.5065
td	4 × 10−5∗
td	0.61
td	0.29
td	3.9 (2.0–7.6)
td	0.0393∗
tr	rs7201632 80,234,949 CMIP C/T∗ 0.49 + 0.8966 −0.25 ± 0.09 0.017 0.7975 0.0004∗ 0.36 0.56 2.3 (1.4–3.7) 0.0004∗
td	rs7201632
td	80,234,949
td	CMIP
td	C/T∗
td	0.49
td	+
td	0.8966
td	−0.25 ± 0.09
td	0.017
td	0.7975
td	0.0004∗
td	0.36
td	0.56
td	2.3 (1.4–3.7)
td	0.0004∗
tr	rs3785054 82,918,978 WFDC1 C∗/T 0.36 − 0.0044∗ −0.29 ± 0.10∗ 0.019 0.0033∗ 0.0089∗ 0.34 0.20 2.0 (1.2–3.4) 0.0102
td	rs3785054
td	82,918,978
td	WFDC1
td	C∗/T
td	0.36
td	−
td	0.0044∗
td	−0.29 ± 0.10∗
td	0.019
td	0.0033∗
td	0.0089∗
td	0.34
td	0.20
td	2.0 (1.2–3.4)
td	0.0102
tr	rs8053211 83,011,254 ATP2C2 A∗/G 0.46 + 5 × 10−5∗ −0.38 ± 0.09∗ 0.040 3 × 10−5∗ 0.0014∗ 0.61 0.43 2.1 (1.3–3.3) 0.0029∗
td	rs8053211
td	83,011,254
td	ATP2C2
td	A∗/G
td	0.46
td	+
td	5 × 10−5∗
td	−0.38 ± 0.09∗
td	0.040
td	3 × 10−5∗
td	0.0014∗
td	0.61
td	0.43
td	2.1 (1.3–3.3)
td	0.0029∗
tr	rs11860694 83,014,948 ATP2C2 C∗/G 0.54 − 2 × 10−5∗ −0.37 ± 0.09∗ 0.039 9 × 10−6∗ 0.0018∗ 0.61 0.43 2.1 (1.3–3.3) 0.0027∗
td	rs11860694
td	83,014,948
td	ATP2C2
td	C∗/G
td	0.54
td	−
td	2 × 10−5∗
td	−0.37 ± 0.09∗
td	0.039
td	9 × 10−6∗
td	0.0018∗
td	0.61
td	0.43
td	2.1 (1.3–3.3)
td	0.0027∗
tr	rs16973771 83,018,079 ATP2C2 G/A∗ 0.48 − 0.0003∗ −0.35 ± 0.09∗ 0.034 0.0006∗ 0.0025∗ 0.34 0.51 2.0 (1.3–3.2) 0.0036∗
td	rs16973771
td	83,018,079
td	ATP2C2
td	G/A∗
td	0.48
td	−
td	0.0003∗
td	−0.35 ± 0.09∗
td	0.034
td	0.0006∗
td	0.0025∗
td	0.34
td	0.51
td	2.0 (1.3–3.2)
td	0.0036∗
tr	rs2875891 83,021,410 ATP2C2 T/C∗ 0.44 + 0.0057∗ −0.34∗ ± 0.10∗ 0.031 0.0063∗ 0.0022∗ 0.30 0.47 2.1 (1.3–3.4) 0.0026∗
td	rs2875891
td	83,021,410
td	ATP2C2
td	T/C∗
td	0.44
td	+
td	0.0057∗
td	−0.34∗ ± 0.10∗
td	0.031
td	0.0063∗
td	0.0022∗
td	0.30
td	0.47
td	2.1 (1.3–3.4)
td	0.0026∗
tr	rs8045507 83,022,078 ATP2C2 T/C∗ 0.48 − 0.0017∗ −0.33 ± 0.09∗ 0.029 0.0020∗ 0.0022∗ 0.34 0.51 2.1 (1.3–3.3) 0.0028∗
td	rs8045507
td	83,022,078
td	ATP2C2
td	T/C∗
td	0.48
td	−
td	0.0017∗
td	−0.33 ± 0.09∗
td	0.029
td	0.0020∗
td	0.0022∗
td	0.34
td	0.51
td	2.1 (1.3–3.3)
td	0.0028∗
table-wrap-foot	Three significant SNPs fell within the CMIP gene, and five fell within ATP2C2. The remaining four significant SNPs were either intergenic or isolated signals of association. SNP alleles are given with the minor allele in the SLIC sample first. Putative risk alleles are marked with an asterisk. P Quant gives the p value for the quantitative, family-based analysis. p case-cont gives the p value for the case-control analysis. p values <0.01 are marked with an asterisk. The odds ratios indicate the ratio of case/control odds for each additional copy of the putative risk allele. Odds ratios were calculated within PLINK. The effect size is the estimated effect of each risk allele on the nonword repetition score (in SD ± SE). Effect sizes were calculated with MERLIN. Heritability gives the proportion of total variance explained by the SNP. Heritability estimates were calculated with MERLIN. The p Emp column gives empirical p values for the given SNP; these values were derived from permutations within QTDT or PLINK.
footnote	Three significant SNPs fell within the CMIP gene, and five fell within ATP2C2. The remaining four significant SNPs were either intergenic or isolated signals of association. SNP alleles are given with the minor allele in the SLIC sample first. Putative risk alleles are marked with an asterisk. P Quant gives the p value for the quantitative, family-based analysis. p case-cont gives the p value for the case-control analysis. p values <0.01 are marked with an asterisk. The odds ratios indicate the ratio of case/control odds for each additional copy of the putative risk allele. Odds ratios were calculated within PLINK. The effect size is the estimated effect of each risk allele on the nonword repetition score (in SD ± SE). Effect sizes were calculated with MERLIN. Heritability gives the proportion of total variance explained by the SNP. Heritability estimates were calculated with MERLIN. The p Emp column gives empirical p values for the given SNP; these values were derived from permutations within QTDT or PLINK.
p	Three significant SNPs fell within the CMIP gene, and five fell within ATP2C2. The remaining four significant SNPs were either intergenic or isolated signals of association. SNP alleles are given with the minor allele in the SLIC sample first. Putative risk alleles are marked with an asterisk. P Quant gives the p value for the quantitative, family-based analysis. p case-cont gives the p value for the case-control analysis. p values <0.01 are marked with an asterisk. The odds ratios indicate the ratio of case/control odds for each additional copy of the putative risk allele. Odds ratios were calculated within PLINK. The effect size is the estimated effect of each risk allele on the nonword repetition score (in SD ± SE). Effect sizes were calculated with MERLIN. Heritability gives the proportion of total variance explained by the SNP. Heritability estimates were calculated with MERLIN. The p Emp column gives empirical p values for the given SNP; these values were derived from permutations within QTDT or PLINK.
table-wrap	Table 2 Significant Association in the SLIC Cohort with the Follow-up Panel SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) A1 CEPH Frequency Typed Strand P Quant Effect Size Heritability p Emp QTDT p Case-Cont Frequency of A1 Cases Frequency of A1 Controls Odds Ratio (95% CI) p Emp PLINK rs6564903 80,211,158 CMIP C∗/T 0.48 + 0.1279 −0.37 ± 0.10 0.038 0.1225 5 × 10−7∗ 0.79 0.38 3.5 (2.1–5.9) 1 × 10−6∗ rs3935802 80,219,068 CMIP G∗/C 0.46 − 0.2667 −0.31 ± 0.10 0.025 0.2486 0.0003∗ 0.71 0.49 2.5 (1.5–4.2) 0.0006∗ rs16955705 80,230,851 CMIP C/A∗ 0.50 + 0.3916 −0.25 ± 0.10 0.017 0.3627 0.0003∗ 0.31 0.54 2.6 (1.5–4.4) 0.0003∗ rs4243209 80,247,592 CMIP C/T∗ 0.22 + 0.0065∗ −0.42 ± 0.12 0.027 0.0043∗ 0.0007∗ 0.11 0.26 3.0 (1.6–5.8) 0.0012∗ rs12149426 83,022,607 ATP2C2 A/C∗ 0.26 + 0.0064∗ −0.31 ± 0.12 0.017 0.0082∗ 0.0082∗ 0.14 0.27 2.3 (1.2–4.2) 0.0039∗ Of the 105 SNPs analyzed in the follow-up panel, 16 lay in CMIP, 76 lay in the ATP2C2 gene, and the remaining 13 lay in other regions that had shown association in the screen (see Table 1). Eight SNPs were genotyped in both the screen and follow-up panels. All of these markers showed some evidence of association in the screen phase (p < 0.01) but had genotype success rates of <95%, and none lay within CMIP or ATP2C2. Each of the duplicated SNPs showed increased success rates and decreased association levels in the follow-up panel. SNP alleles are given with the minor allele in the SLIC sample first. Putative risk alleles are marked with an asterisk. p Quant gives the p value for the quantitative, family-based analysis. p case-cont gives the p value for the case-control analysis. p values <0.01 are shown in bold. The odds ratios indicate the ratio of case/control odds for each additional copy of the putative risk allele. Odds ratios were calculated within PLINK. The effect size is the estimated effect of each risk allele on the nonword-repetition score (in SD ± SE). Effect sizes were calculated with MERLIN. Heritability gives the proportion of total variance explained by the SNP. Heritability estimates were calculated with MERLIN. The p Emp column gives empirical p values for the given SNP; these values were derived from permutations within QTDT or PLINK.
label	Table 2
caption	Significant Association in the SLIC Cohort with the Follow-up Panel
p	Significant Association in the SLIC Cohort with the Follow-up Panel
table	SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) A1 CEPH Frequency Typed Strand P Quant Effect Size Heritability p Emp QTDT p Case-Cont Frequency of A1 Cases Frequency of A1 Controls Odds Ratio (95% CI) p Emp PLINK rs6564903 80,211,158 CMIP C∗/T 0.48 + 0.1279 −0.37 ± 0.10 0.038 0.1225 5 × 10−7∗ 0.79 0.38 3.5 (2.1–5.9) 1 × 10−6∗ rs3935802 80,219,068 CMIP G∗/C 0.46 − 0.2667 −0.31 ± 0.10 0.025 0.2486 0.0003∗ 0.71 0.49 2.5 (1.5–4.2) 0.0006∗ rs16955705 80,230,851 CMIP C/A∗ 0.50 + 0.3916 −0.25 ± 0.10 0.017 0.3627 0.0003∗ 0.31 0.54 2.6 (1.5–4.4) 0.0003∗ rs4243209 80,247,592 CMIP C/T∗ 0.22 + 0.0065∗ −0.42 ± 0.12 0.027 0.0043∗ 0.0007∗ 0.11 0.26 3.0 (1.6–5.8) 0.0012∗ rs12149426 83,022,607 ATP2C2 A/C∗ 0.26 + 0.0064∗ −0.31 ± 0.12 0.017 0.0082∗ 0.0082∗ 0.14 0.27 2.3 (1.2–4.2) 0.0039∗
tr	SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) A1 CEPH Frequency Typed Strand P Quant Effect Size Heritability p Emp QTDT p Case-Cont Frequency of A1 Cases Frequency of A1 Controls Odds Ratio (95% CI) p Emp PLINK
th	SNP
th	Chromosome Position (bp – B36)
th	Gene
th	Alleles (A1/A2)
th	A1 CEPH Frequency
th	Typed Strand
th	P Quant
th	Effect Size
th	Heritability
th	p Emp QTDT
th	p Case-Cont
th	Frequency of A1 Cases
th	Frequency of A1 Controls
th	Odds Ratio (95% CI)
th	p Emp PLINK
tr	rs6564903 80,211,158 CMIP C∗/T 0.48 + 0.1279 −0.37 ± 0.10 0.038 0.1225 5 × 10−7∗ 0.79 0.38 3.5 (2.1–5.9) 1 × 10−6∗
td	rs6564903
td	80,211,158
td	CMIP
td	C∗/T
td	0.48
td	+
td	0.1279
td	−0.37 ± 0.10
td	0.038
td	0.1225
td	5 × 10−7∗
td	0.79
td	0.38
td	3.5 (2.1–5.9)
td	1 × 10−6∗
tr	rs3935802 80,219,068 CMIP G∗/C 0.46 − 0.2667 −0.31 ± 0.10 0.025 0.2486 0.0003∗ 0.71 0.49 2.5 (1.5–4.2) 0.0006∗
td	rs3935802
td	80,219,068
td	CMIP
td	G∗/C
td	0.46
td	−
td	0.2667
td	−0.31 ± 0.10
td	0.025
td	0.2486
td	0.0003∗
td	0.71
td	0.49
td	2.5 (1.5–4.2)
td	0.0006∗
tr	rs16955705 80,230,851 CMIP C/A∗ 0.50 + 0.3916 −0.25 ± 0.10 0.017 0.3627 0.0003∗ 0.31 0.54 2.6 (1.5–4.4) 0.0003∗
td	rs16955705
td	80,230,851
td	CMIP
td	C/A∗
td	0.50
td	+
td	0.3916
td	−0.25 ± 0.10
td	0.017
td	0.3627
td	0.0003∗
td	0.31
td	0.54
td	2.6 (1.5–4.4)
td	0.0003∗
tr	rs4243209 80,247,592 CMIP C/T∗ 0.22 + 0.0065∗ −0.42 ± 0.12 0.027 0.0043∗ 0.0007∗ 0.11 0.26 3.0 (1.6–5.8) 0.0012∗
td	rs4243209
td	80,247,592
td	CMIP
td	C/T∗
td	0.22
td	+
td	0.0065∗
td	−0.42 ± 0.12
td	0.027
td	0.0043∗
td	0.0007∗
td	0.11
td	0.26
td	3.0 (1.6–5.8)
td	0.0012∗
tr	rs12149426 83,022,607 ATP2C2 A/C∗ 0.26 + 0.0064∗ −0.31 ± 0.12 0.017 0.0082∗ 0.0082∗ 0.14 0.27 2.3 (1.2–4.2) 0.0039∗
td	rs12149426
td	83,022,607
td	ATP2C2
td	A/C∗
td	0.26
td	+
td	0.0064∗
td	−0.31 ± 0.12
td	0.017
td	0.0082∗
td	0.0082∗
td	0.14
td	0.27
td	2.3 (1.2–4.2)
td	0.0039∗
table-wrap-foot	Of the 105 SNPs analyzed in the follow-up panel, 16 lay in CMIP, 76 lay in the ATP2C2 gene, and the remaining 13 lay in other regions that had shown association in the screen (see Table 1). Eight SNPs were genotyped in both the screen and follow-up panels. All of these markers showed some evidence of association in the screen phase (p < 0.01) but had genotype success rates of <95%, and none lay within CMIP or ATP2C2. Each of the duplicated SNPs showed increased success rates and decreased association levels in the follow-up panel. SNP alleles are given with the minor allele in the SLIC sample first. Putative risk alleles are marked with an asterisk. p Quant gives the p value for the quantitative, family-based analysis. p case-cont gives the p value for the case-control analysis. p values <0.01 are shown in bold. The odds ratios indicate the ratio of case/control odds for each additional copy of the putative risk allele. Odds ratios were calculated within PLINK. The effect size is the estimated effect of each risk allele on the nonword-repetition score (in SD ± SE). Effect sizes were calculated with MERLIN. Heritability gives the proportion of total variance explained by the SNP. Heritability estimates were calculated with MERLIN. The p Emp column gives empirical p values for the given SNP; these values were derived from permutations within QTDT or PLINK.
footnote	Of the 105 SNPs analyzed in the follow-up panel, 16 lay in CMIP, 76 lay in the ATP2C2 gene, and the remaining 13 lay in other regions that had shown association in the screen (see Table 1). Eight SNPs were genotyped in both the screen and follow-up panels. All of these markers showed some evidence of association in the screen phase (p < 0.01) but had genotype success rates of <95%, and none lay within CMIP or ATP2C2. Each of the duplicated SNPs showed increased success rates and decreased association levels in the follow-up panel. SNP alleles are given with the minor allele in the SLIC sample first. Putative risk alleles are marked with an asterisk. p Quant gives the p value for the quantitative, family-based analysis. p case-cont gives the p value for the case-control analysis. p values <0.01 are shown in bold. The odds ratios indicate the ratio of case/control odds for each additional copy of the putative risk allele. Odds ratios were calculated within PLINK. The effect size is the estimated effect of each risk allele on the nonword-repetition score (in SD ± SE). Effect sizes were calculated with MERLIN. Heritability gives the proportion of total variance explained by the SNP. Heritability estimates were calculated with MERLIN. The p Emp column gives empirical p values for the given SNP; these values were derived from permutations within QTDT or PLINK.
p	Of the 105 SNPs analyzed in the follow-up panel, 16 lay in CMIP, 76 lay in the ATP2C2 gene, and the remaining 13 lay in other regions that had shown association in the screen (see Table 1). Eight SNPs were genotyped in both the screen and follow-up panels. All of these markers showed some evidence of association in the screen phase (p < 0.01) but had genotype success rates of <95%, and none lay within CMIP or ATP2C2. Each of the duplicated SNPs showed increased success rates and decreased association levels in the follow-up panel. SNP alleles are given with the minor allele in the SLIC sample first. Putative risk alleles are marked with an asterisk. p Quant gives the p value for the quantitative, family-based analysis. p case-cont gives the p value for the case-control analysis. p values <0.01 are shown in bold. The odds ratios indicate the ratio of case/control odds for each additional copy of the putative risk allele. Odds ratios were calculated within PLINK. The effect size is the estimated effect of each risk allele on the nonword-repetition score (in SD ± SE). Effect sizes were calculated with MERLIN. Heritability gives the proportion of total variance explained by the SNP. Heritability estimates were calculated with MERLIN. The p Emp column gives empirical p values for the given SNP; these values were derived from permutations within QTDT or PLINK.
table-wrap	Table 3 Nonword-Repetition Group Means for CMIP and ATP2C2 Risk Variants Genotype (Number of Risk Alleles) Single SNP rs6564903 (CMIP) TT (0) CT (1) CC (2) Single SNP 96.62 92.57 86.30 rs11860694 (ATP2C2) GG (0) 96.54 99.14 99.85 89.65 CG (1) 91.77 99.40 93.10 85.84 CC (2) 87.03 88.44 88.33 83.32 The effects of CMIP (rs6564903) and ATP2C2 (rs11860694) on nonword-repetition performance were modeled as additive effects within a regression framework in the R package. This regression model included all available SLIC children with genotype and nonword-repetition data (n = 503). Group means were calculated for each SNP in isolation (“Single SNP” entries) and in combinations of genotypes (3 × 3 grid) across risk SNPs. Note that individuals carrying combinations of risk alleles performed significantly worse than those carrying risk variants at a single locus. Nonword-repetition scores are age adjusted and standardized against normal population controls with a mean of 100 and a SD of 15.
label	Table 3
caption	Nonword-Repetition Group Means for CMIP and ATP2C2 Risk Variants
p	Nonword-Repetition Group Means for CMIP and ATP2C2 Risk Variants
table	Genotype (Number of Risk Alleles) Single SNP rs6564903 (CMIP) TT (0) CT (1) CC (2) Single SNP 96.62 92.57 86.30 rs11860694 (ATP2C2) GG (0) 96.54 99.14 99.85 89.65 CG (1) 91.77 99.40 93.10 85.84 CC (2) 87.03 88.44 88.33 83.32
tr	Genotype (Number of Risk Alleles) Single SNP rs6564903 (CMIP)
th	Genotype (Number of Risk Alleles)
th	Single SNP
th	rs6564903 (CMIP)
tr	TT (0) CT (1) CC (2)
th	TT (0)
th	CT (1)
th	CC (2)
tr	Single SNP 96.62 92.57 86.30
td	Single SNP
td	96.62
td	92.57
td	86.30
tr	rs11860694 (ATP2C2) GG (0) 96.54 99.14 99.85 89.65
td	rs11860694 (ATP2C2)
td	GG (0)
td	96.54
td	99.14
td	99.85
td	89.65
tr	CG (1) 91.77 99.40 93.10 85.84
td	CG (1)
td	91.77
td	99.40
td	93.10
td	85.84
tr	CC (2) 87.03 88.44 88.33 83.32
td	CC (2)
td	87.03
td	88.44
td	88.33
td	83.32
table-wrap-foot	The effects of CMIP (rs6564903) and ATP2C2 (rs11860694) on nonword-repetition performance were modeled as additive effects within a regression framework in the R package. This regression model included all available SLIC children with genotype and nonword-repetition data (n = 503). Group means were calculated for each SNP in isolation (“Single SNP” entries) and in combinations of genotypes (3 × 3 grid) across risk SNPs. Note that individuals carrying combinations of risk alleles performed significantly worse than those carrying risk variants at a single locus. Nonword-repetition scores are age adjusted and standardized against normal population controls with a mean of 100 and a SD of 15.
footnote	The effects of CMIP (rs6564903) and ATP2C2 (rs11860694) on nonword-repetition performance were modeled as additive effects within a regression framework in the R package. This regression model included all available SLIC children with genotype and nonword-repetition data (n = 503). Group means were calculated for each SNP in isolation (“Single SNP” entries) and in combinations of genotypes (3 × 3 grid) across risk SNPs. Note that individuals carrying combinations of risk alleles performed significantly worse than those carrying risk variants at a single locus. Nonword-repetition scores are age adjusted and standardized against normal population controls with a mean of 100 and a SD of 15.
p	The effects of CMIP (rs6564903) and ATP2C2 (rs11860694) on nonword-repetition performance were modeled as additive effects within a regression framework in the R package. This regression model included all available SLIC children with genotype and nonword-repetition data (n = 503). Group means were calculated for each SNP in isolation (“Single SNP” entries) and in combinations of genotypes (3 × 3 grid) across risk SNPs. Note that individuals carrying combinations of risk alleles performed significantly worse than those carrying risk variants at a single locus. Nonword-repetition scores are age adjusted and standardized against normal population controls with a mean of 100 and a SD of 15.
table-wrap	Table 4 Association in the Replication Cohort SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) SLIC Risk Allele A1 CEPH Frequency Typed Strand p Quant Effect Size p Case-Cont Frequency of A1 Cases Frequency of A1 controls Odds Ratio (95% CI) rs12927866 80,209,823 CMIP T/C C 0.47 + 0.1623 −0.08 0.0955 0.39 0.30 1.5 (0.9-2.3) rs4265801 80,222,553 CMIP T/G∗ T 0.43 + 0.0182∗ −0.15 0.0214∗ 0.43 0.56 1.6 (1.1-2.5) rs16955705 80,230,851 CMIP C∗/A A 0.50 + 0.0238∗ −0.14 0.0257∗ 0.48 0.36 1.6 (1.1-2.5) rs16973771 83,018,079 ATP2C2 C/T∗ T 0.48 + 0.0079∗ −0.14 0.0135∗ 0.32 0.45 1.7 (1.1-2.7) rs2875891 83,021,410 ATP2C2 T/C C 0.44 + 0.0668 −0.06 0.0802 0.29 0.37 1.5 (1.0-2.3) rs8045507 83,022,078 ATP2C2 A/G∗ G 0.48 + 0.0058∗ −0.15 0.0110∗ 0.31 0.44 1.8 (1.1-2.7) SNP alleles are given with the minor allele first. Putative risk alleles in the replication cohort are marked with an asterisk. p Quant shows the p value for the quantitative analysis. p < 0.05 are highlighted in bold. The odds ratio indicates the ratio of case/control odds for each additional copy of the putative risk allele. The 95% confidence intervals for the odds ratios of all significantly associated SNPs exceeded 1.0. The effect size is the estimated effect of each risk allele on the nonword-repetition score (in SD).
label	Table 4
caption	Association in the Replication Cohort
p	Association in the Replication Cohort
table	SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) SLIC Risk Allele A1 CEPH Frequency Typed Strand p Quant Effect Size p Case-Cont Frequency of A1 Cases Frequency of A1 controls Odds Ratio (95% CI) rs12927866 80,209,823 CMIP T/C C 0.47 + 0.1623 −0.08 0.0955 0.39 0.30 1.5 (0.9-2.3) rs4265801 80,222,553 CMIP T/G∗ T 0.43 + 0.0182∗ −0.15 0.0214∗ 0.43 0.56 1.6 (1.1-2.5) rs16955705 80,230,851 CMIP C∗/A A 0.50 + 0.0238∗ −0.14 0.0257∗ 0.48 0.36 1.6 (1.1-2.5) rs16973771 83,018,079 ATP2C2 C/T∗ T 0.48 + 0.0079∗ −0.14 0.0135∗ 0.32 0.45 1.7 (1.1-2.7) rs2875891 83,021,410 ATP2C2 T/C C 0.44 + 0.0668 −0.06 0.0802 0.29 0.37 1.5 (1.0-2.3) rs8045507 83,022,078 ATP2C2 A/G∗ G 0.48 + 0.0058∗ −0.15 0.0110∗ 0.31 0.44 1.8 (1.1-2.7)
tr	SNP Chromosome Position (bp – B36) Gene Alleles (A1/A2) SLIC Risk Allele A1 CEPH Frequency Typed Strand p Quant Effect Size p Case-Cont Frequency of A1 Cases Frequency of A1 controls Odds Ratio (95% CI)
th	SNP
th	Chromosome Position (bp – B36)
th	Gene
th	Alleles (A1/A2)
th	SLIC Risk Allele
th	A1 CEPH Frequency
th	Typed Strand
th	p Quant
th	Effect Size
th	p Case-Cont
th	Frequency of A1 Cases
th	Frequency of A1 controls
th	Odds Ratio (95% CI)
tr	rs12927866 80,209,823 CMIP T/C C 0.47 + 0.1623 −0.08 0.0955 0.39 0.30 1.5 (0.9-2.3)
td	rs12927866
td	80,209,823
td	CMIP
td	T/C
td	C
td	0.47
td	+
td	0.1623
td	−0.08
td	0.0955
td	0.39
td	0.30
td	1.5 (0.9-2.3)
tr	rs4265801 80,222,553 CMIP T/G∗ T 0.43 + 0.0182∗ −0.15 0.0214∗ 0.43 0.56 1.6 (1.1-2.5)
td	rs4265801
td	80,222,553
td	CMIP
td	T/G∗
td	T
td	0.43
td	+
td	0.0182∗
td	−0.15
td	0.0214∗
td	0.43
td	0.56
td	1.6 (1.1-2.5)
tr	rs16955705 80,230,851 CMIP C∗/A A 0.50 + 0.0238∗ −0.14 0.0257∗ 0.48 0.36 1.6 (1.1-2.5)
td	rs16955705
td	80,230,851
td	CMIP
td	C∗/A
td	A
td	0.50
td	+
td	0.0238∗
td	−0.14
td	0.0257∗
td	0.48
td	0.36
td	1.6 (1.1-2.5)
tr	rs16973771 83,018,079 ATP2C2 C/T∗ T 0.48 + 0.0079∗ −0.14 0.0135∗ 0.32 0.45 1.7 (1.1-2.7)
td	rs16973771
td	83,018,079
td	ATP2C2
td	C/T∗
td	T
td	0.48
td	+
td	0.0079∗
td	−0.14
td	0.0135∗
td	0.32
td	0.45
td	1.7 (1.1-2.7)
tr	rs2875891 83,021,410 ATP2C2 T/C C 0.44 + 0.0668 −0.06 0.0802 0.29 0.37 1.5 (1.0-2.3)
td	rs2875891
td	83,021,410
td	ATP2C2
td	T/C
td	C
td	0.44
td	+
td	0.0668
td	−0.06
td	0.0802
td	0.29
td	0.37
td	1.5 (1.0-2.3)
tr	rs8045507 83,022,078 ATP2C2 A/G∗ G 0.48 + 0.0058∗ −0.15 0.0110∗ 0.31 0.44 1.8 (1.1-2.7)
td	rs8045507
td	83,022,078
td	ATP2C2
td	A/G∗
td	G
td	0.48
td	+
td	0.0058∗
td	−0.15
td	0.0110∗
td	0.31
td	0.44
td	1.8 (1.1-2.7)
table-wrap-foot	SNP alleles are given with the minor allele first. Putative risk alleles in the replication cohort are marked with an asterisk. p Quant shows the p value for the quantitative analysis. p < 0.05 are highlighted in bold. The odds ratio indicates the ratio of case/control odds for each additional copy of the putative risk allele. The 95% confidence intervals for the odds ratios of all significantly associated SNPs exceeded 1.0. The effect size is the estimated effect of each risk allele on the nonword-repetition score (in SD).
footnote	SNP alleles are given with the minor allele first. Putative risk alleles in the replication cohort are marked with an asterisk. p Quant shows the p value for the quantitative analysis. p < 0.05 are highlighted in bold. The odds ratio indicates the ratio of case/control odds for each additional copy of the putative risk allele. The 95% confidence intervals for the odds ratios of all significantly associated SNPs exceeded 1.0. The effect size is the estimated effect of each risk allele on the nonword-repetition score (in SD).
p	SNP alleles are given with the minor allele first. Putative risk alleles in the replication cohort are marked with an asterisk. p Quant shows the p value for the quantitative analysis. p < 0.05 are highlighted in bold. The odds ratio indicates the ratio of case/control odds for each additional copy of the putative risk allele. The 95% confidence intervals for the odds ratios of all significantly associated SNPs exceeded 1.0. The effect size is the estimated effect of each risk allele on the nonword-repetition score (in SD).

Annnotations

blinded

PMC:2725236 JSONTXT 7 Projects

Document structure show

Annnotations

PMC:2725236 JSON TXT 7 Projects