PMC:3431878 / 6380-12315 JSONTXT

Annnotations TAB JSON ListView MergeView

{"target":"http://pubannotation.org/docs/sourcedb/PMC/sourceid/3431878","sourcedb":"PMC","sourceid":"3431878","source_url":"https://www.ncbi.nlm.nih.gov/pmc/3431878","text":"3. Nucleotide and Amino Acid Sequence\nThe currently available genomic information on CA125 is based on two autonomous studies conducted in 2001 [17,18]. By screening a cDNA library generated using OVCAR-3 mRNA, Yin et al. identified a clone with a 5797 base pair (bp) nucleotide sequence [17] (reviewed in [19,20]). Although this region translated to a partial CA125 protein, it confirmed and revealed the existence of some key features. Importantly, they established the presence of partially conserved tandem repeats and were the first to describe the homology to the “sea urchin sperm protein, enterokinase, agrin” (SEA) domain. Yin et al. also postulated the existence of a transmembrane domain based upon a cluster of hydrophobic residues and a phosphorylation site based on a tyrosine phosphorylation consensus motif [17]. Furthermore, O-glycosylation was inferred due to the serine, threonine and proline content as well as N-glycosylation due to the asparagine content of the protein. These findings were supported and extended on by O’Brien and co-workers [18,21]. Here, a large transcript encoding CA125 was sequenced based upon a cyanogen bromide cleavage fragment of CA125 and a translated expressed sequence tag [16]. The resultant protein was described in three parts; an amino terminal domain, a repeat domain and a carboxy terminal domain (Table 1, Figure 1). The carboxy terminal domain reported by O’Brien et al. was homologous to the findings of Yin et al. [17,18,20]. One feature described only by O’Brien was the existence of a potential proteolytic cleavage site at amino acid residues 171–181 [18].\nAs described previously, the repeat domain consists of partially conserved tandem repeats that are 156 amino acids in length. The SEA domain was noted to constitute the first 131 amino acids of this region [18]. However, the amino acid sequence exhibiting a SEA domain is variable in length as the main feature of the consensus sequence is a “strand-strand-helix-strand-strand-helix” secondary structure motif and is unaffected by small sequence gaps between these units [24]. Therefore, as most CA125 tandem repeats are polymorphic, the SEA domain may not necessarily be 131 amino acids long. This is exemplified by the identification of a SEA domain consensus sequence with a length of 59 amino acids located in the non-tandem repeat region of the C-terminal domain [17]. Furthermore, the repeat domain contains a highly conserved methionine at position 24 as well as two highly conserved cysteine residues at position 59 and 79 [18]. These cysteine residues in the repeat domain were hypothesized by O’Brien to form a 19 amino acid loop to which the M11 antibody binds. Additionally, O’Brien demonstrated a loss of binding of the M11 antibody to a recombinant repeat domain when digested with Asp-N or Lys-C as this would have disrupted the cysteine loop [18].\nIn the study by O’Brien et al., 45 unique and 15 redundant tandem repeat sequences were identified [18]. It was concluded that CA125 consists of at least 60 tandem repeats. However, due to difficulties associated with sequencing repeat regions some may be wrongly positioned or unknown, leaving the exact number of repeats undefined [18]. Furthermore, O’Brien states that the genome database is deficient in repeat information as repetitive sequences were potentially deleted during the compilation process of the human genome [18]. This suggests that the currently available nucleotide sequence of CA125 may not be correct.\nThe amino terminus was first described as a 1637 amino acid domain coded by 5 exons [17,18]. This was later revised by O’Brien et al. where an additional 37,700 nucleotides were sequenced from chromosome 19 [21]. Acquisition of the complete amino domain was confirmed by the identification of a Kozak translational initiation sequence as well as supporting Northern blot and mRNA expression data [21]. The main feature of this domain is the abundance of serine and threonine residues, which represent many potential sites of O-glycosylation, and disperse asparagine residues as potential sites of N-glycosylation [21].\nThe elucidation of the almost complete CA125 nucleotide sequence represented a significant leap in understanding. It enabled clarification of known and/or suspected features such as glycosylation (amino domain), multivalent antibody binding (repeat domain) as well as anchorage to and release from the membrane (carboxy domain) [12,27]. The findings also raise questions regarding the generation of lower molecular weight isoforms. Only the existence of a CA125 precursor protein of approximately 400 kDa has been found to exist in the cytoplasm of OVCAR-3 cells prior to protein maturation [28]. CA125 variants are also thought to arise from proteolytic cleavage. Proteins such as MUC1 have a proteolytic cleavage motif, GSVVV, in their SEA domain [29,30], however the presence of a motif with a similar consensus sequence has only been confirmed in one SEA domain of CA125 thus far [22]. A cluster analysis of a human CA125 sequence, containing the first 12 tandem repeats and a murine homolog to the C-terminus of CA125, was performed by Maeda et al. [22]. Interestingly, a DSVLV sequence in the second SEA domain from the cytosolic side of CA125 was clustered with the GSVVV sequence. Therefore, it was concluded that proteolytic cleavage might also occur in this domain of CA125 [22]. However, the formation of further proteolytic fragments of CA125, by the mechanism of the DSVLV or GSVVV sequence, in additional SEA domains has not been reported. Furthermore, O’Brien et al. suggested that the formation of alternative splicing products of this protein seems unlikely, based on the inherent variation of the five exons encoding for each repeat sequence [18]. Therefore, a discrepancy exists between antibody findings of low molecular weight isoforms and the protein sequence data (discussed below).","divisions":[{"label":"title","span":{"begin":0,"end":37}},{"label":"p","span":{"begin":38,"end":1621}},{"label":"p","span":{"begin":1622,"end":2885}},{"label":"p","span":{"begin":2886,"end":3510}},{"label":"p","span":{"begin":3511,"end":4129}}],"tracks":[{"project":"2_test","denotations":[{"id":"22949880-11369781-50810654","span":{"begin":145,"end":147},"obj":"11369781"},{"id":"22949880-11786729-50810655","span":{"begin":148,"end":150},"obj":"11786729"},{"id":"22949880-11369781-50810656","span":{"begin":289,"end":291},"obj":"11369781"},{"id":"22949880-15788735-50810657","span":{"begin":307,"end":309},"obj":"15788735"},{"id":"22949880-11786728-50810658","span":{"begin":310,"end":312},"obj":"11786728"},{"id":"22949880-11369781-50810659","span":{"begin":824,"end":826},"obj":"11369781"},{"id":"22949880-11786729-50810660","span":{"begin":1066,"end":1068},"obj":"11786729"},{"id":"22949880-12218296-50810661","span":{"begin":1069,"end":1071},"obj":"12218296"},{"id":"22949880-10228899-50810662","span":{"begin":1226,"end":1228},"obj":"10228899"},{"id":"22949880-11369781-50810663","span":{"begin":1477,"end":1479},"obj":"11369781"},{"id":"22949880-11786729-50810664","span":{"begin":1480,"end":1482},"obj":"11786729"},{"id":"22949880-11786728-50810665","span":{"begin":1483,"end":1485},"obj":"11786728"},{"id":"22949880-11786729-50810666","span":{"begin":1617,"end":1619},"obj":"11786729"},{"id":"22949880-11786729-50810667","span":{"begin":1829,"end":1831},"obj":"11786729"},{"id":"22949880-7670383-50810668","span":{"begin":2094,"end":2096},"obj":"7670383"},{"id":"22949880-11369781-50810669","span":{"begin":2391,"end":2393},"obj":"11369781"},{"id":"22949880-11786729-50810670","span":{"begin":2554,"end":2556},"obj":"11786729"},{"id":"22949880-11786729-50810671","span":{"begin":2881,"end":2883},"obj":"11786729"},{"id":"22949880-11786729-50810672","span":{"begin":2986,"end":2988},"obj":"11786729"},{"id":"22949880-11786729-50810673","span":{"begin":3220,"end":3222},"obj":"11786729"},{"id":"22949880-11786729-50810674","span":{"begin":3414,"end":3416},"obj":"11786729"},{"id":"22949880-11369781-50810675","span":{"begin":3596,"end":3598},"obj":"11369781"},{"id":"22949880-11786729-50810676","span":{"begin":3599,"end":3601},"obj":"11786729"},{"id":"22949880-12218296-50810677","span":{"begin":3719,"end":3721},"obj":"12218296"},{"id":"22949880-12218296-50810678","span":{"begin":3908,"end":3910},"obj":"12218296"},{"id":"22949880-12218296-50810679","span":{"begin":4125,"end":4127},"obj":"12218296"},{"id":"22949880-12615972-50810680","span":{"begin":4459,"end":4461},"obj":"12615972"},{"id":"22949880-11125279-50810681","span":{"begin":4722,"end":4724},"obj":"11125279"},{"id":"22949880-11341784-50810682","span":{"begin":4880,"end":4882},"obj":"11341784"},{"id":"22949880-15943821-50810683","span":{"begin":4883,"end":4885},"obj":"15943821"},{"id":"22949880-14764598-50810684","span":{"begin":5015,"end":5017},"obj":"14764598"},{"id":"22949880-14764598-50810685","span":{"begin":5185,"end":5187},"obj":"14764598"},{"id":"22949880-14764598-50810686","span":{"begin":5415,"end":5417},"obj":"14764598"},{"id":"22949880-11786729-50810687","span":{"begin":5791,"end":5793},"obj":"11786729"},{"id":"T57307","span":{"begin":145,"end":147},"obj":"11369781"},{"id":"T17647","span":{"begin":148,"end":150},"obj":"11786729"},{"id":"T74833","span":{"begin":289,"end":291},"obj":"11369781"},{"id":"T21795","span":{"begin":307,"end":309},"obj":"15788735"},{"id":"T20828","span":{"begin":310,"end":312},"obj":"11786728"},{"id":"T91204","span":{"begin":824,"end":826},"obj":"11369781"},{"id":"T98513","span":{"begin":1066,"end":1068},"obj":"11786729"},{"id":"T75911","span":{"begin":1069,"end":1071},"obj":"12218296"},{"id":"T13278","span":{"begin":1226,"end":1228},"obj":"10228899"},{"id":"T89329","span":{"begin":1477,"end":1479},"obj":"11369781"},{"id":"T48694","span":{"begin":1480,"end":1482},"obj":"11786729"},{"id":"T2259","span":{"begin":1483,"end":1485},"obj":"11786728"},{"id":"T16302","span":{"begin":1617,"end":1619},"obj":"11786729"},{"id":"T96982","span":{"begin":1829,"end":1831},"obj":"11786729"},{"id":"T55839","span":{"begin":2094,"end":2096},"obj":"7670383"},{"id":"T85475","span":{"begin":2391,"end":2393},"obj":"11369781"},{"id":"T12134","span":{"begin":2554,"end":2556},"obj":"11786729"},{"id":"T19155","span":{"begin":2881,"end":2883},"obj":"11786729"},{"id":"T77706","span":{"begin":2986,"end":2988},"obj":"11786729"},{"id":"T1458","span":{"begin":3220,"end":3222},"obj":"11786729"},{"id":"T6748","span":{"begin":3414,"end":3416},"obj":"11786729"},{"id":"T1994","span":{"begin":3596,"end":3598},"obj":"11369781"},{"id":"T25205","span":{"begin":3599,"end":3601},"obj":"11786729"},{"id":"T32330","span":{"begin":3719,"end":3721},"obj":"12218296"},{"id":"T45522","span":{"begin":3908,"end":3910},"obj":"12218296"},{"id":"T96224","span":{"begin":4125,"end":4127},"obj":"12218296"},{"id":"T7728","span":{"begin":4459,"end":4461},"obj":"12615972"},{"id":"T85789","span":{"begin":4722,"end":4724},"obj":"11125279"},{"id":"T50797","span":{"begin":4880,"end":4882},"obj":"11341784"},{"id":"T23709","span":{"begin":4883,"end":4885},"obj":"15943821"},{"id":"T137","span":{"begin":5015,"end":5017},"obj":"14764598"},{"id":"T83000","span":{"begin":5185,"end":5187},"obj":"14764598"},{"id":"T25173","span":{"begin":5415,"end":5417},"obj":"14764598"},{"id":"T49369","span":{"begin":5791,"end":5793},"obj":"11786729"}],"attributes":[{"subj":"22949880-11369781-50810654","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810655","pred":"source","obj":"2_test"},{"subj":"22949880-11369781-50810656","pred":"source","obj":"2_test"},{"subj":"22949880-15788735-50810657","pred":"source","obj":"2_test"},{"subj":"22949880-11786728-50810658","pred":"source","obj":"2_test"},{"subj":"22949880-11369781-50810659","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810660","pred":"source","obj":"2_test"},{"subj":"22949880-12218296-50810661","pred":"source","obj":"2_test"},{"subj":"22949880-10228899-50810662","pred":"source","obj":"2_test"},{"subj":"22949880-11369781-50810663","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810664","pred":"source","obj":"2_test"},{"subj":"22949880-11786728-50810665","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810666","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810667","pred":"source","obj":"2_test"},{"subj":"22949880-7670383-50810668","pred":"source","obj":"2_test"},{"subj":"22949880-11369781-50810669","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810670","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810671","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810672","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810673","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810674","pred":"source","obj":"2_test"},{"subj":"22949880-11369781-50810675","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810676","pred":"source","obj":"2_test"},{"subj":"22949880-12218296-50810677","pred":"source","obj":"2_test"},{"subj":"22949880-12218296-50810678","pred":"source","obj":"2_test"},{"subj":"22949880-12218296-50810679","pred":"source","obj":"2_test"},{"subj":"22949880-12615972-50810680","pred":"source","obj":"2_test"},{"subj":"22949880-11125279-50810681","pred":"source","obj":"2_test"},{"subj":"22949880-11341784-50810682","pred":"source","obj":"2_test"},{"subj":"22949880-15943821-50810683","pred":"source","obj":"2_test"},{"subj":"22949880-14764598-50810684","pred":"source","obj":"2_test"},{"subj":"22949880-14764598-50810685","pred":"source","obj":"2_test"},{"subj":"22949880-14764598-50810686","pred":"source","obj":"2_test"},{"subj":"22949880-11786729-50810687","pred":"source","obj":"2_test"},{"subj":"T57307","pred":"source","obj":"2_test"},{"subj":"T17647","pred":"source","obj":"2_test"},{"subj":"T74833","pred":"source","obj":"2_test"},{"subj":"T21795","pred":"source","obj":"2_test"},{"subj":"T20828","pred":"source","obj":"2_test"},{"subj":"T91204","pred":"source","obj":"2_test"},{"subj":"T98513","pred":"source","obj":"2_test"},{"subj":"T75911","pred":"source","obj":"2_test"},{"subj":"T13278","pred":"source","obj":"2_test"},{"subj":"T89329","pred":"source","obj":"2_test"},{"subj":"T48694","pred":"source","obj":"2_test"},{"subj":"T2259","pred":"source","obj":"2_test"},{"subj":"T16302","pred":"source","obj":"2_test"},{"subj":"T96982","pred":"source","obj":"2_test"},{"subj":"T55839","pred":"source","obj":"2_test"},{"subj":"T85475","pred":"source","obj":"2_test"},{"subj":"T12134","pred":"source","obj":"2_test"},{"subj":"T19155","pred":"source","obj":"2_test"},{"subj":"T77706","pred":"source","obj":"2_test"},{"subj":"T1458","pred":"source","obj":"2_test"},{"subj":"T6748","pred":"source","obj":"2_test"},{"subj":"T1994","pred":"source","obj":"2_test"},{"subj":"T25205","pred":"source","obj":"2_test"},{"subj":"T32330","pred":"source","obj":"2_test"},{"subj":"T45522","pred":"source","obj":"2_test"},{"subj":"T96224","pred":"source","obj":"2_test"},{"subj":"T7728","pred":"source","obj":"2_test"},{"subj":"T85789","pred":"source","obj":"2_test"},{"subj":"T50797","pred":"source","obj":"2_test"},{"subj":"T23709","pred":"source","obj":"2_test"},{"subj":"T137","pred":"source","obj":"2_test"},{"subj":"T83000","pred":"source","obj":"2_test"},{"subj":"T25173","pred":"source","obj":"2_test"},{"subj":"T49369","pred":"source","obj":"2_test"}]}],"config":{"attribute types":[{"pred":"source","value type":"selection","values":[{"id":"2_test","color":"#a393ec","default":true}]}]}}