PMC:1069646 / 47242-52140 JSONTXT

Annnotations TAB JSON ListView MergeView

    2_test

    {"project":"2_test","denotations":[{"id":"15780005-15111095-84825115","span":{"begin":307,"end":310},"obj":"15111095"},{"id":"15780005-9521922-84825116","span":{"begin":1687,"end":1690},"obj":"9521922"},{"id":"15780005-9521923-84825117","span":{"begin":1691,"end":1694},"obj":"9521923"},{"id":"15780005-9521921-84825118","span":{"begin":1695,"end":1698},"obj":"9521921"},{"id":"15780005-15099548-84825119","span":{"begin":2113,"end":2116},"obj":"15099548"},{"id":"15780005-11812492-84825120","span":{"begin":2176,"end":2179},"obj":"11812492"},{"id":"15780005-15111095-84825121","span":{"begin":2250,"end":2253},"obj":"15111095"},{"id":"15780005-12519973-84825122","span":{"begin":2291,"end":2294},"obj":"12519973"},{"id":"15780005-11930014-84825123","span":{"begin":2325,"end":2328},"obj":"11930014"},{"id":"15780005-9023104-84825124","span":{"begin":2557,"end":2560},"obj":"9023104"},{"id":"15780005-2231712-84825125","span":{"begin":2624,"end":2627},"obj":"2231712"},{"id":"15780005-12969510-84825126","span":{"begin":2929,"end":2932},"obj":"12969510"},{"id":"15780005-2231712-84825127","span":{"begin":3296,"end":3299},"obj":"2231712"},{"id":"15780005-2231712-84825128","span":{"begin":3391,"end":3394},"obj":"2231712"},{"id":"15780005-11410670-84825129","span":{"begin":3625,"end":3628},"obj":"11410670"},{"id":"15780005-14681379-84825130","span":{"begin":4066,"end":4069},"obj":"14681379"},{"id":"15780005-9254694-84825131","span":{"begin":4157,"end":4160},"obj":"9254694"},{"id":"15780005-14681412-84825132","span":{"begin":4182,"end":4185},"obj":"14681412"},{"id":"15780005-12519973-84825133","span":{"begin":4296,"end":4299},"obj":"12519973"},{"id":"15780005-2231712-84825134","span":{"begin":4534,"end":4537},"obj":"2231712"},{"id":"15780005-10065704-84825135","span":{"begin":4610,"end":4613},"obj":"10065704"},{"id":"15780005-10869041-84825136","span":{"begin":4683,"end":4686},"obj":"10869041"},{"id":"15780005-11230160-84825137","span":{"begin":4763,"end":4766},"obj":"11230160"},{"id":"15780005-15637156-84825138","span":{"begin":4888,"end":4891},"obj":"15637156"},{"id":"15780005-15618402-84825139","span":{"begin":4892,"end":4895},"obj":"15618402"},{"id":"T94517","span":{"begin":307,"end":310},"obj":"15111095"},{"id":"T38185","span":{"begin":1687,"end":1690},"obj":"9521922"},{"id":"T28847","span":{"begin":1691,"end":1694},"obj":"9521923"},{"id":"T56423","span":{"begin":1695,"end":1698},"obj":"9521921"},{"id":"T6272","span":{"begin":2113,"end":2116},"obj":"15099548"},{"id":"T40836","span":{"begin":2176,"end":2179},"obj":"11812492"},{"id":"T85829","span":{"begin":2250,"end":2253},"obj":"15111095"},{"id":"T2968","span":{"begin":2291,"end":2294},"obj":"12519973"},{"id":"T79035","span":{"begin":2325,"end":2328},"obj":"11930014"},{"id":"T13072","span":{"begin":2557,"end":2560},"obj":"9023104"},{"id":"T15131","span":{"begin":2624,"end":2627},"obj":"2231712"},{"id":"T2045","span":{"begin":2929,"end":2932},"obj":"12969510"},{"id":"T83977","span":{"begin":3296,"end":3299},"obj":"2231712"},{"id":"T77906","span":{"begin":3391,"end":3394},"obj":"2231712"},{"id":"T59263","span":{"begin":3625,"end":3628},"obj":"11410670"},{"id":"T11718","span":{"begin":4066,"end":4069},"obj":"14681379"},{"id":"T66455","span":{"begin":4157,"end":4160},"obj":"9254694"},{"id":"T76014","span":{"begin":4182,"end":4185},"obj":"14681412"},{"id":"T40237","span":{"begin":4296,"end":4299},"obj":"12519973"},{"id":"T73224","span":{"begin":4534,"end":4537},"obj":"2231712"},{"id":"T58020","span":{"begin":4610,"end":4613},"obj":"10065704"},{"id":"T28254","span":{"begin":4683,"end":4686},"obj":"10869041"},{"id":"T53244","span":{"begin":4763,"end":4766},"obj":"11230160"},{"id":"T10404","span":{"begin":4888,"end":4891},"obj":"15637156"},{"id":"T7906","span":{"begin":4892,"end":4895},"obj":"15618402"}],"text":"Materials and Methods\n\nMaterials and Methods \nB. malayi microfilaria worms were purchased from TRS Labs (Athens, GA, United States) for preparation of DNA. Because of the difficulties in obtaining purified Wolbachia DNA from the B. malayi host, bacterial artificial chromosome (BAC) libraries were created [114]. From these libraries, a minimum tiling path of 21 Wolbachia BACs was created and used for subcloning into plasmid vectors for genomic sequencing. This ordered BAC approach was useful in the assembly phase of the project because of the highly repetitive nature of this genome.\nFor plasmid library generation, equal amounts of BAC DNAs were pooled and 50 μg of DNA from the pool was sheared into 2.0–3.0 kb fragments (HydroShear device, GeneMachines, Genomic Solutions, Ann Arbor, Michigan, United States). Sheared DNA was purified from a 0.7% agarose gel, blunted, and cloned into cleaved, dephosphorylated plasmid vectors. Libraries were generated containing DNA from 1 to 9 BACs.\nPlasmid DNA was isolated by a modified alkaline lysis protocol. Sequencing reactions were performed at Integrated Genomics (Chicago, Illinois, United States) using the DYEnamic ET Dye Terminator Cycle Sequencing Kit (Amersham Biosciences, Little Chalfont, United Kingdom). Unincorporated dye was removed by isopropanol precipitation as recommended by the manufacturer. Samples were run on MegaBace 1000 (Amersham Biosciences) sequencers; 87% of plasmid sequencing reactions were successful. The genome was sequenced to an average coverage of 10.7X and at 2X minimum coverage (at least once in each direction) and assembled.\nThe sequence was assembled into contigs by using PHRED–PHRAP–CONSED [165,166,167], and gaps were initially closed by primer walking (1,766 reactions). Regions considered to be potential frame shifts or sequencing errors after the first round of annotation were resequenced from direct genomic PCR products. The completed sequence was used to identify homologous sequences in the independent ongoing B. malayi sequence project (TIGR parasites genome database: http://www.tigr.org/tdb/e2k1/bma1/ [126]). The sequence of one BAC had been previously determined [163]. The final assembly was in full agreement with the BAC physical map [114].\nIntegrated Genomics ERGO software [168] and other software programs [169] were used for ORF calling, gene identification, and feature recognition. Computational analysis of the genome sequence was performed as previously described. Briefly, the tRNA genes were identified using the tRNA-SCAN program [170], and the rRNA genes were identified using the BLASTN program [171]. For the identification of the protein-coding genes, the genome sequence was conceptually translated in six frames to generate potential protein products of ORFs longer than 100 codons. These potential protein sequences were compared to the database of proteins from the COG database using COGNITOR [172].\nAfter manual verification of the COG assignments, the validated COG members from wBm were called as protein-coding genes. The COG assignment procedure was repeated with ORFs of greater than 60 codons from the intergenic regions. Additionally, the potential protein sequences were compared to the nonredundant protein sequence database using the BLASTP program [171] and to a six-frame translation of unfinished microbial genomes using the TBLASTN program [171], and those sequences that produced hits with E (expectation) values less than 0.01 were added to the protein set after an examination of the alignments. Finally, protein-coding regions were predicted using the GeneMarkS program [173]. After manual refinement, the genes predicted with these methods in the regions between evolutionarily conserved genes were added to produce the final protein set. Protein function prediction was based primarily on the COG assignments. In addition, searches for conserved domains were performed using the Conserved Domain Database (CDD) search option of BLAST (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) and the SMART system [174], and in-depth, iterative database searches were performed using thePSI-BLAST program [175]. The KEGG database [176] (http://www.genome.ad.jp/kegg/metabolism.html) and the Integrated Genomics ERGO database pathway collection [168] were used, in addition to the COGs, for the reconstruction of metabolic pathways. Paralogous protein families were identified by single-linkage clustering after comparing the predicted protein set to itself using the BLASTP program [171]. Signal peptides in proteins were predicted using the SignalP program [177], and transmembrane helices were predicted using the MEMSAT program [178]. Gene orders in bacterial genomes were compared using the Lamarck program [179].\nTwo closely related genome sequences were completed and published since the above comparative analysis was undertaken [180,181].\n"}