PMC:7033698 / 2861-6004 JSONTXT

Annnotations TAB JSON ListView MergeView

    LitCovid-PD-FMA-UBERON

    {"project":"LitCovid-PD-FMA-UBERON","denotations":[{"id":"T21","span":{"begin":98,"end":101},"obj":"Body_part"},{"id":"T22","span":{"begin":215,"end":218},"obj":"Body_part"},{"id":"T23","span":{"begin":284,"end":287},"obj":"Body_part"},{"id":"T24","span":{"begin":551,"end":554},"obj":"Body_part"},{"id":"T25","span":{"begin":810,"end":813},"obj":"Body_part"},{"id":"T26","span":{"begin":959,"end":962},"obj":"Body_part"},{"id":"T27","span":{"begin":1055,"end":1058},"obj":"Body_part"},{"id":"T28","span":{"begin":1184,"end":1187},"obj":"Body_part"},{"id":"T29","span":{"begin":1209,"end":1212},"obj":"Body_part"},{"id":"T30","span":{"begin":1215,"end":1221},"obj":"Body_part"},{"id":"T31","span":{"begin":1355,"end":1358},"obj":"Body_part"},{"id":"T32","span":{"begin":1425,"end":1428},"obj":"Body_part"},{"id":"T33","span":{"begin":1560,"end":1563},"obj":"Body_part"},{"id":"T34","span":{"begin":1694,"end":1697},"obj":"Body_part"},{"id":"T35","span":{"begin":1731,"end":1734},"obj":"Body_part"},{"id":"T36","span":{"begin":1746,"end":1758},"obj":"Body_part"},{"id":"T37","span":{"begin":1888,"end":1891},"obj":"Body_part"},{"id":"T38","span":{"begin":1903,"end":1915},"obj":"Body_part"},{"id":"T39","span":{"begin":1999,"end":2002},"obj":"Body_part"},{"id":"T40","span":{"begin":2111,"end":2114},"obj":"Body_part"},{"id":"T41","span":{"begin":2231,"end":2234},"obj":"Body_part"},{"id":"T42","span":{"begin":2237,"end":2244},"obj":"Body_part"},{"id":"T43","span":{"begin":2306,"end":2309},"obj":"Body_part"},{"id":"T44","span":{"begin":2323,"end":2326},"obj":"Body_part"},{"id":"T45","span":{"begin":2400,"end":2406},"obj":"Body_part"},{"id":"T46","span":{"begin":2503,"end":2507},"obj":"Body_part"},{"id":"T47","span":{"begin":2638,"end":2641},"obj":"Body_part"},{"id":"T48","span":{"begin":2781,"end":2784},"obj":"Body_part"},{"id":"T49","span":{"begin":3080,"end":3083},"obj":"Body_part"}],"attributes":[{"id":"A21","pred":"fma_id","subj":"T21","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A22","pred":"fma_id","subj":"T22","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A23","pred":"fma_id","subj":"T23","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A24","pred":"fma_id","subj":"T24","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A25","pred":"fma_id","subj":"T25","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A26","pred":"fma_id","subj":"T26","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A27","pred":"fma_id","subj":"T27","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A28","pred":"fma_id","subj":"T28","obj":"http://purl.org/sig/ont/fma/fma63011"},{"id":"A29","pred":"fma_id","subj":"T29","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A30","pred":"fma_id","subj":"T30","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A31","pred":"fma_id","subj":"T31","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A32","pred":"fma_id","subj":"T32","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A33","pred":"fma_id","subj":"T33","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A34","pred":"fma_id","subj":"T34","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A35","pred":"fma_id","subj":"T35","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A36","pred":"fma_id","subj":"T36","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A37","pred":"fma_id","subj":"T37","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A38","pred":"fma_id","subj":"T38","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A39","pred":"fma_id","subj":"T39","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A40","pred":"fma_id","subj":"T40","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A41","pred":"fma_id","subj":"T41","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A42","pred":"fma_id","subj":"T42","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A43","pred":"fma_id","subj":"T43","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A44","pred":"fma_id","subj":"T44","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A45","pred":"fma_id","subj":"T45","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A46","pred":"fma_id","subj":"T46","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A47","pred":"fma_id","subj":"T47","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A48","pred":"fma_id","subj":"T48","obj":"http://purl.org/sig/ont/fma/fma278683"},{"id":"A49","pred":"fma_id","subj":"T49","obj":"http://purl.org/sig/ont/fma/fma63011"}],"text":"Current report conducted careful examination of the sequences of 2019-nCoV, other CoV viruses and HIV-1 as well as GenBank database. Our results demonstrated no evidence that the sequences of these four inserts are HIV-1 specific or the 2019-nCoV viruses obtain these insertions from HIV-1. First, the results of blast search of these motifs against GenBank shows that the top 100 identical or highly homologous hits are all from host genes of mammalian, insects, bacterial and others. There are only a few hits on coronaviruses, but none of them are HIV-1 related. Blast against viral sequence database also showed these insertion sequences widely exist in all kinds of viruses from bacteriophage, influenza, to giant eukaryotic viruses (Table 1). More hits were found for coronaviruses and a few also hit on HIV-1 sequences than the search against the entire database (Table 1). However, while the 100% match between the insertion 1 and 2 sequences and the HIV sequences were found in 19 entries, the matches between the insertion 3 and 4 sequences and HIV-1 sequences were rather poor (from 42% to 88%). Moreover, the insertion 4 sequence ambiguously hit multiple different genes (gag, pol and env) in the HIV-1 genome, suggesting that similarities (as low as 42%) between them are too low to be reliable. Search these four insertion sequences against HIV-1 Sequence Database (https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html) yielded similar results. Sequences that completely match the insertion 3 and 4 sequences were not found in any HIV-1 sequences. This clearly shows that these insertioin sequences are widely present in living organisms including viruses, but not HIV-1 specific. All these regions in HIV-1 envelope glycoprotein are highly variable with many large insertions and deletions, indicating that they are not essential for biological functions of HIV-1 envelope glycoprotein. The detection of completely matched sequences of 1 and 2 insertions in only a few HIV-1 strains demonstrated that four insertions are very rare or not present among tens of thousands of natural HIV-1 sequences. This also explains why four insertion homolog sequences could only be independently found in different HIV-1 genomes [8]. Because of their poor identities to and rareness in the HIV-1 sequences, HIV-1 could not be the source for those insertion sequences in the 2019-nCoV genome.\nTable 1. Blast search results of four insertion sequences against sequence databases.\nDatabase Gene source Insertion 1 TNGTKR Insertion 2 HKNNKS Insertion 3 RSYLTPGDSSSG Insertion 4 QTNSPRRA\nWhole database CoV 2 (2) 0 3 (3) 2 (2)\nHIV-1 0 0 0 0\nProkaryotic 27 (27) 3 (3) 74 (0) 66 (1)\nEukaryotic 71 (71) 97 (97) 23 (0) 32 (1)\nOnly viral database CoV 3 (3) 3 (3) 5 (3) 3 (2)\nHIV-1 18 (18) 1 (1) 4 (0)* 6 (0)**\nOther Eukaryotic viruses 49 (2) 66 (8) 69 (0) 62 (0)\nProkaryotic viruses 29 (13) 30 (1) 21 (0) 28 (0)\nUnclassified virus 1 (1) 0 1 (0) 1 (0)\nTop 100 hits are analyzed and the numbers of 100% matches are shown in parentheses. * Similarity at 67%; ** Random hits in Gag, Pro and Env sequences with similarity between 42% and 88%."}

    LitCovid-PD-MONDO

    {"project":"LitCovid-PD-MONDO","denotations":[{"id":"T6","span":{"begin":699,"end":708},"obj":"Disease"}],"attributes":[{"id":"A6","pred":"mondo_id","subj":"T6","obj":"http://purl.obolibrary.org/obo/MONDO_0005812"}],"text":"Current report conducted careful examination of the sequences of 2019-nCoV, other CoV viruses and HIV-1 as well as GenBank database. Our results demonstrated no evidence that the sequences of these four inserts are HIV-1 specific or the 2019-nCoV viruses obtain these insertions from HIV-1. First, the results of blast search of these motifs against GenBank shows that the top 100 identical or highly homologous hits are all from host genes of mammalian, insects, bacterial and others. There are only a few hits on coronaviruses, but none of them are HIV-1 related. Blast against viral sequence database also showed these insertion sequences widely exist in all kinds of viruses from bacteriophage, influenza, to giant eukaryotic viruses (Table 1). More hits were found for coronaviruses and a few also hit on HIV-1 sequences than the search against the entire database (Table 1). However, while the 100% match between the insertion 1 and 2 sequences and the HIV sequences were found in 19 entries, the matches between the insertion 3 and 4 sequences and HIV-1 sequences were rather poor (from 42% to 88%). Moreover, the insertion 4 sequence ambiguously hit multiple different genes (gag, pol and env) in the HIV-1 genome, suggesting that similarities (as low as 42%) between them are too low to be reliable. Search these four insertion sequences against HIV-1 Sequence Database (https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html) yielded similar results. Sequences that completely match the insertion 3 and 4 sequences were not found in any HIV-1 sequences. This clearly shows that these insertioin sequences are widely present in living organisms including viruses, but not HIV-1 specific. All these regions in HIV-1 envelope glycoprotein are highly variable with many large insertions and deletions, indicating that they are not essential for biological functions of HIV-1 envelope glycoprotein. The detection of completely matched sequences of 1 and 2 insertions in only a few HIV-1 strains demonstrated that four insertions are very rare or not present among tens of thousands of natural HIV-1 sequences. This also explains why four insertion homolog sequences could only be independently found in different HIV-1 genomes [8]. Because of their poor identities to and rareness in the HIV-1 sequences, HIV-1 could not be the source for those insertion sequences in the 2019-nCoV genome.\nTable 1. Blast search results of four insertion sequences against sequence databases.\nDatabase Gene source Insertion 1 TNGTKR Insertion 2 HKNNKS Insertion 3 RSYLTPGDSSSG Insertion 4 QTNSPRRA\nWhole database CoV 2 (2) 0 3 (3) 2 (2)\nHIV-1 0 0 0 0\nProkaryotic 27 (27) 3 (3) 74 (0) 66 (1)\nEukaryotic 71 (71) 97 (97) 23 (0) 32 (1)\nOnly viral database CoV 3 (3) 3 (3) 5 (3) 3 (2)\nHIV-1 18 (18) 1 (1) 4 (0)* 6 (0)**\nOther Eukaryotic viruses 49 (2) 66 (8) 69 (0) 62 (0)\nProkaryotic viruses 29 (13) 30 (1) 21 (0) 28 (0)\nUnclassified virus 1 (1) 0 1 (0) 1 (0)\nTop 100 hits are analyzed and the numbers of 100% matches are shown in parentheses. * Similarity at 67%; ** Random hits in Gag, Pro and Env sequences with similarity between 42% and 88%."}

    LitCovid-PD-CLO

    {"project":"LitCovid-PD-CLO","denotations":[{"id":"T33","span":{"begin":86,"end":93},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T34","span":{"begin":247,"end":254},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T35","span":{"begin":435,"end":440},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T36","span":{"begin":501,"end":502},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T37","span":{"begin":671,"end":678},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T38","span":{"begin":719,"end":729},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_2759"},{"id":"T39","span":{"begin":730,"end":737},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T40","span":{"begin":792,"end":793},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T41","span":{"begin":1177,"end":1182},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T42","span":{"begin":1657,"end":1666},"obj":"http://purl.obolibrary.org/obo/OBI_0100026"},{"id":"T43","span":{"begin":1657,"end":1666},"obj":"http://purl.obolibrary.org/obo/UBERON_0000468"},{"id":"T44","span":{"begin":1677,"end":1684},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T45","span":{"begin":1993,"end":1994},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T46","span":{"begin":2503,"end":2507},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T47","span":{"begin":2618,"end":2623},"obj":"http://purl.obolibrary.org/obo/CLO_0050507"},{"id":"T48","span":{"begin":2632,"end":2637},"obj":"http://purl.obolibrary.org/obo/CLO_0050507"},{"id":"T49","span":{"begin":2664,"end":2666},"obj":"http://purl.obolibrary.org/obo/CLO_0050509"},{"id":"T50","span":{"begin":2668,"end":2670},"obj":"http://purl.obolibrary.org/obo/CLO_0050509"},{"id":"T51","span":{"begin":2692,"end":2702},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_2759"},{"id":"T52","span":{"begin":2703,"end":2705},"obj":"http://purl.obolibrary.org/obo/CLO_0054055"},{"id":"T53","span":{"begin":2707,"end":2709},"obj":"http://purl.obolibrary.org/obo/CLO_0054055"},{"id":"T54","span":{"begin":2765,"end":2770},"obj":"http://purl.obolibrary.org/obo/CLO_0001000"},{"id":"T55","span":{"begin":2787,"end":2789},"obj":"http://purl.obolibrary.org/obo/CLO_0050510"},{"id":"T56","span":{"begin":2790,"end":2796},"obj":"http://purl.obolibrary.org/obo/CLO_0054057"},{"id":"T57","span":{"begin":2791,"end":2793},"obj":"http://purl.obolibrary.org/obo/CLO_0050510"},{"id":"T58","span":{"begin":2822,"end":2832},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_2759"},{"id":"T59","span":{"begin":2833,"end":2840},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T60","span":{"begin":2881,"end":2888},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T61","span":{"begin":2900,"end":2906},"obj":"http://purl.obolibrary.org/obo/CLO_0001053"},{"id":"T62","span":{"begin":2931,"end":2936},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T63","span":{"begin":2937,"end":2942},"obj":"http://purl.obolibrary.org/obo/CLO_0053733"}],"text":"Current report conducted careful examination of the sequences of 2019-nCoV, other CoV viruses and HIV-1 as well as GenBank database. Our results demonstrated no evidence that the sequences of these four inserts are HIV-1 specific or the 2019-nCoV viruses obtain these insertions from HIV-1. First, the results of blast search of these motifs against GenBank shows that the top 100 identical or highly homologous hits are all from host genes of mammalian, insects, bacterial and others. There are only a few hits on coronaviruses, but none of them are HIV-1 related. Blast against viral sequence database also showed these insertion sequences widely exist in all kinds of viruses from bacteriophage, influenza, to giant eukaryotic viruses (Table 1). More hits were found for coronaviruses and a few also hit on HIV-1 sequences than the search against the entire database (Table 1). However, while the 100% match between the insertion 1 and 2 sequences and the HIV sequences were found in 19 entries, the matches between the insertion 3 and 4 sequences and HIV-1 sequences were rather poor (from 42% to 88%). Moreover, the insertion 4 sequence ambiguously hit multiple different genes (gag, pol and env) in the HIV-1 genome, suggesting that similarities (as low as 42%) between them are too low to be reliable. Search these four insertion sequences against HIV-1 Sequence Database (https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html) yielded similar results. Sequences that completely match the insertion 3 and 4 sequences were not found in any HIV-1 sequences. This clearly shows that these insertioin sequences are widely present in living organisms including viruses, but not HIV-1 specific. All these regions in HIV-1 envelope glycoprotein are highly variable with many large insertions and deletions, indicating that they are not essential for biological functions of HIV-1 envelope glycoprotein. The detection of completely matched sequences of 1 and 2 insertions in only a few HIV-1 strains demonstrated that four insertions are very rare or not present among tens of thousands of natural HIV-1 sequences. This also explains why four insertion homolog sequences could only be independently found in different HIV-1 genomes [8]. Because of their poor identities to and rareness in the HIV-1 sequences, HIV-1 could not be the source for those insertion sequences in the 2019-nCoV genome.\nTable 1. Blast search results of four insertion sequences against sequence databases.\nDatabase Gene source Insertion 1 TNGTKR Insertion 2 HKNNKS Insertion 3 RSYLTPGDSSSG Insertion 4 QTNSPRRA\nWhole database CoV 2 (2) 0 3 (3) 2 (2)\nHIV-1 0 0 0 0\nProkaryotic 27 (27) 3 (3) 74 (0) 66 (1)\nEukaryotic 71 (71) 97 (97) 23 (0) 32 (1)\nOnly viral database CoV 3 (3) 3 (3) 5 (3) 3 (2)\nHIV-1 18 (18) 1 (1) 4 (0)* 6 (0)**\nOther Eukaryotic viruses 49 (2) 66 (8) 69 (0) 62 (0)\nProkaryotic viruses 29 (13) 30 (1) 21 (0) 28 (0)\nUnclassified virus 1 (1) 0 1 (0) 1 (0)\nTop 100 hits are analyzed and the numbers of 100% matches are shown in parentheses. * Similarity at 67%; ** Random hits in Gag, Pro and Env sequences with similarity between 42% and 88%."}

    LitCovid-PD-CHEBI

    {"project":"LitCovid-PD-CHEBI","denotations":[{"id":"T5","span":{"begin":1746,"end":1758},"obj":"Chemical"},{"id":"T6","span":{"begin":1903,"end":1915},"obj":"Chemical"},{"id":"T7","span":{"begin":3085,"end":3088},"obj":"Chemical"}],"attributes":[{"id":"A5","pred":"chebi_id","subj":"T5","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A6","pred":"chebi_id","subj":"T6","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A7","pred":"chebi_id","subj":"T7","obj":"http://purl.obolibrary.org/obo/CHEBI_50342"}],"text":"Current report conducted careful examination of the sequences of 2019-nCoV, other CoV viruses and HIV-1 as well as GenBank database. Our results demonstrated no evidence that the sequences of these four inserts are HIV-1 specific or the 2019-nCoV viruses obtain these insertions from HIV-1. First, the results of blast search of these motifs against GenBank shows that the top 100 identical or highly homologous hits are all from host genes of mammalian, insects, bacterial and others. There are only a few hits on coronaviruses, but none of them are HIV-1 related. Blast against viral sequence database also showed these insertion sequences widely exist in all kinds of viruses from bacteriophage, influenza, to giant eukaryotic viruses (Table 1). More hits were found for coronaviruses and a few also hit on HIV-1 sequences than the search against the entire database (Table 1). However, while the 100% match between the insertion 1 and 2 sequences and the HIV sequences were found in 19 entries, the matches between the insertion 3 and 4 sequences and HIV-1 sequences were rather poor (from 42% to 88%). Moreover, the insertion 4 sequence ambiguously hit multiple different genes (gag, pol and env) in the HIV-1 genome, suggesting that similarities (as low as 42%) between them are too low to be reliable. Search these four insertion sequences against HIV-1 Sequence Database (https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html) yielded similar results. Sequences that completely match the insertion 3 and 4 sequences were not found in any HIV-1 sequences. This clearly shows that these insertioin sequences are widely present in living organisms including viruses, but not HIV-1 specific. All these regions in HIV-1 envelope glycoprotein are highly variable with many large insertions and deletions, indicating that they are not essential for biological functions of HIV-1 envelope glycoprotein. The detection of completely matched sequences of 1 and 2 insertions in only a few HIV-1 strains demonstrated that four insertions are very rare or not present among tens of thousands of natural HIV-1 sequences. This also explains why four insertion homolog sequences could only be independently found in different HIV-1 genomes [8]. Because of their poor identities to and rareness in the HIV-1 sequences, HIV-1 could not be the source for those insertion sequences in the 2019-nCoV genome.\nTable 1. Blast search results of four insertion sequences against sequence databases.\nDatabase Gene source Insertion 1 TNGTKR Insertion 2 HKNNKS Insertion 3 RSYLTPGDSSSG Insertion 4 QTNSPRRA\nWhole database CoV 2 (2) 0 3 (3) 2 (2)\nHIV-1 0 0 0 0\nProkaryotic 27 (27) 3 (3) 74 (0) 66 (1)\nEukaryotic 71 (71) 97 (97) 23 (0) 32 (1)\nOnly viral database CoV 3 (3) 3 (3) 5 (3) 3 (2)\nHIV-1 18 (18) 1 (1) 4 (0)* 6 (0)**\nOther Eukaryotic viruses 49 (2) 66 (8) 69 (0) 62 (0)\nProkaryotic viruses 29 (13) 30 (1) 21 (0) 28 (0)\nUnclassified virus 1 (1) 0 1 (0) 1 (0)\nTop 100 hits are analyzed and the numbers of 100% matches are shown in parentheses. * Similarity at 67%; ** Random hits in Gag, Pro and Env sequences with similarity between 42% and 88%."}

    LitCovid-sentences

    {"project":"LitCovid-sentences","denotations":[{"id":"T24","span":{"begin":0,"end":132},"obj":"Sentence"},{"id":"T25","span":{"begin":133,"end":290},"obj":"Sentence"},{"id":"T26","span":{"begin":291,"end":485},"obj":"Sentence"},{"id":"T27","span":{"begin":486,"end":565},"obj":"Sentence"},{"id":"T28","span":{"begin":566,"end":748},"obj":"Sentence"},{"id":"T29","span":{"begin":749,"end":880},"obj":"Sentence"},{"id":"T30","span":{"begin":881,"end":1106},"obj":"Sentence"},{"id":"T31","span":{"begin":1107,"end":1308},"obj":"Sentence"},{"id":"T32","span":{"begin":1309,"end":1473},"obj":"Sentence"},{"id":"T33","span":{"begin":1474,"end":1576},"obj":"Sentence"},{"id":"T34","span":{"begin":1577,"end":1709},"obj":"Sentence"},{"id":"T35","span":{"begin":1710,"end":1916},"obj":"Sentence"},{"id":"T36","span":{"begin":1917,"end":2127},"obj":"Sentence"},{"id":"T37","span":{"begin":2128,"end":2249},"obj":"Sentence"},{"id":"T38","span":{"begin":2250,"end":2407},"obj":"Sentence"},{"id":"T39","span":{"begin":2408,"end":2416},"obj":"Sentence"},{"id":"T40","span":{"begin":2417,"end":2493},"obj":"Sentence"},{"id":"T41","span":{"begin":2494,"end":2598},"obj":"Sentence"},{"id":"T42","span":{"begin":2599,"end":2637},"obj":"Sentence"},{"id":"T43","span":{"begin":2638,"end":2651},"obj":"Sentence"},{"id":"T44","span":{"begin":2652,"end":2691},"obj":"Sentence"},{"id":"T45","span":{"begin":2692,"end":2732},"obj":"Sentence"},{"id":"T46","span":{"begin":2733,"end":2780},"obj":"Sentence"},{"id":"T47","span":{"begin":2781,"end":2815},"obj":"Sentence"},{"id":"T48","span":{"begin":2816,"end":2868},"obj":"Sentence"},{"id":"T49","span":{"begin":2869,"end":2917},"obj":"Sentence"},{"id":"T50","span":{"begin":2918,"end":2956},"obj":"Sentence"},{"id":"T51","span":{"begin":2957,"end":3143},"obj":"Sentence"}],"namespaces":[{"prefix":"_base","uri":"http://pubannotation.org/ontology/tao.owl#"}],"text":"Current report conducted careful examination of the sequences of 2019-nCoV, other CoV viruses and HIV-1 as well as GenBank database. Our results demonstrated no evidence that the sequences of these four inserts are HIV-1 specific or the 2019-nCoV viruses obtain these insertions from HIV-1. First, the results of blast search of these motifs against GenBank shows that the top 100 identical or highly homologous hits are all from host genes of mammalian, insects, bacterial and others. There are only a few hits on coronaviruses, but none of them are HIV-1 related. Blast against viral sequence database also showed these insertion sequences widely exist in all kinds of viruses from bacteriophage, influenza, to giant eukaryotic viruses (Table 1). More hits were found for coronaviruses and a few also hit on HIV-1 sequences than the search against the entire database (Table 1). However, while the 100% match between the insertion 1 and 2 sequences and the HIV sequences were found in 19 entries, the matches between the insertion 3 and 4 sequences and HIV-1 sequences were rather poor (from 42% to 88%). Moreover, the insertion 4 sequence ambiguously hit multiple different genes (gag, pol and env) in the HIV-1 genome, suggesting that similarities (as low as 42%) between them are too low to be reliable. Search these four insertion sequences against HIV-1 Sequence Database (https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html) yielded similar results. Sequences that completely match the insertion 3 and 4 sequences were not found in any HIV-1 sequences. This clearly shows that these insertioin sequences are widely present in living organisms including viruses, but not HIV-1 specific. All these regions in HIV-1 envelope glycoprotein are highly variable with many large insertions and deletions, indicating that they are not essential for biological functions of HIV-1 envelope glycoprotein. The detection of completely matched sequences of 1 and 2 insertions in only a few HIV-1 strains demonstrated that four insertions are very rare or not present among tens of thousands of natural HIV-1 sequences. This also explains why four insertion homolog sequences could only be independently found in different HIV-1 genomes [8]. Because of their poor identities to and rareness in the HIV-1 sequences, HIV-1 could not be the source for those insertion sequences in the 2019-nCoV genome.\nTable 1. Blast search results of four insertion sequences against sequence databases.\nDatabase Gene source Insertion 1 TNGTKR Insertion 2 HKNNKS Insertion 3 RSYLTPGDSSSG Insertion 4 QTNSPRRA\nWhole database CoV 2 (2) 0 3 (3) 2 (2)\nHIV-1 0 0 0 0\nProkaryotic 27 (27) 3 (3) 74 (0) 66 (1)\nEukaryotic 71 (71) 97 (97) 23 (0) 32 (1)\nOnly viral database CoV 3 (3) 3 (3) 5 (3) 3 (2)\nHIV-1 18 (18) 1 (1) 4 (0)* 6 (0)**\nOther Eukaryotic viruses 49 (2) 66 (8) 69 (0) 62 (0)\nProkaryotic viruses 29 (13) 30 (1) 21 (0) 28 (0)\nUnclassified virus 1 (1) 0 1 (0) 1 (0)\nTop 100 hits are analyzed and the numbers of 100% matches are shown in parentheses. * Similarity at 67%; ** Random hits in Gag, Pro and Env sequences with similarity between 42% and 88%."}

    LitCovid-PubTator

    {"project":"LitCovid-PubTator","denotations":[{"id":"74","span":{"begin":2614,"end":2619},"obj":"Species"},{"id":"75","span":{"begin":2638,"end":2643},"obj":"Species"},{"id":"76","span":{"begin":2515,"end":2524},"obj":"Species"},{"id":"77","span":{"begin":2534,"end":2543},"obj":"Species"},{"id":"78","span":{"begin":2553,"end":2562},"obj":"Species"},{"id":"79","span":{"begin":2578,"end":2587},"obj":"Species"},{"id":"81","span":{"begin":2446,"end":2465},"obj":"Species"},{"id":"85","span":{"begin":3080,"end":3083},"obj":"Gene"},{"id":"86","span":{"begin":3093,"end":3096},"obj":"Gene"},{"id":"87","span":{"begin":3085,"end":3088},"obj":"Chemical"},{"id":"123","span":{"begin":1197,"end":1200},"obj":"Gene"},{"id":"124","span":{"begin":1184,"end":1187},"obj":"Gene"},{"id":"125","span":{"begin":65,"end":74},"obj":"Species"},{"id":"126","span":{"begin":82,"end":85},"obj":"Species"},{"id":"127","span":{"begin":98,"end":103},"obj":"Species"},{"id":"128","span":{"begin":215,"end":220},"obj":"Species"},{"id":"129","span":{"begin":237,"end":246},"obj":"Species"},{"id":"130","span":{"begin":284,"end":289},"obj":"Species"},{"id":"131","span":{"begin":444,"end":453},"obj":"Species"},{"id":"132","span":{"begin":515,"end":528},"obj":"Species"},{"id":"133","span":{"begin":551,"end":556},"obj":"Species"},{"id":"134","span":{"begin":622,"end":641},"obj":"Species"},{"id":"135","span":{"begin":684,"end":697},"obj":"Species"},{"id":"136","span":{"begin":774,"end":787},"obj":"Species"},{"id":"137","span":{"begin":810,"end":815},"obj":"Species"},{"id":"138","span":{"begin":1055,"end":1060},"obj":"Species"},{"id":"139","span":{"begin":1209,"end":1214},"obj":"Species"},{"id":"140","span":{"begin":1327,"end":1346},"obj":"Species"},{"id":"141","span":{"begin":1355,"end":1360},"obj":"Species"},{"id":"142","span":{"begin":1560,"end":1565},"obj":"Species"},{"id":"143","span":{"begin":1694,"end":1699},"obj":"Species"},{"id":"144","span":{"begin":1731,"end":1736},"obj":"Species"},{"id":"145","span":{"begin":1888,"end":1893},"obj":"Species"},{"id":"146","span":{"begin":1999,"end":2004},"obj":"Species"},{"id":"147","span":{"begin":2111,"end":2116},"obj":"Species"},{"id":"148","span":{"begin":2231,"end":2236},"obj":"Species"},{"id":"149","span":{"begin":2306,"end":2311},"obj":"Species"},{"id":"150","span":{"begin":2323,"end":2328},"obj":"Species"},{"id":"151","span":{"begin":2363,"end":2382},"obj":"Species"},{"id":"152","span":{"begin":2390,"end":2399},"obj":"Species"},{"id":"153","span":{"begin":923,"end":932},"obj":"Species"},{"id":"154","span":{"begin":1023,"end":1032},"obj":"Species"},{"id":"155","span":{"begin":1121,"end":1130},"obj":"Species"},{"id":"156","span":{"begin":1510,"end":1519},"obj":"Species"},{"id":"157","span":{"begin":2156,"end":2165},"obj":"Species"}],"attributes":[{"id":"A74","pred":"tao:has_database_id","subj":"74","obj":"Tax:2697049"},{"id":"A75","pred":"tao:has_database_id","subj":"75","obj":"Tax:11676"},{"id":"A76","pred":"tao:has_database_id","subj":"76","obj":"Tax:2673"},{"id":"A77","pred":"tao:has_database_id","subj":"77","obj":"Tax:2673"},{"id":"A78","pred":"tao:has_database_id","subj":"78","obj":"Tax:2673"},{"id":"A79","pred":"tao:has_database_id","subj":"79","obj":"Tax:2673"},{"id":"A81","pred":"tao:has_database_id","subj":"81","obj":"Tax:2673"},{"id":"A85","pred":"tao:has_database_id","subj":"85","obj":"Gene:155030"},{"id":"A86","pred":"tao:has_database_id","subj":"86","obj":"Gene:100616444"},{"id":"A87","pred":"tao:has_database_id","subj":"87","obj":"MESH:D011392"},{"id":"A123","pred":"tao:has_database_id","subj":"123","obj":"Gene:100616444"},{"id":"A124","pred":"tao:has_database_id","subj":"124","obj":"Gene:155030"},{"id":"A125","pred":"tao:has_database_id","subj":"125","obj":"Tax:2697049"},{"id":"A126","pred":"tao:has_database_id","subj":"126","obj":"Tax:11118"},{"id":"A127","pred":"tao:has_database_id","subj":"127","obj":"Tax:11676"},{"id":"A128","pred":"tao:has_database_id","subj":"128","obj":"Tax:11676"},{"id":"A129","pred":"tao:has_database_id","subj":"129","obj":"Tax:2697049"},{"id":"A130","pred":"tao:has_database_id","subj":"130","obj":"Tax:11676"},{"id":"A131","pred":"tao:has_database_id","subj":"131","obj":"Tax:9606"},{"id":"A132","pred":"tao:has_database_id","subj":"132","obj":"Tax:11118"},{"id":"A133","pred":"tao:has_database_id","subj":"133","obj":"Tax:11676"},{"id":"A134","pred":"tao:has_database_id","subj":"134","obj":"Tax:2673"},{"id":"A135","pred":"tao:has_database_id","subj":"135","obj":"Tax:38018"},{"id":"A136","pred":"tao:has_database_id","subj":"136","obj":"Tax:11118"},{"id":"A137","pred":"tao:has_database_id","subj":"137","obj":"Tax:11676"},{"id":"A138","pred":"tao:has_database_id","subj":"138","obj":"Tax:11676"},{"id":"A139","pred":"tao:has_database_id","subj":"139","obj":"Tax:11676"},{"id":"A140","pred":"tao:has_database_id","subj":"140","obj":"Tax:2673"},{"id":"A141","pred":"tao:has_database_id","subj":"141","obj":"Tax:11676"},{"id":"A142","pred":"tao:has_database_id","subj":"142","obj":"Tax:11676"},{"id":"A143","pred":"tao:has_database_id","subj":"143","obj":"Tax:11676"},{"id":"A144","pred":"tao:has_database_id","subj":"144","obj":"Tax:11676"},{"id":"A145","pred":"tao:has_database_id","subj":"145","obj":"Tax:11676"},{"id":"A146","pred":"tao:has_database_id","subj":"146","obj":"Tax:11676"},{"id":"A147","pred":"tao:has_database_id","subj":"147","obj":"Tax:11676"},{"id":"A148","pred":"tao:has_database_id","subj":"148","obj":"Tax:11676"},{"id":"A149","pred":"tao:has_database_id","subj":"149","obj":"Tax:11676"},{"id":"A150","pred":"tao:has_database_id","subj":"150","obj":"Tax:11676"},{"id":"A151","pred":"tao:has_database_id","subj":"151","obj":"Tax:2673"},{"id":"A152","pred":"tao:has_database_id","subj":"152","obj":"Tax:2697049"},{"id":"A153","pred":"tao:has_database_id","subj":"153","obj":"Tax:2673"},{"id":"A154","pred":"tao:has_database_id","subj":"154","obj":"Tax:2673"},{"id":"A155","pred":"tao:has_database_id","subj":"155","obj":"Tax:2673"},{"id":"A156","pred":"tao:has_database_id","subj":"156","obj":"Tax:2673"},{"id":"A157","pred":"tao:has_database_id","subj":"157","obj":"Tax:2673"}],"namespaces":[{"prefix":"Tax","uri":"https://www.ncbi.nlm.nih.gov/taxonomy/"},{"prefix":"MESH","uri":"https://id.nlm.nih.gov/mesh/"},{"prefix":"Gene","uri":"https://www.ncbi.nlm.nih.gov/gene/"},{"prefix":"CVCL","uri":"https://web.expasy.org/cellosaurus/CVCL_"}],"text":"Current report conducted careful examination of the sequences of 2019-nCoV, other CoV viruses and HIV-1 as well as GenBank database. Our results demonstrated no evidence that the sequences of these four inserts are HIV-1 specific or the 2019-nCoV viruses obtain these insertions from HIV-1. First, the results of blast search of these motifs against GenBank shows that the top 100 identical or highly homologous hits are all from host genes of mammalian, insects, bacterial and others. There are only a few hits on coronaviruses, but none of them are HIV-1 related. Blast against viral sequence database also showed these insertion sequences widely exist in all kinds of viruses from bacteriophage, influenza, to giant eukaryotic viruses (Table 1). More hits were found for coronaviruses and a few also hit on HIV-1 sequences than the search against the entire database (Table 1). However, while the 100% match between the insertion 1 and 2 sequences and the HIV sequences were found in 19 entries, the matches between the insertion 3 and 4 sequences and HIV-1 sequences were rather poor (from 42% to 88%). Moreover, the insertion 4 sequence ambiguously hit multiple different genes (gag, pol and env) in the HIV-1 genome, suggesting that similarities (as low as 42%) between them are too low to be reliable. Search these four insertion sequences against HIV-1 Sequence Database (https://www.hiv.lanl.gov/components/sequence/HIV/search/search.html) yielded similar results. Sequences that completely match the insertion 3 and 4 sequences were not found in any HIV-1 sequences. This clearly shows that these insertioin sequences are widely present in living organisms including viruses, but not HIV-1 specific. All these regions in HIV-1 envelope glycoprotein are highly variable with many large insertions and deletions, indicating that they are not essential for biological functions of HIV-1 envelope glycoprotein. The detection of completely matched sequences of 1 and 2 insertions in only a few HIV-1 strains demonstrated that four insertions are very rare or not present among tens of thousands of natural HIV-1 sequences. This also explains why four insertion homolog sequences could only be independently found in different HIV-1 genomes [8]. Because of their poor identities to and rareness in the HIV-1 sequences, HIV-1 could not be the source for those insertion sequences in the 2019-nCoV genome.\nTable 1. Blast search results of four insertion sequences against sequence databases.\nDatabase Gene source Insertion 1 TNGTKR Insertion 2 HKNNKS Insertion 3 RSYLTPGDSSSG Insertion 4 QTNSPRRA\nWhole database CoV 2 (2) 0 3 (3) 2 (2)\nHIV-1 0 0 0 0\nProkaryotic 27 (27) 3 (3) 74 (0) 66 (1)\nEukaryotic 71 (71) 97 (97) 23 (0) 32 (1)\nOnly viral database CoV 3 (3) 3 (3) 5 (3) 3 (2)\nHIV-1 18 (18) 1 (1) 4 (0)* 6 (0)**\nOther Eukaryotic viruses 49 (2) 66 (8) 69 (0) 62 (0)\nProkaryotic viruses 29 (13) 30 (1) 21 (0) 28 (0)\nUnclassified virus 1 (1) 0 1 (0) 1 (0)\nTop 100 hits are analyzed and the numbers of 100% matches are shown in parentheses. * Similarity at 67%; ** Random hits in Gag, Pro and Env sequences with similarity between 42% and 88%."}