PMC:7105881 / 9904-18333
Annnotations
LitCovid-PubTator
{"project":"LitCovid-PubTator","denotations":[{"id":"384","span":{"begin":19,"end":24},"obj":"Species"},{"id":"385","span":{"begin":25,"end":38},"obj":"Species"},{"id":"411","span":{"begin":728,"end":735},"obj":"Gene"},{"id":"412","span":{"begin":782,"end":786},"obj":"Gene"},{"id":"413","span":{"begin":873,"end":877},"obj":"Gene"},{"id":"414","span":{"begin":916,"end":920},"obj":"Gene"},{"id":"415","span":{"begin":1350,"end":1357},"obj":"Gene"},{"id":"416","span":{"begin":1563,"end":1567},"obj":"Gene"},{"id":"417","span":{"begin":1812,"end":1816},"obj":"Gene"},{"id":"418","span":{"begin":1818,"end":1823},"obj":"Gene"},{"id":"419","span":{"begin":1572,"end":1577},"obj":"Gene"},{"id":"420","span":{"begin":100,"end":105},"obj":"Species"},{"id":"421","span":{"begin":106,"end":110},"obj":"Species"},{"id":"422","span":{"begin":227,"end":231},"obj":"Species"},{"id":"423","span":{"begin":233,"end":238},"obj":"Species"},{"id":"424","span":{"begin":239,"end":243},"obj":"Species"},{"id":"425","span":{"begin":1254,"end":1258},"obj":"Species"},{"id":"426","span":{"begin":1268,"end":1277},"obj":"Species"},{"id":"427","span":{"begin":1282,"end":1291},"obj":"Species"},{"id":"428","span":{"begin":1501,"end":1506},"obj":"Species"},{"id":"429","span":{"begin":1507,"end":1511},"obj":"Species"},{"id":"430","span":{"begin":1513,"end":1521},"obj":"Species"},{"id":"431","span":{"begin":1759,"end":1767},"obj":"Species"},{"id":"432","span":{"begin":1979,"end":1987},"obj":"Species"},{"id":"433","span":{"begin":1992,"end":2000},"obj":"Species"},{"id":"434","span":{"begin":2236,"end":2244},"obj":"Species"},{"id":"435","span":{"begin":2249,"end":2257},"obj":"Species"},{"id":"449","span":{"begin":2942,"end":2943},"obj":"Gene"},{"id":"450","span":{"begin":2362,"end":2370},"obj":"Species"},{"id":"451","span":{"begin":2372,"end":2380},"obj":"Species"},{"id":"452","span":{"begin":2386,"end":2395},"obj":"Species"},{"id":"453","span":{"begin":2446,"end":2454},"obj":"Species"},{"id":"454","span":{"begin":2456,"end":2464},"obj":"Species"},{"id":"455","span":{"begin":2470,"end":2479},"obj":"Species"},{"id":"456","span":{"begin":2787,"end":2795},"obj":"Species"},{"id":"457","span":{"begin":2824,"end":2832},"obj":"Species"},{"id":"458","span":{"begin":2869,"end":2878},"obj":"Species"},{"id":"459","span":{"begin":3039,"end":3047},"obj":"Species"},{"id":"460","span":{"begin":3049,"end":3057},"obj":"Species"},{"id":"461","span":{"begin":3063,"end":3072},"obj":"Species"},{"id":"476","span":{"begin":3128,"end":3133},"obj":"Species"},{"id":"477","span":{"begin":3134,"end":3138},"obj":"Species"},{"id":"478","span":{"begin":3607,"end":3616},"obj":"Species"},{"id":"479","span":{"begin":3817,"end":3822},"obj":"Species"},{"id":"480","span":{"begin":3823,"end":3827},"obj":"Species"},{"id":"481","span":{"begin":4785,"end":4790},"obj":"Species"},{"id":"482","span":{"begin":4791,"end":4795},"obj":"Species"},{"id":"483","span":{"begin":3569,"end":3579},"obj":"Species"},{"id":"484","span":{"begin":4405,"end":4410},"obj":"Chemical"},{"id":"485","span":{"begin":3157,"end":3172},"obj":"Disease"},{"id":"486","span":{"begin":3906,"end":3921},"obj":"Disease"},{"id":"487","span":{"begin":4392,"end":4396},"obj":"Disease"},{"id":"488","span":{"begin":4628,"end":4632},"obj":"Disease"},{"id":"489","span":{"begin":4957,"end":4966},"obj":"Disease"},{"id":"507","span":{"begin":5403,"end":5434},"obj":"Gene"},{"id":"508","span":{"begin":5436,"end":5440},"obj":"Gene"},{"id":"509","span":{"begin":5589,"end":5593},"obj":"Gene"},{"id":"510","span":{"begin":5749,"end":5753},"obj":"Gene"},{"id":"511","span":{"begin":5814,"end":5818},"obj":"Gene"},{"id":"512","span":{"begin":6054,"end":6058},"obj":"Gene"},{"id":"513","span":{"begin":6196,"end":6200},"obj":"Gene"},{"id":"514","span":{"begin":5285,"end":5294},"obj":"Species"},{"id":"515","span":{"begin":5329,"end":5337},"obj":"Species"},{"id":"516","span":{"begin":5743,"end":5748},"obj":"Species"},{"id":"517","span":{"begin":5755,"end":5763},"obj":"Species"},{"id":"518","span":{"begin":5808,"end":5813},"obj":"Species"},{"id":"519","span":{"begin":5931,"end":5939},"obj":"Species"},{"id":"520","span":{"begin":5994,"end":5998},"obj":"Species"},{"id":"521","span":{"begin":6031,"end":6035},"obj":"Species"},{"id":"522","span":{"begin":6048,"end":6053},"obj":"Species"},{"id":"523","span":{"begin":6155,"end":6163},"obj":"Species"},{"id":"555","span":{"begin":6519,"end":6541},"obj":"Gene"},{"id":"556","span":{"begin":6543,"end":6547},"obj":"Gene"},{"id":"557","span":{"begin":6657,"end":6661},"obj":"Gene"},{"id":"558","span":{"begin":7005,"end":7009},"obj":"Gene"},{"id":"559","span":{"begin":7232,"end":7236},"obj":"Gene"},{"id":"560","span":{"begin":7371,"end":7375},"obj":"Gene"},{"id":"561","span":{"begin":7637,"end":7641},"obj":"Gene"},{"id":"562","span":{"begin":6251,"end":6259},"obj":"Species"},{"id":"563","span":{"begin":6386,"end":6394},"obj":"Species"},{"id":"564","span":{"begin":6462,"end":6470},"obj":"Species"},{"id":"565","span":{"begin":6501,"end":6509},"obj":"Species"},{"id":"566","span":{"begin":6639,"end":6647},"obj":"Species"},{"id":"567","span":{"begin":6692,"end":6700},"obj":"Species"},{"id":"568","span":{"begin":6705,"end":6713},"obj":"Species"},{"id":"569","span":{"begin":6845,"end":6853},"obj":"Species"},{"id":"570","span":{"begin":6909,"end":6917},"obj":"Species"},{"id":"571","span":{"begin":6982,"end":6990},"obj":"Species"},{"id":"572","span":{"begin":7085,"end":7093},"obj":"Species"},{"id":"573","span":{"begin":7226,"end":7231},"obj":"Species"},{"id":"574","span":{"begin":7301,"end":7305},"obj":"Species"},{"id":"575","span":{"begin":7313,"end":7317},"obj":"Species"},{"id":"576","span":{"begin":7365,"end":7370},"obj":"Species"},{"id":"577","span":{"begin":7483,"end":7486},"obj":"Species"},{"id":"578","span":{"begin":7500,"end":7505},"obj":"Species"},{"id":"579","span":{"begin":7574,"end":7582},"obj":"Species"},{"id":"580","span":{"begin":7122,"end":7126},"obj":"Species"},{"id":"581","span":{"begin":7141,"end":7145},"obj":"Species"},{"id":"582","span":{"begin":7186,"end":7190},"obj":"Species"},{"id":"583","span":{"begin":7210,"end":7214},"obj":"Species"},{"id":"584","span":{"begin":6325,"end":6333},"obj":"Species"},{"id":"585","span":{"begin":7176,"end":7178},"obj":"Chemical"},{"id":"608","span":{"begin":7907,"end":7912},"obj":"Gene"},{"id":"609","span":{"begin":7930,"end":7937},"obj":"Gene"},{"id":"610","span":{"begin":7979,"end":7980},"obj":"Gene"},{"id":"611","span":{"begin":8246,"end":8250},"obj":"Gene"},{"id":"612","span":{"begin":7969,"end":7970},"obj":"Gene"},{"id":"613","span":{"begin":7972,"end":7973},"obj":"Gene"},{"id":"614","span":{"begin":7966,"end":7967},"obj":"Gene"},{"id":"615","span":{"begin":7733,"end":7738},"obj":"Species"},{"id":"616","span":{"begin":7787,"end":7796},"obj":"Species"},{"id":"617","span":{"begin":7845,"end":7853},"obj":"Species"},{"id":"618","span":{"begin":7889,"end":7898},"obj":"Species"},{"id":"619","span":{"begin":8058,"end":8066},"obj":"Species"},{"id":"620","span":{"begin":8071,"end":8079},"obj":"Species"},{"id":"621","span":{"begin":8081,"end":8090},"obj":"Species"},{"id":"622","span":{"begin":8133,"end":8142},"obj":"Species"},{"id":"623","span":{"begin":8176,"end":8184},"obj":"Species"},{"id":"624","span":{"begin":8189,"end":8197},"obj":"Species"},{"id":"625","span":{"begin":8216,"end":8224},"obj":"Species"},{"id":"626","span":{"begin":8226,"end":8235},"obj":"Species"},{"id":"627","span":{"begin":8345,"end":8354},"obj":"Species"},{"id":"628","span":{"begin":7739,"end":7742},"obj":"Species"},{"id":"629","span":{"begin":7744,"end":7753},"obj":"Species"}],"attributes":[{"id":"A384","pred":"tao:has_database_id","subj":"384","obj":"Tax:9606"},{"id":"A385","pred":"tao:has_database_id","subj":"385","obj":"Tax:11118"},{"id":"A411","pred":"tao:has_database_id","subj":"411","obj":"Gene:43740578"},{"id":"A412","pred":"tao:has_database_id","subj":"412","obj":"Gene:5499"},{"id":"A413","pred":"tao:has_database_id","subj":"413","obj":"Gene:10045"},{"id":"A414","pred":"tao:has_database_id","subj":"414","obj":"Gene:92521"},{"id":"A415","pred":"tao:has_database_id","subj":"415","obj":"Gene:3200429"},{"id":"A416","pred":"tao:has_database_id","subj":"416","obj":"Gene:5499"},{"id":"A417","pred":"tao:has_database_id","subj":"417","obj":"Gene:5499"},{"id":"A418","pred":"tao:has_database_id","subj":"418","obj":"Gene:14254602"},{"id":"A419","pred":"tao:has_database_id","subj":"419","obj":"Gene:14254602"},{"id":"A420","pred":"tao:has_database_id","subj":"420","obj":"Tax:9606"},{"id":"A421","pred":"tao:has_database_id","subj":"421","obj":"Tax:11118"},{"id":"A422","pred":"tao:has_database_id","subj":"422","obj":"Tax:11118"},{"id":"A423","pred":"tao:has_database_id","subj":"423","obj":"Tax:9606"},{"id":"A424","pred":"tao:has_database_id","subj":"424","obj":"Tax:11118"},{"id":"A425","pred":"tao:has_database_id","subj":"425","obj":"Tax:11118"},{"id":"A426","pred":"tao:has_database_id","subj":"426","obj":"Tax:31631"},{"id":"A427","pred":"tao:has_database_id","subj":"427","obj":"Tax:290028"},{"id":"A428","pred":"tao:has_database_id","subj":"428","obj":"Tax:9606"},{"id":"A429","pred":"tao:has_database_id","subj":"429","obj":"Tax:11118"},{"id":"A430","pred":"tao:has_database_id","subj":"430","obj":"Tax:694009"},{"id":"A431","pred":"tao:has_database_id","subj":"431","obj":"Tax:1335626"},{"id":"A432","pred":"tao:has_database_id","subj":"432","obj":"Tax:694009"},{"id":"A433","pred":"tao:has_database_id","subj":"433","obj":"Tax:1335626"},{"id":"A434","pred":"tao:has_database_id","subj":"434","obj":"Tax:694009"},{"id":"A435","pred":"tao:has_database_id","subj":"435","obj":"Tax:1335626"},{"id":"A449","pred":"tao:has_database_id","subj":"449","obj":"Gene:43740575"},{"id":"A450","pred":"tao:has_database_id","subj":"450","obj":"Tax:694009"},{"id":"A451","pred":"tao:has_database_id","subj":"451","obj":"Tax:1335626"},{"id":"A452","pred":"tao:has_database_id","subj":"452","obj":"Tax:2697049"},{"id":"A453","pred":"tao:has_database_id","subj":"453","obj":"Tax:694009"},{"id":"A454","pred":"tao:has_database_id","subj":"454","obj":"Tax:1335626"},{"id":"A455","pred":"tao:has_database_id","subj":"455","obj":"Tax:2697049"},{"id":"A456","pred":"tao:has_database_id","subj":"456","obj":"Tax:694009"},{"id":"A457","pred":"tao:has_database_id","subj":"457","obj":"Tax:1335626"},{"id":"A458","pred":"tao:has_database_id","subj":"458","obj":"Tax:2697049"},{"id":"A459","pred":"tao:has_database_id","subj":"459","obj":"Tax:694009"},{"id":"A460","pred":"tao:has_database_id","subj":"460","obj":"Tax:1335626"},{"id":"A461","pred":"tao:has_database_id","subj":"461","obj":"Tax:2697049"},{"id":"A476","pred":"tao:has_database_id","subj":"476","obj":"Tax:9606"},{"id":"A477","pred":"tao:has_database_id","subj":"477","obj":"Tax:11118"},{"id":"A478","pred":"tao:has_database_id","subj":"478","obj":"Tax:31631"},{"id":"A479","pred":"tao:has_database_id","subj":"479","obj":"Tax:9606"},{"id":"A480","pred":"tao:has_database_id","subj":"480","obj":"Tax:11118"},{"id":"A481","pred":"tao:has_database_id","subj":"481","obj":"Tax:9606"},{"id":"A482","pred":"tao:has_database_id","subj":"482","obj":"Tax:11118"},{"id":"A483","pred":"tao:has_database_id","subj":"483","obj":"Tax:11120"},{"id":"A484","pred":"tao:has_database_id","subj":"484","obj":"MESH:D000073893"},{"id":"A485","pred":"tao:has_database_id","subj":"485","obj":"MESH:D001102"},{"id":"A486","pred":"tao:has_database_id","subj":"486","obj":"MESH:D001102"},{"id":"A489","pred":"tao:has_database_id","subj":"489","obj":"MESH:D007239"},{"id":"A507","pred":"tao:has_database_id","subj":"507","obj":"Gene:59272"},{"id":"A508","pred":"tao:has_database_id","subj":"508","obj":"Gene:59272"},{"id":"A509","pred":"tao:has_database_id","subj":"509","obj":"Gene:59272"},{"id":"A510","pred":"tao:has_database_id","subj":"510","obj":"Gene:59272"},{"id":"A511","pred":"tao:has_database_id","subj":"511","obj":"Gene:70008"},{"id":"A512","pred":"tao:has_database_id","subj":"512","obj":"Gene:59272"},{"id":"A513","pred":"tao:has_database_id","subj":"513","obj":"Gene:59272"},{"id":"A514","pred":"tao:has_database_id","subj":"514","obj":"Tax:277944"},{"id":"A515","pred":"tao:has_database_id","subj":"515","obj":"Tax:694009"},{"id":"A516","pred":"tao:has_database_id","subj":"516","obj":"Tax:9606"},{"id":"A517","pred":"tao:has_database_id","subj":"517","obj":"Tax:694009"},{"id":"A518","pred":"tao:has_database_id","subj":"518","obj":"Tax:10090"},{"id":"A519","pred":"tao:has_database_id","subj":"519","obj":"Tax:694009"},{"id":"A520","pred":"tao:has_database_id","subj":"520","obj":"Tax:11118"},{"id":"A521","pred":"tao:has_database_id","subj":"521","obj":"Tax:11118"},{"id":"A522","pred":"tao:has_database_id","subj":"522","obj":"Tax:9606"},{"id":"A523","pred":"tao:has_database_id","subj":"523","obj":"Tax:694009"},{"id":"A555","pred":"tao:has_database_id","subj":"555","obj":"Gene:1803"},{"id":"A556","pred":"tao:has_database_id","subj":"556","obj":"Gene:1803"},{"id":"A557","pred":"tao:has_database_id","subj":"557","obj":"Gene:59272"},{"id":"A558","pred":"tao:has_database_id","subj":"558","obj":"Gene:1803"},{"id":"A559","pred":"tao:has_database_id","subj":"559","obj":"Gene:1803"},{"id":"A560","pred":"tao:has_database_id","subj":"560","obj":"Gene:1803"},{"id":"A561","pred":"tao:has_database_id","subj":"561","obj":"Gene:1803"},{"id":"A562","pred":"tao:has_database_id","subj":"562","obj":"Tax:1335626"},{"id":"A563","pred":"tao:has_database_id","subj":"563","obj":"Tax:694009"},{"id":"A564","pred":"tao:has_database_id","subj":"564","obj":"Tax:1335626"},{"id":"A565","pred":"tao:has_database_id","subj":"565","obj":"Tax:1335626"},{"id":"A566","pred":"tao:has_database_id","subj":"566","obj":"Tax:694009"},{"id":"A567","pred":"tao:has_database_id","subj":"567","obj":"Tax:694009"},{"id":"A568","pred":"tao:has_database_id","subj":"568","obj":"Tax:1335626"},{"id":"A569","pred":"tao:has_database_id","subj":"569","obj":"Tax:1335626"},{"id":"A570","pred":"tao:has_database_id","subj":"570","obj":"Tax:694009"},{"id":"A571","pred":"tao:has_database_id","subj":"571","obj":"Tax:1335626"},{"id":"A572","pred":"tao:has_database_id","subj":"572","obj":"Tax:1335626"},{"id":"A573","pred":"tao:has_database_id","subj":"573","obj":"Tax:9606"},{"id":"A574","pred":"tao:has_database_id","subj":"574","obj":"Tax:11118"},{"id":"A575","pred":"tao:has_database_id","subj":"575","obj":"Tax:11118"},{"id":"A576","pred":"tao:has_database_id","subj":"576","obj":"Tax:9606"},{"id":"A577","pred":"tao:has_database_id","subj":"577","obj":"Tax:11118"},{"id":"A578","pred":"tao:has_database_id","subj":"578","obj":"Tax:9606"},{"id":"A579","pred":"tao:has_database_id","subj":"579","obj":"Tax:1335626"},{"id":"A580","pred":"tao:has_database_id","subj":"580","obj":"Tax:694007"},{"id":"A581","pred":"tao:has_database_id","subj":"581","obj":"Tax:290028"},{"id":"A582","pred":"tao:has_database_id","subj":"582","obj":"Tax:694007"},{"id":"A583","pred":"tao:has_database_id","subj":"583","obj":"Tax:290028"},{"id":"A584","pred":"tao:has_database_id","subj":"584","obj":"Tax:694009"},{"id":"A608","pred":"tao:has_database_id","subj":"608","obj":"Gene:14254602"},{"id":"A609","pred":"tao:has_database_id","subj":"609","obj":"Gene:43740578"},{"id":"A610","pred":"tao:has_database_id","subj":"610","obj":"Gene:43740575"},{"id":"A611","pred":"tao:has_database_id","subj":"611","obj":"Gene:59272"},{"id":"A612","pred":"tao:has_database_id","subj":"612","obj":"Gene:43740570"},{"id":"A613","pred":"tao:has_database_id","subj":"613","obj":"Gene:43740571"},{"id":"A614","pred":"tao:has_database_id","subj":"614","obj":"Gene:43740568"},{"id":"A615","pred":"tao:has_database_id","subj":"615","obj":"Tax:9606"},{"id":"A616","pred":"tao:has_database_id","subj":"616","obj":"Tax:694009"},{"id":"A617","pred":"tao:has_database_id","subj":"617","obj":"Tax:694009"},{"id":"A618","pred":"tao:has_database_id","subj":"618","obj":"Tax:2697049"},{"id":"A619","pred":"tao:has_database_id","subj":"619","obj":"Tax:694009"},{"id":"A620","pred":"tao:has_database_id","subj":"620","obj":"Tax:1335626"},{"id":"A621","pred":"tao:has_database_id","subj":"621","obj":"Tax:2697049"},{"id":"A622","pred":"tao:has_database_id","subj":"622","obj":"Tax:2697049"},{"id":"A623","pred":"tao:has_database_id","subj":"623","obj":"Tax:694009"},{"id":"A624","pred":"tao:has_database_id","subj":"624","obj":"Tax:1335626"},{"id":"A625","pred":"tao:has_database_id","subj":"625","obj":"Tax:694009"},{"id":"A626","pred":"tao:has_database_id","subj":"626","obj":"Tax:2697049"},{"id":"A627","pred":"tao:has_database_id","subj":"627","obj":"Tax:2697049"},{"id":"A628","pred":"tao:has_database_id","subj":"628","obj":"Tax:11118"},{"id":"A629","pred":"tao:has_database_id","subj":"629","obj":"Tax:2697049"}],"namespaces":[{"prefix":"Tax","uri":"https://www.ncbi.nlm.nih.gov/taxonomy/"},{"prefix":"MESH","uri":"https://id.nlm.nih.gov/mesh/"},{"prefix":"Gene","uri":"https://www.ncbi.nlm.nih.gov/gene/"},{"prefix":"CVCL","uri":"https://web.expasy.org/cellosaurus/CVCL_"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}
LitCovid-PD-FMA-UBERON
{"project":"LitCovid-PD-FMA-UBERON","denotations":[{"id":"T16","span":{"begin":0,"end":6},"obj":"Body_part"},{"id":"T17","span":{"begin":87,"end":95},"obj":"Body_part"},{"id":"T18","span":{"begin":172,"end":175},"obj":"Body_part"},{"id":"T19","span":{"begin":176,"end":182},"obj":"Body_part"},{"id":"T20","span":{"begin":270,"end":276},"obj":"Body_part"},{"id":"T21","span":{"begin":315,"end":318},"obj":"Body_part"},{"id":"T22","span":{"begin":351,"end":357},"obj":"Body_part"},{"id":"T23","span":{"begin":491,"end":497},"obj":"Body_part"},{"id":"T24","span":{"begin":537,"end":543},"obj":"Body_part"},{"id":"T25","span":{"begin":594,"end":602},"obj":"Body_part"},{"id":"T26","span":{"begin":610,"end":614},"obj":"Body_part"},{"id":"T27","span":{"begin":751,"end":755},"obj":"Body_part"},{"id":"T28","span":{"begin":849,"end":857},"obj":"Body_part"},{"id":"T29","span":{"begin":1113,"end":1121},"obj":"Body_part"},{"id":"T30","span":{"begin":1152,"end":1160},"obj":"Body_part"},{"id":"T31","span":{"begin":1329,"end":1333},"obj":"Body_part"},{"id":"T32","span":{"begin":1364,"end":1368},"obj":"Body_part"},{"id":"T33","span":{"begin":1403,"end":1410},"obj":"Body_part"},{"id":"T34","span":{"begin":1541,"end":1547},"obj":"Body_part"},{"id":"T35","span":{"begin":1600,"end":1608},"obj":"Body_part"},{"id":"T36","span":{"begin":1647,"end":1655},"obj":"Body_part"},{"id":"T37","span":{"begin":1768,"end":1774},"obj":"Body_part"},{"id":"T38","span":{"begin":1841,"end":1849},"obj":"Body_part"},{"id":"T39","span":{"begin":1887,"end":1895},"obj":"Body_part"},{"id":"T40","span":{"begin":1972,"end":1975},"obj":"Body_part"},{"id":"T41","span":{"begin":2041,"end":2048},"obj":"Body_part"},{"id":"T42","span":{"begin":2072,"end":2080},"obj":"Body_part"},{"id":"T43","span":{"begin":2284,"end":2288},"obj":"Body_part"},{"id":"T44","span":{"begin":2556,"end":2564},"obj":"Body_part"},{"id":"T45","span":{"begin":2635,"end":2643},"obj":"Body_part"},{"id":"T46","span":{"begin":2671,"end":2679},"obj":"Body_part"},{"id":"T47","span":{"begin":2728,"end":2736},"obj":"Body_part"},{"id":"T48","span":{"begin":3098,"end":3106},"obj":"Body_part"},{"id":"T49","span":{"begin":3116,"end":3124},"obj":"Body_part"},{"id":"T50","span":{"begin":3238,"end":3241},"obj":"Body_part"},{"id":"T51","span":{"begin":3313,"end":3321},"obj":"Body_part"},{"id":"T52","span":{"begin":3341,"end":3346},"obj":"Body_part"},{"id":"T53","span":{"begin":3393,"end":3406},"obj":"Body_part"},{"id":"T54","span":{"begin":3469,"end":3476},"obj":"Body_part"},{"id":"T55","span":{"begin":3517,"end":3522},"obj":"Body_part"},{"id":"T56","span":{"begin":3655,"end":3663},"obj":"Body_part"},{"id":"T57","span":{"begin":3790,"end":3798},"obj":"Body_part"},{"id":"T58","span":{"begin":3806,"end":3813},"obj":"Body_part"},{"id":"T59","span":{"begin":3859,"end":3864},"obj":"Body_part"},{"id":"T60","span":{"begin":3965,"end":3972},"obj":"Body_part"},{"id":"T61","span":{"begin":3992,"end":3999},"obj":"Body_part"},{"id":"T62","span":{"begin":4162,"end":4166},"obj":"Body_part"},{"id":"T63","span":{"begin":4405,"end":4410},"obj":"Body_part"},{"id":"T64","span":{"begin":4535,"end":4542},"obj":"Body_part"},{"id":"T65","span":{"begin":4840,"end":4857},"obj":"Body_part"},{"id":"T66","span":{"begin":4858,"end":4866},"obj":"Body_part"},{"id":"T67","span":{"begin":4917,"end":4922},"obj":"Body_part"},{"id":"T68","span":{"begin":4930,"end":4942},"obj":"Body_part"},{"id":"T69","span":{"begin":4930,"end":4934},"obj":"Body_part"},{"id":"T70","span":{"begin":5028,"end":5035},"obj":"Body_part"},{"id":"T71","span":{"begin":5225,"end":5229},"obj":"Body_part"},{"id":"T72","span":{"begin":5472,"end":5476},"obj":"Body_part"},{"id":"T73","span":{"begin":5580,"end":5587},"obj":"Body_part"},{"id":"T74","span":{"begin":5766,"end":5773},"obj":"Body_part"},{"id":"T75","span":{"begin":5793,"end":5797},"obj":"Body_part"},{"id":"T76","span":{"begin":6096,"end":6101},"obj":"Body_part"},{"id":"T77","span":{"begin":7387,"end":7392},"obj":"Body_part"},{"id":"T78","span":{"begin":7585,"end":7592},"obj":"Body_part"},{"id":"T79","span":{"begin":7879,"end":7885},"obj":"Body_part"},{"id":"T80","span":{"begin":7956,"end":7964},"obj":"Body_part"},{"id":"T81","span":{"begin":8001,"end":8009},"obj":"Body_part"},{"id":"T82","span":{"begin":8113,"end":8117},"obj":"Body_part"},{"id":"T83","span":{"begin":8290,"end":8295},"obj":"Body_part"}],"attributes":[{"id":"A16","pred":"fma_id","subj":"T16","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A17","pred":"fma_id","subj":"T17","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A18","pred":"fma_id","subj":"T18","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A19","pred":"fma_id","subj":"T19","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A20","pred":"fma_id","subj":"T20","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A21","pred":"fma_id","subj":"T21","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A22","pred":"fma_id","subj":"T22","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A23","pred":"fma_id","subj":"T23","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A24","pred":"fma_id","subj":"T24","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A25","pred":"fma_id","subj":"T25","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A26","pred":"fma_id","subj":"T26","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A27","pred":"fma_id","subj":"T27","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A28","pred":"fma_id","subj":"T28","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A29","pred":"fma_id","subj":"T29","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A30","pred":"fma_id","subj":"T30","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A31","pred":"fma_id","subj":"T31","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A32","pred":"fma_id","subj":"T32","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A33","pred":"fma_id","subj":"T33","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A34","pred":"fma_id","subj":"T34","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A35","pred":"fma_id","subj":"T35","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A36","pred":"fma_id","subj":"T36","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A37","pred":"fma_id","subj":"T37","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A38","pred":"fma_id","subj":"T38","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A39","pred":"fma_id","subj":"T39","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A40","pred":"fma_id","subj":"T40","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A41","pred":"fma_id","subj":"T41","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A42","pred":"fma_id","subj":"T42","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A43","pred":"fma_id","subj":"T43","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A44","pred":"fma_id","subj":"T44","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A45","pred":"fma_id","subj":"T45","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A46","pred":"fma_id","subj":"T46","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A47","pred":"fma_id","subj":"T47","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A48","pred":"fma_id","subj":"T48","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A49","pred":"fma_id","subj":"T49","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A50","pred":"fma_id","subj":"T50","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A51","pred":"fma_id","subj":"T51","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A52","pred":"fma_id","subj":"T52","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A53","pred":"fma_id","subj":"T53","obj":"http://purl.org/sig/ont/fma/fma9825"},{"id":"A54","pred":"fma_id","subj":"T54","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A55","pred":"fma_id","subj":"T55","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A56","pred":"fma_id","subj":"T56","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A57","pred":"fma_id","subj":"T57","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A58","pred":"fma_id","subj":"T58","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A59","pred":"fma_id","subj":"T59","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A60","pred":"fma_id","subj":"T60","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A61","pred":"fma_id","subj":"T61","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A62","pred":"fma_id","subj":"T62","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A63","pred":"fma_id","subj":"T63","obj":"http://purl.org/sig/ont/fma/fma82737"},{"id":"A64","pred":"fma_id","subj":"T64","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A65","pred":"fma_id","subj":"T65","obj":"http://purl.org/sig/ont/fma/fma63841"},{"id":"A66","pred":"fma_id","subj":"T66","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A67","pred":"fma_id","subj":"T67","obj":"http://purl.org/sig/ont/fma/fma82737"},{"id":"A68","pred":"fma_id","subj":"T68","obj":"http://purl.org/sig/ont/fma/fma67653"},{"id":"A69","pred":"fma_id","subj":"T69","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A70","pred":"fma_id","subj":"T70","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A71","pred":"fma_id","subj":"T71","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A72","pred":"fma_id","subj":"T72","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A73","pred":"fma_id","subj":"T73","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A74","pred":"fma_id","subj":"T74","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A75","pred":"fma_id","subj":"T75","obj":"http://purl.org/sig/ont/fma/fma24920"},{"id":"A76","pred":"fma_id","subj":"T76","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A77","pred":"fma_id","subj":"T77","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A78","pred":"fma_id","subj":"T78","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A79","pred":"fma_id","subj":"T79","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A80","pred":"fma_id","subj":"T80","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A81","pred":"fma_id","subj":"T81","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A82","pred":"fma_id","subj":"T82","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A83","pred":"fma_id","subj":"T83","obj":"http://purl.org/sig/ont/fma/fma68646"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}
LitCovid-PD-UBERON
{"project":"LitCovid-PD-UBERON","denotations":[{"id":"T10","span":{"begin":3393,"end":3406},"obj":"Body_part"}],"attributes":[{"id":"A10","pred":"uberon_id","subj":"T10","obj":"http://purl.obolibrary.org/obo/UBERON_0002405"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}
LitCovid_AGAC
{"project":"LitCovid_AGAC","denotations":[{"id":"p62853s18","span":{"begin":3411,"end":3419},"obj":"PosReg"},{"id":"p62853s20","span":{"begin":3426,"end":3435},"obj":"MPA"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}
LitCovid-PD-MONDO
{"project":"LitCovid-PD-MONDO","denotations":[{"id":"T40","span":{"begin":1513,"end":1521},"obj":"Disease"},{"id":"T41","span":{"begin":1979,"end":1987},"obj":"Disease"},{"id":"T42","span":{"begin":2236,"end":2244},"obj":"Disease"},{"id":"T43","span":{"begin":2362,"end":2370},"obj":"Disease"},{"id":"T44","span":{"begin":2446,"end":2454},"obj":"Disease"},{"id":"T45","span":{"begin":2787,"end":2795},"obj":"Disease"},{"id":"T46","span":{"begin":3039,"end":3047},"obj":"Disease"},{"id":"T47","span":{"begin":3157,"end":3172},"obj":"Disease"},{"id":"T48","span":{"begin":3163,"end":3172},"obj":"Disease"},{"id":"T49","span":{"begin":3569,"end":3579},"obj":"Disease"},{"id":"T50","span":{"begin":3906,"end":3921},"obj":"Disease"},{"id":"T51","span":{"begin":3912,"end":3921},"obj":"Disease"},{"id":"T52","span":{"begin":4344,"end":4347},"obj":"Disease"},{"id":"T54","span":{"begin":4957,"end":4966},"obj":"Disease"},{"id":"T55","span":{"begin":5329,"end":5337},"obj":"Disease"},{"id":"T56","span":{"begin":5755,"end":5763},"obj":"Disease"},{"id":"T57","span":{"begin":5931,"end":5939},"obj":"Disease"},{"id":"T58","span":{"begin":6155,"end":6163},"obj":"Disease"},{"id":"T59","span":{"begin":6325,"end":6333},"obj":"Disease"},{"id":"T60","span":{"begin":6386,"end":6394},"obj":"Disease"},{"id":"T61","span":{"begin":6639,"end":6647},"obj":"Disease"},{"id":"T62","span":{"begin":6692,"end":6700},"obj":"Disease"},{"id":"T63","span":{"begin":6909,"end":6917},"obj":"Disease"},{"id":"T64","span":{"begin":7845,"end":7853},"obj":"Disease"},{"id":"T65","span":{"begin":8058,"end":8066},"obj":"Disease"},{"id":"T66","span":{"begin":8176,"end":8184},"obj":"Disease"},{"id":"T67","span":{"begin":8216,"end":8224},"obj":"Disease"}],"attributes":[{"id":"A40","pred":"mondo_id","subj":"T40","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A41","pred":"mondo_id","subj":"T41","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A42","pred":"mondo_id","subj":"T42","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A43","pred":"mondo_id","subj":"T43","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A44","pred":"mondo_id","subj":"T44","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A45","pred":"mondo_id","subj":"T45","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A46","pred":"mondo_id","subj":"T46","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A47","pred":"mondo_id","subj":"T47","obj":"http://purl.obolibrary.org/obo/MONDO_0005108"},{"id":"A48","pred":"mondo_id","subj":"T48","obj":"http://purl.obolibrary.org/obo/MONDO_0005550"},{"id":"A49","pred":"mondo_id","subj":"T49","obj":"http://purl.obolibrary.org/obo/MONDO_0005550"},{"id":"A50","pred":"mondo_id","subj":"T50","obj":"http://purl.obolibrary.org/obo/MONDO_0005108"},{"id":"A51","pred":"mondo_id","subj":"T51","obj":"http://purl.obolibrary.org/obo/MONDO_0005550"},{"id":"A52","pred":"mondo_id","subj":"T52","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A53","pred":"mondo_id","subj":"T52","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A54","pred":"mondo_id","subj":"T54","obj":"http://purl.obolibrary.org/obo/MONDO_0005550"},{"id":"A55","pred":"mondo_id","subj":"T55","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A56","pred":"mondo_id","subj":"T56","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A57","pred":"mondo_id","subj":"T57","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A58","pred":"mondo_id","subj":"T58","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A59","pred":"mondo_id","subj":"T59","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A60","pred":"mondo_id","subj":"T60","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A61","pred":"mondo_id","subj":"T61","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A62","pred":"mondo_id","subj":"T62","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A63","pred":"mondo_id","subj":"T63","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A64","pred":"mondo_id","subj":"T64","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A65","pred":"mondo_id","subj":"T65","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A66","pred":"mondo_id","subj":"T66","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A67","pred":"mondo_id","subj":"T67","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}
LitCovid-PD-CLO
{"project":"LitCovid-PD-CLO","denotations":[{"id":"T89","span":{"begin":19,"end":24},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T90","span":{"begin":100,"end":105},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T91","span":{"begin":125,"end":132},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T92","span":{"begin":138,"end":139},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T93","span":{"begin":233,"end":238},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T94","span":{"begin":278,"end":280},"obj":"http://purl.obolibrary.org/obo/CLO_0050509"},{"id":"T95","span":{"begin":300,"end":302},"obj":"http://purl.obolibrary.org/obo/CLO_0007074"},{"id":"T96","span":{"begin":300,"end":302},"obj":"http://purl.obolibrary.org/obo/CLO_0051988"},{"id":"T97","span":{"begin":319,"end":326},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T98","span":{"begin":358,"end":370},"obj":"http://purl.obolibrary.org/obo/OBI_0000245"},{"id":"T99","span":{"begin":522,"end":523},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T100","span":{"begin":610,"end":614},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T101","span":{"begin":649,"end":657},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T102","span":{"begin":739,"end":740},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T103","span":{"begin":751,"end":755},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T104","span":{"begin":1077,"end":1082},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T105","span":{"begin":1127,"end":1132},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T106","span":{"begin":1329,"end":1333},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T107","span":{"begin":1364,"end":1368},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T108","span":{"begin":1501,"end":1506},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T109","span":{"begin":1532,"end":1533},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T110","span":{"begin":1538,"end":1540},"obj":"http://purl.obolibrary.org/obo/CLO_0007074"},{"id":"T111","span":{"begin":1538,"end":1540},"obj":"http://purl.obolibrary.org/obo/CLO_0051988"},{"id":"T112","span":{"begin":1707,"end":1709},"obj":"http://purl.obolibrary.org/obo/CLO_0001236"},{"id":"T113","span":{"begin":1787,"end":1789},"obj":"http://purl.obolibrary.org/obo/CLO_0007074"},{"id":"T114","span":{"begin":1787,"end":1789},"obj":"http://purl.obolibrary.org/obo/CLO_0051988"},{"id":"T115","span":{"begin":1927,"end":1929},"obj":"http://purl.obolibrary.org/obo/CLO_0001236"},{"id":"T116","span":{"begin":2060,"end":2064},"obj":"http://purl.obolibrary.org/obo/CLO_0007653"},{"id":"T117","span":{"begin":2145,"end":2150},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T118","span":{"begin":2284,"end":2288},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T119","span":{"begin":2398,"end":2399},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T120","span":{"begin":2430,"end":2442},"obj":"http://purl.obolibrary.org/obo/OBI_0000245"},{"id":"T121","span":{"begin":2601,"end":2609},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T122","span":{"begin":2839,"end":2844},"obj":"http://purl.obolibrary.org/obo/CLO_0053817"},{"id":"T123","span":{"begin":2983,"end":2985},"obj":"http://purl.obolibrary.org/obo/CLO_0007074"},{"id":"T124","span":{"begin":2983,"end":2985},"obj":"http://purl.obolibrary.org/obo/CLO_0051988"},{"id":"T125","span":{"begin":3003,"end":3004},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T126","span":{"begin":3128,"end":3133},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T127","span":{"begin":3341,"end":3346},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T128","span":{"begin":3372,"end":3379},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T129","span":{"begin":3393,"end":3406},"obj":"http://purl.obolibrary.org/obo/UBERON_0002405"},{"id":"T130","span":{"begin":3506,"end":3511},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T131","span":{"begin":3517,"end":3522},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T132","span":{"begin":3537,"end":3538},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T133","span":{"begin":3684,"end":3689},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T134","span":{"begin":3817,"end":3822},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T135","span":{"begin":3859,"end":3864},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T136","span":{"begin":3880,"end":3888},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T137","span":{"begin":3976,"end":3977},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T138","span":{"begin":4070,"end":4072},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T139","span":{"begin":4085,"end":4086},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T140","span":{"begin":4105,"end":4107},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T141","span":{"begin":4105,"end":4107},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T142","span":{"begin":4121,"end":4123},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T143","span":{"begin":4151,"end":4156},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T144","span":{"begin":4162,"end":4166},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T145","span":{"begin":4197,"end":4199},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T146","span":{"begin":4197,"end":4199},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T147","span":{"begin":4223,"end":4228},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T148","span":{"begin":4234,"end":4242},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T149","span":{"begin":4251,"end":4253},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T150","span":{"begin":4251,"end":4253},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T151","span":{"begin":4291,"end":4293},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T152","span":{"begin":4353,"end":4354},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T153","span":{"begin":4420,"end":4422},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T154","span":{"begin":4420,"end":4422},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T155","span":{"begin":4430,"end":4432},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T156","span":{"begin":4430,"end":4432},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T157","span":{"begin":4649,"end":4651},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T158","span":{"begin":4785,"end":4790},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T159","span":{"begin":4849,"end":4857},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T160","span":{"begin":4883,"end":4885},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T161","span":{"begin":4899,"end":4900},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T162","span":{"begin":4930,"end":4934},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T163","span":{"begin":4968,"end":4970},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T164","span":{"begin":4968,"end":4970},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T165","span":{"begin":5082,"end":5090},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T166","span":{"begin":5110,"end":5112},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T167","span":{"begin":5110,"end":5112},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T168","span":{"begin":5121,"end":5123},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T169","span":{"begin":5121,"end":5123},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T170","span":{"begin":5225,"end":5229},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T171","span":{"begin":5318,"end":5320},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T172","span":{"begin":5318,"end":5320},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T173","span":{"begin":5391,"end":5393},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T174","span":{"begin":5472,"end":5476},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T175","span":{"begin":5478,"end":5480},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T176","span":{"begin":5478,"end":5480},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T177","span":{"begin":5544,"end":5547},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T178","span":{"begin":5743,"end":5748},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T179","span":{"begin":5808,"end":5813},"obj":"http://purl.obolibrary.org/obo/CLO_0007836"},{"id":"T180","span":{"begin":5821,"end":5823},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T181","span":{"begin":5821,"end":5823},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T182","span":{"begin":5838,"end":5840},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T183","span":{"begin":5838,"end":5840},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T184","span":{"begin":5873,"end":5875},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T185","span":{"begin":5941,"end":5943},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T186","span":{"begin":5941,"end":5943},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T187","span":{"begin":5959,"end":5961},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T188","span":{"begin":5959,"end":5961},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T189","span":{"begin":5984,"end":5987},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T190","span":{"begin":6048,"end":6053},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T191","span":{"begin":6096,"end":6101},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T192","span":{"begin":6248,"end":6249},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T193","span":{"begin":6271,"end":6272},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T194","span":{"begin":6450,"end":6458},"obj":"http://purl.obolibrary.org/obo/CLO_0001658"},{"id":"T195","span":{"begin":6552,"end":6553},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T196","span":{"begin":6834,"end":6836},"obj":"http://purl.obolibrary.org/obo/CLO_0001022"},{"id":"T197","span":{"begin":6834,"end":6836},"obj":"http://purl.obolibrary.org/obo/CLO_0007314"},{"id":"T198","span":{"begin":6878,"end":6879},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T199","span":{"begin":7226,"end":7231},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T200","span":{"begin":7240,"end":7241},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T201","span":{"begin":7345,"end":7349},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T202","span":{"begin":7365,"end":7370},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T203","span":{"begin":7387,"end":7392},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T204","span":{"begin":7406,"end":7410},"obj":"http://purl.obolibrary.org/obo/CLO_0001185"},{"id":"T205","span":{"begin":7500,"end":7505},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T206","span":{"begin":7733,"end":7738},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T207","span":{"begin":8011,"end":8016},"obj":"http://purl.obolibrary.org/obo/CLO_0053817"},{"id":"T208","span":{"begin":8045,"end":8047},"obj":"http://purl.obolibrary.org/obo/CLO_0001236"},{"id":"T209","span":{"begin":8113,"end":8117},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T210","span":{"begin":8290,"end":8295},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T211","span":{"begin":8402,"end":8405},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}
LitCovid-PD-CHEBI
{"project":"LitCovid-PD-CHEBI","denotations":[{"id":"T23","span":{"begin":594,"end":602},"obj":"Chemical"},{"id":"T24","span":{"begin":849,"end":857},"obj":"Chemical"},{"id":"T25","span":{"begin":1113,"end":1121},"obj":"Chemical"},{"id":"T26","span":{"begin":1152,"end":1160},"obj":"Chemical"},{"id":"T27","span":{"begin":1249,"end":1253},"obj":"Chemical"},{"id":"T28","span":{"begin":1403,"end":1410},"obj":"Chemical"},{"id":"T29","span":{"begin":1600,"end":1608},"obj":"Chemical"},{"id":"T30","span":{"begin":1647,"end":1655},"obj":"Chemical"},{"id":"T31","span":{"begin":1841,"end":1849},"obj":"Chemical"},{"id":"T32","span":{"begin":1887,"end":1895},"obj":"Chemical"},{"id":"T33","span":{"begin":2041,"end":2048},"obj":"Chemical"},{"id":"T34","span":{"begin":2072,"end":2080},"obj":"Chemical"},{"id":"T35","span":{"begin":2556,"end":2564},"obj":"Chemical"},{"id":"T36","span":{"begin":2635,"end":2643},"obj":"Chemical"},{"id":"T37","span":{"begin":2671,"end":2679},"obj":"Chemical"},{"id":"T38","span":{"begin":2728,"end":2736},"obj":"Chemical"},{"id":"T39","span":{"begin":3098,"end":3106},"obj":"Chemical"},{"id":"T40","span":{"begin":3116,"end":3124},"obj":"Chemical"},{"id":"T41","span":{"begin":3313,"end":3321},"obj":"Chemical"},{"id":"T42","span":{"begin":3469,"end":3476},"obj":"Chemical"},{"id":"T43","span":{"begin":3655,"end":3663},"obj":"Chemical"},{"id":"T44","span":{"begin":3790,"end":3798},"obj":"Chemical"},{"id":"T45","span":{"begin":3806,"end":3813},"obj":"Chemical"},{"id":"T46","span":{"begin":3941,"end":3943},"obj":"Chemical"},{"id":"T47","span":{"begin":3965,"end":3972},"obj":"Chemical"},{"id":"T48","span":{"begin":3992,"end":3999},"obj":"Chemical"},{"id":"T49","span":{"begin":4055,"end":4060},"obj":"Chemical"},{"id":"T50","span":{"begin":4105,"end":4107},"obj":"Chemical"},{"id":"T51","span":{"begin":4197,"end":4199},"obj":"Chemical"},{"id":"T52","span":{"begin":4251,"end":4253},"obj":"Chemical"},{"id":"T53","span":{"begin":4269,"end":4271},"obj":"Chemical"},{"id":"T54","span":{"begin":4420,"end":4422},"obj":"Chemical"},{"id":"T55","span":{"begin":4430,"end":4432},"obj":"Chemical"},{"id":"T56","span":{"begin":4535,"end":4542},"obj":"Chemical"},{"id":"T57","span":{"begin":4606,"end":4608},"obj":"Chemical"},{"id":"T58","span":{"begin":4732,"end":4734},"obj":"Chemical"},{"id":"T59","span":{"begin":4858,"end":4866},"obj":"Chemical"},{"id":"T60","span":{"begin":4968,"end":4970},"obj":"Chemical"},{"id":"T61","span":{"begin":5028,"end":5035},"obj":"Chemical"},{"id":"T62","span":{"begin":5110,"end":5112},"obj":"Chemical"},{"id":"T63","span":{"begin":5121,"end":5123},"obj":"Chemical"},{"id":"T64","span":{"begin":5318,"end":5320},"obj":"Chemical"},{"id":"T65","span":{"begin":5403,"end":5414},"obj":"Chemical"},{"id":"T66","span":{"begin":5478,"end":5480},"obj":"Chemical"},{"id":"T67","span":{"begin":5580,"end":5587},"obj":"Chemical"},{"id":"T68","span":{"begin":5766,"end":5773},"obj":"Chemical"},{"id":"T69","span":{"begin":5821,"end":5823},"obj":"Chemical"},{"id":"T70","span":{"begin":5838,"end":5840},"obj":"Chemical"},{"id":"T71","span":{"begin":5941,"end":5943},"obj":"Chemical"},{"id":"T72","span":{"begin":5959,"end":5961},"obj":"Chemical"},{"id":"T73","span":{"begin":6103,"end":6105},"obj":"Chemical"},{"id":"T74","span":{"begin":6834,"end":6836},"obj":"Chemical"},{"id":"T75","span":{"begin":7131,"end":7133},"obj":"Chemical"},{"id":"T76","span":{"begin":7200,"end":7202},"obj":"Chemical"},{"id":"T77","span":{"begin":7394,"end":7397},"obj":"Chemical"},{"id":"T78","span":{"begin":7585,"end":7592},"obj":"Chemical"},{"id":"T79","span":{"begin":7956,"end":7964},"obj":"Chemical"},{"id":"T80","span":{"begin":8001,"end":8009},"obj":"Chemical"}],"attributes":[{"id":"A23","pred":"chebi_id","subj":"T23","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A24","pred":"chebi_id","subj":"T24","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A25","pred":"chebi_id","subj":"T25","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A26","pred":"chebi_id","subj":"T26","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A27","pred":"chebi_id","subj":"T27","obj":"http://purl.obolibrary.org/obo/CHEBI_10545"},{"id":"A28","pred":"chebi_id","subj":"T28","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A29","pred":"chebi_id","subj":"T29","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A30","pred":"chebi_id","subj":"T30","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A31","pred":"chebi_id","subj":"T31","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A32","pred":"chebi_id","subj":"T32","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A33","pred":"chebi_id","subj":"T33","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A34","pred":"chebi_id","subj":"T34","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A35","pred":"chebi_id","subj":"T35","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A36","pred":"chebi_id","subj":"T36","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A37","pred":"chebi_id","subj":"T37","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A38","pred":"chebi_id","subj":"T38","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A39","pred":"chebi_id","subj":"T39","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A40","pred":"chebi_id","subj":"T40","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A41","pred":"chebi_id","subj":"T41","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A42","pred":"chebi_id","subj":"T42","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A43","pred":"chebi_id","subj":"T43","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A44","pred":"chebi_id","subj":"T44","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A45","pred":"chebi_id","subj":"T45","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A46","pred":"chebi_id","subj":"T46","obj":"http://purl.obolibrary.org/obo/CHEBI_33382"},{"id":"A47","pred":"chebi_id","subj":"T47","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A48","pred":"chebi_id","subj":"T48","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A49","pred":"chebi_id","subj":"T49","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A50","pred":"chebi_id","subj":"T50","obj":"http://purl.obolibrary.org/obo/CHEBI_29387"},{"id":"A51","pred":"chebi_id","subj":"T51","obj":"http://purl.obolibrary.org/obo/CHEBI_29387"},{"id":"A52","pred":"chebi_id","subj":"T52","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A53","pred":"chebi_id","subj":"T53","obj":"http://purl.obolibrary.org/obo/CHEBI_33382"},{"id":"A54","pred":"chebi_id","subj":"T54","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A55","pred":"chebi_id","subj":"T55","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A56","pred":"chebi_id","subj":"T56","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A57","pred":"chebi_id","subj":"T57","obj":"http://purl.obolibrary.org/obo/CHEBI_33382"},{"id":"A58","pred":"chebi_id","subj":"T58","obj":"http://purl.obolibrary.org/obo/CHEBI_33382"},{"id":"A59","pred":"chebi_id","subj":"T59","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A60","pred":"chebi_id","subj":"T60","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A61","pred":"chebi_id","subj":"T61","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A62","pred":"chebi_id","subj":"T62","obj":"http://purl.obolibrary.org/obo/CHEBI_29387"},{"id":"A63","pred":"chebi_id","subj":"T63","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A64","pred":"chebi_id","subj":"T64","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A65","pred":"chebi_id","subj":"T65","obj":"http://purl.obolibrary.org/obo/CHEBI_48433"},{"id":"A66","pred":"chebi_id","subj":"T66","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A67","pred":"chebi_id","subj":"T67","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A68","pred":"chebi_id","subj":"T68","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A69","pred":"chebi_id","subj":"T69","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A70","pred":"chebi_id","subj":"T70","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A71","pred":"chebi_id","subj":"T71","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A72","pred":"chebi_id","subj":"T72","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A73","pred":"chebi_id","subj":"T73","obj":"http://purl.obolibrary.org/obo/CHEBI_30441"},{"id":"A74","pred":"chebi_id","subj":"T74","obj":"http://purl.obolibrary.org/obo/CHEBI_30145"},{"id":"A75","pred":"chebi_id","subj":"T75","obj":"http://purl.obolibrary.org/obo/CHEBI_35780"},{"id":"A76","pred":"chebi_id","subj":"T76","obj":"http://purl.obolibrary.org/obo/CHEBI_35780"},{"id":"A77","pred":"chebi_id","subj":"T77","obj":"http://purl.obolibrary.org/obo/CHEBI_32359"},{"id":"A78","pred":"chebi_id","subj":"T78","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A79","pred":"chebi_id","subj":"T79","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A80","pred":"chebi_id","subj":"T80","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}
LitCovid-PD-GO-BP
{"project":"LitCovid-PD-GO-BP","denotations":[{"id":"T2","span":{"begin":896,"end":906},"obj":"http://purl.obolibrary.org/obo/GO_0004175"},{"id":"T3","span":{"begin":938,"end":948},"obj":"http://purl.obolibrary.org/obo/GO_0004175"},{"id":"T4","span":{"begin":3157,"end":3172},"obj":"http://purl.obolibrary.org/obo/GO_0016032"},{"id":"T5","span":{"begin":3180,"end":3192},"obj":"http://purl.obolibrary.org/obo/GO_0009405"},{"id":"T6","span":{"begin":3238,"end":3253},"obj":"http://purl.obolibrary.org/obo/GO_0039703"},{"id":"T7","span":{"begin":3261,"end":3274},"obj":"http://purl.obolibrary.org/obo/GO_0006351"},{"id":"T8","span":{"begin":3322,"end":3340},"obj":"http://purl.obolibrary.org/obo/GO_0051701"},{"id":"T9","span":{"begin":3426,"end":3435},"obj":"http://purl.obolibrary.org/obo/GO_0016032"},{"id":"T10","span":{"begin":3426,"end":3435},"obj":"http://purl.obolibrary.org/obo/GO_0009405"},{"id":"T11","span":{"begin":3684,"end":3698},"obj":"http://purl.obolibrary.org/obo/GO_0019068"},{"id":"T12","span":{"begin":3710,"end":3719},"obj":"http://purl.obolibrary.org/obo/GO_0016032"},{"id":"T13","span":{"begin":3710,"end":3719},"obj":"http://purl.obolibrary.org/obo/GO_0009405"},{"id":"T14","span":{"begin":3837,"end":3864},"obj":"http://purl.obolibrary.org/obo/GO_0046718"},{"id":"T15","span":{"begin":3843,"end":3858},"obj":"http://purl.obolibrary.org/obo/GO_0044409"},{"id":"T16","span":{"begin":3880,"end":3895},"obj":"http://purl.obolibrary.org/obo/GO_0061025"},{"id":"T17","span":{"begin":3906,"end":3921},"obj":"http://purl.obolibrary.org/obo/GO_0016032"},{"id":"T18","span":{"begin":4234,"end":4249},"obj":"http://purl.obolibrary.org/obo/GO_0061025"},{"id":"T19","span":{"begin":5082,"end":5097},"obj":"http://purl.obolibrary.org/obo/GO_0061025"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}
2_test
{"project":"2_test","denotations":[{"id":"32265848-25720466-36511010","span":{"begin":390,"end":394},"obj":"25720466"},{"id":"32265848-25720466-36511011","span":{"begin":711,"end":715},"obj":"25720466"},{"id":"32265848-25320088-36511012","span":{"begin":985,"end":989},"obj":"25320088"},{"id":"32265848-25720466-36511013","span":{"begin":1009,"end":1013},"obj":"25720466"},{"id":"32265848-26055715-36511014","span":{"begin":1029,"end":1033},"obj":"26055715"},{"id":"32265848-27712628-36511015","span":{"begin":1051,"end":1055},"obj":"27712628"},{"id":"32265848-25720466-36511016","span":{"begin":1237,"end":1241},"obj":"25720466"},{"id":"32265848-23283955-36511017","span":{"begin":1457,"end":1461},"obj":"23283955"},{"id":"32265848-25926653-36511018","span":{"begin":1477,"end":1481},"obj":"25926653"},{"id":"32265848-12730501-36511019","span":{"begin":1726,"end":1730},"obj":"12730501"},{"id":"32265848-12927536-36511020","span":{"begin":1748,"end":1752},"obj":"12927536"},{"id":"32265848-23170002-36511021","span":{"begin":1953,"end":1957},"obj":"23170002"},{"id":"32265848-24995382-36511022","span":{"begin":2207,"end":2211},"obj":"24995382"},{"id":"32265848-12730500-36511023","span":{"begin":2303,"end":2307},"obj":"12730500"},{"id":"32265848-23075143-36511024","span":{"begin":2322,"end":2326},"obj":"23075143"},{"id":"32265848-27712628-36511025","span":{"begin":3292,"end":3296},"obj":"27712628"},{"id":"32265848-28830941-36511026","span":{"begin":3455,"end":3459},"obj":"28830941"},{"id":"32265848-23283955-36511027","span":{"begin":3636,"end":3640},"obj":"23283955"},{"id":"32265848-24043791-36511028","span":{"begin":3736,"end":3740},"obj":"24043791"},{"id":"32265848-25093995-36511029","span":{"begin":3758,"end":3762},"obj":"25093995"},{"id":"32265848-24473083-36511031","span":{"begin":3952,"end":3956},"obj":"24473083"},{"id":"32265848-24473083-36511033","span":{"begin":4280,"end":4284},"obj":"24473083"},{"id":"32265848-27578435-36511034","span":{"begin":4424,"end":4428},"obj":"27578435"},{"id":"32265848-28923942-36511035","span":{"begin":4441,"end":4445},"obj":"28923942"},{"id":"32265848-28534504-36511036","span":{"begin":4458,"end":4462},"obj":"28534504"},{"id":"32265848-30679277-36511037","span":{"begin":4480,"end":4484},"obj":"30679277"},{"id":"32265848-31160783-36511038","span":{"begin":4504,"end":4508},"obj":"31160783"},{"id":"32265848-14670965-36511039","span":{"begin":4578,"end":4582},"obj":"14670965"},{"id":"32265848-16912312-36511040","span":{"begin":4600,"end":4604},"obj":"16912312"},{"id":"32265848-23831647-36511041","span":{"begin":4617,"end":4621},"obj":"23831647"},{"id":"32265848-23831647-36511042","span":{"begin":4743,"end":4747},"obj":"23831647"},{"id":"32265848-30679277-36511043","span":{"begin":4765,"end":4769},"obj":"30679277"},{"id":"32265848-25428871-36511044","span":{"begin":4972,"end":4976},"obj":"25428871"},{"id":"32265848-25428871-36511045","span":{"begin":5125,"end":5129},"obj":"25428871"},{"id":"32265848-27578435-36511046","span":{"begin":5131,"end":5135},"obj":"27578435"},{"id":"32265848-27936982-36511047","span":{"begin":5148,"end":5152},"obj":"27936982"},{"id":"32265848-15897467-36511049","span":{"begin":5312,"end":5316},"obj":"15897467"},{"id":"32265848-25428871-36511050","span":{"begin":5322,"end":5326},"obj":"25428871"},{"id":"32265848-14647384-36511051","span":{"begin":5489,"end":5493},"obj":"14647384"},{"id":"32265848-28393837-36511052","span":{"begin":5721,"end":5725},"obj":"28393837"},{"id":"32265848-15452268-36511053","span":{"begin":5832,"end":5836},"obj":"15452268"},{"id":"32265848-18448527-36511054","span":{"begin":5842,"end":5846},"obj":"18448527"},{"id":"32265848-18448527-36511056","span":{"begin":5963,"end":5967},"obj":"18448527"},{"id":"32265848-27578435-36511057","span":{"begin":5969,"end":5973},"obj":"27578435"},{"id":"32265848-24172901-36511058","span":{"begin":6114,"end":6118},"obj":"24172901"},{"id":"32265848-29190287-36511059","span":{"begin":6131,"end":6135},"obj":"29190287"},{"id":"32265848-23835475-36511060","span":{"begin":6348,"end":6352},"obj":"23835475"},{"id":"32265848-23486063-36511062","span":{"begin":6605,"end":6609},"obj":"23486063"},{"id":"32265848-25428871-36511063","span":{"begin":6838,"end":6842},"obj":"25428871"},{"id":"32265848-28807998-36511064","span":{"begin":7059,"end":7063},"obj":"28807998"},{"id":"32265848-28393837-36511065","span":{"begin":7078,"end":7082},"obj":"28393837"},{"id":"32265848-29346682-36511067","span":{"begin":7406,"end":7410},"obj":"29346682"},{"id":"32265848-32015507-36511069","span":{"begin":7868,"end":7872},"obj":"32015507"},{"id":"32265848-32015507-36511070","span":{"begin":8310,"end":8314},"obj":"32015507"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}
LitCovid-sentences
{"project":"LitCovid-sentences","denotations":[{"id":"T59","span":{"begin":0,"end":95},"obj":"Sentence"},{"id":"T60","span":{"begin":96,"end":183},"obj":"Sentence"},{"id":"T61","span":{"begin":184,"end":215},"obj":"Sentence"},{"id":"T62","span":{"begin":216,"end":396},"obj":"Sentence"},{"id":"T63","span":{"begin":397,"end":717},"obj":"Sentence"},{"id":"T64","span":{"begin":718,"end":1057},"obj":"Sentence"},{"id":"T65","span":{"begin":1058,"end":1243},"obj":"Sentence"},{"id":"T66","span":{"begin":1244,"end":1483},"obj":"Sentence"},{"id":"T67","span":{"begin":1484,"end":1754},"obj":"Sentence"},{"id":"T68","span":{"begin":1755,"end":1959},"obj":"Sentence"},{"id":"T69","span":{"begin":1960,"end":2134},"obj":"Sentence"},{"id":"T70","span":{"begin":2135,"end":2213},"obj":"Sentence"},{"id":"T71","span":{"begin":2214,"end":2328},"obj":"Sentence"},{"id":"T72","span":{"begin":2329,"end":2480},"obj":"Sentence"},{"id":"T73","span":{"begin":2481,"end":2544},"obj":"Sentence"},{"id":"T74","span":{"begin":2545,"end":2894},"obj":"Sentence"},{"id":"T75","span":{"begin":2895,"end":2982},"obj":"Sentence"},{"id":"T76","span":{"begin":2983,"end":3107},"obj":"Sentence"},{"id":"T77","span":{"begin":3108,"end":3193},"obj":"Sentence"},{"id":"T78","span":{"begin":3194,"end":3298},"obj":"Sentence"},{"id":"T79","span":{"begin":3299,"end":3461},"obj":"Sentence"},{"id":"T80","span":{"begin":3462,"end":3642},"obj":"Sentence"},{"id":"T81","span":{"begin":3643,"end":3764},"obj":"Sentence"},{"id":"T82","span":{"begin":3765,"end":3958},"obj":"Sentence"},{"id":"T83","span":{"begin":3959,"end":4116},"obj":"Sentence"},{"id":"T84","span":{"begin":4117,"end":4286},"obj":"Sentence"},{"id":"T85","span":{"begin":4287,"end":4379},"obj":"Sentence"},{"id":"T86","span":{"begin":4380,"end":4623},"obj":"Sentence"},{"id":"T87","span":{"begin":4624,"end":4771},"obj":"Sentence"},{"id":"T88","span":{"begin":4772,"end":4867},"obj":"Sentence"},{"id":"T89","span":{"begin":4868,"end":4978},"obj":"Sentence"},{"id":"T90","span":{"begin":4979,"end":5154},"obj":"Sentence"},{"id":"T91","span":{"begin":5155,"end":5273},"obj":"Sentence"},{"id":"T92","span":{"begin":5274,"end":5495},"obj":"Sentence"},{"id":"T93","span":{"begin":5496,"end":5588},"obj":"Sentence"},{"id":"T94","span":{"begin":5589,"end":5727},"obj":"Sentence"},{"id":"T95","span":{"begin":5728,"end":5848},"obj":"Sentence"},{"id":"T96","span":{"begin":5849,"end":5975},"obj":"Sentence"},{"id":"T97","span":{"begin":5976,"end":6137},"obj":"Sentence"},{"id":"T98","span":{"begin":6138,"end":6250},"obj":"Sentence"},{"id":"T99","span":{"begin":6251,"end":6354},"obj":"Sentence"},{"id":"T100","span":{"begin":6355,"end":6490},"obj":"Sentence"},{"id":"T101","span":{"begin":6491,"end":6662},"obj":"Sentence"},{"id":"T102","span":{"begin":6663,"end":6844},"obj":"Sentence"},{"id":"T103","span":{"begin":6845,"end":6927},"obj":"Sentence"},{"id":"T104","span":{"begin":6928,"end":7084},"obj":"Sentence"},{"id":"T105","span":{"begin":7085,"end":7175},"obj":"Sentence"},{"id":"T106","span":{"begin":7176,"end":7272},"obj":"Sentence"},{"id":"T107","span":{"begin":7273,"end":7431},"obj":"Sentence"},{"id":"T108","span":{"begin":7432,"end":7569},"obj":"Sentence"},{"id":"T109","span":{"begin":7570,"end":7693},"obj":"Sentence"},{"id":"T110","span":{"begin":7694,"end":7874},"obj":"Sentence"},{"id":"T111","span":{"begin":7875,"end":8049},"obj":"Sentence"},{"id":"T112","span":{"begin":8050,"end":8118},"obj":"Sentence"},{"id":"T113","span":{"begin":8119,"end":8210},"obj":"Sentence"},{"id":"T114","span":{"begin":8211,"end":8316},"obj":"Sentence"},{"id":"T115","span":{"begin":8317,"end":8429},"obj":"Sentence"}],"namespaces":[{"prefix":"_base","uri":"http://pubannotation.org/ontology/tao.owl#"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}
LitCovid-PMC-OGER-BB
{"project":"LitCovid-PMC-OGER-BB","denotations":[{"id":"T162","span":{"begin":0,"end":6},"obj":"SO:0001026"},{"id":"T163","span":{"begin":19,"end":24},"obj":"SP_6;NCBITaxon:9606"},{"id":"T164","span":{"begin":25,"end":38},"obj":"NCBITaxon:11118"},{"id":"T165","span":{"begin":100,"end":105},"obj":"SP_6;NCBITaxon:9606"},{"id":"T166","span":{"begin":125,"end":132},"obj":"NCBITaxon:10239"},{"id":"T167","span":{"begin":156,"end":171},"obj":"SO:0000984"},{"id":"T168","span":{"begin":176,"end":182},"obj":"SO:0001026"},{"id":"T169","span":{"begin":233,"end":238},"obj":"SP_6;NCBITaxon:9606"},{"id":"T170","span":{"begin":264,"end":269},"obj":"NCBITaxon:10239"},{"id":"T171","span":{"begin":270,"end":276},"obj":"SO:0001026"},{"id":"T172","span":{"begin":293,"end":298},"obj":"SO:0000028"},{"id":"T173","span":{"begin":319,"end":326},"obj":"NCBITaxon:10239"},{"id":"T174","span":{"begin":351,"end":357},"obj":"SO:0001026"},{"id":"T175","span":{"begin":419,"end":438},"obj":"SO:0000236"},{"id":"T176","span":{"begin":440,"end":444},"obj":"SO:0000236"},{"id":"T177","span":{"begin":447,"end":450},"obj":"SO:0000236"},{"id":"T178","span":{"begin":458,"end":461},"obj":"SO:0000236"},{"id":"T179","span":{"begin":491,"end":497},"obj":"SO:0001026"},{"id":"T180","span":{"begin":537,"end":543},"obj":"SO:0001026"},{"id":"T181","span":{"begin":610,"end":614},"obj":"SO:0000704"},{"id":"T182","span":{"begin":649,"end":657},"obj":"GO:0016020"},{"id":"T183","span":{"begin":728,"end":731},"obj":"SO:0000236"},{"id":"T184","span":{"begin":751,"end":755},"obj":"SO:0000704"},{"id":"T185","span":{"begin":782,"end":786},"obj":"PG_16;PR:000003209"},{"id":"T186","span":{"begin":792,"end":796},"obj":"PR:000013103"},{"id":"T187","span":{"begin":815,"end":822},"obj":"MOP:0000780"},{"id":"T188","span":{"begin":859,"end":863},"obj":"PR:000004677"},{"id":"T189","span":{"begin":864,"end":869},"obj":"PR:000000125"},{"id":"T190","span":{"begin":873,"end":877},"obj":"PR:000014815"},{"id":"T191","span":{"begin":878,"end":883},"obj":"PR:000000125"},{"id":"T192","span":{"begin":957,"end":961},"obj":"PR:000014816"},{"id":"T193","span":{"begin":1077,"end":1082},"obj":"SO:0000704"},{"id":"T194","span":{"begin":1127,"end":1132},"obj":"SO:0000704"},{"id":"T195","span":{"begin":1329,"end":1333},"obj":"SO:0000704"},{"id":"T196","span":{"begin":1350,"end":1353},"obj":"SO:0000236"},{"id":"T197","span":{"begin":1364,"end":1368},"obj":"SO:0000704"},{"id":"T198","span":{"begin":1501,"end":1506},"obj":"SP_6;NCBITaxon:9606"},{"id":"T199","span":{"begin":1513,"end":1521},"obj":"SP_10"},{"id":"T200","span":{"begin":1541,"end":1547},"obj":"SO:0001026"},{"id":"T201","span":{"begin":1563,"end":1567},"obj":"PG_16;PR:000003209"},{"id":"T202","span":{"begin":1572,"end":1577},"obj":"PG_15;PR:000001623"},{"id":"T203","span":{"begin":1759,"end":1767},"obj":"SP_9"},{"id":"T204","span":{"begin":1768,"end":1774},"obj":"SO:0001026"},{"id":"T205","span":{"begin":1812,"end":1816},"obj":"PG_16;PR:000003209"},{"id":"T206","span":{"begin":1818,"end":1823},"obj":"PG_15;PR:000001623"},{"id":"T207","span":{"begin":1964,"end":1971},"obj":"SO:0001026"},{"id":"T208","span":{"begin":1979,"end":1987},"obj":"SP_10"},{"id":"T209","span":{"begin":1992,"end":2000},"obj":"SP_9"},{"id":"T210","span":{"begin":2039,"end":2048},"obj":"PG_4"},{"id":"T211","span":{"begin":2090,"end":2098},"obj":"GO:0031975"},{"id":"T212","span":{"begin":2145,"end":2150},"obj":"SO:0000704"},{"id":"T213","span":{"begin":2236,"end":2244},"obj":"SP_10"},{"id":"T214","span":{"begin":2249,"end":2257},"obj":"SP_9"},{"id":"T215","span":{"begin":2284,"end":2288},"obj":"SO:0000704"},{"id":"T216","span":{"begin":2362,"end":2370},"obj":"SP_10"},{"id":"T217","span":{"begin":2372,"end":2380},"obj":"SP_9"},{"id":"T218","span":{"begin":2386,"end":2395},"obj":"SP_7"},{"id":"T219","span":{"begin":2422,"end":2429},"obj":"SO:0001026"},{"id":"T220","span":{"begin":2446,"end":2454},"obj":"SP_10"},{"id":"T221","span":{"begin":2456,"end":2464},"obj":"SP_9"},{"id":"T222","span":{"begin":2470,"end":2479},"obj":"SP_7"},{"id":"T223","span":{"begin":2485,"end":2492},"obj":"SO:0001026"},{"id":"T224","span":{"begin":2504,"end":2523},"obj":"SO:0000236"},{"id":"T225","span":{"begin":2525,"end":2529},"obj":"SO:0000236"},{"id":"T226","span":{"begin":2587,"end":2595},"obj":"GO:0031975"},{"id":"T227","span":{"begin":2601,"end":2609},"obj":"GO:0016020"},{"id":"T228","span":{"begin":2618,"end":2630},"obj":"GO:0000786"},{"id":"T229","span":{"begin":2680,"end":2690},"obj":"GO:0006412"},{"id":"T230","span":{"begin":2696,"end":2699},"obj":"SO:0000236"},{"id":"T231","span":{"begin":2707,"end":2710},"obj":"SO:0000236"},{"id":"T232","span":{"begin":2787,"end":2795},"obj":"SP_10"},{"id":"T233","span":{"begin":2824,"end":2832},"obj":"SP_9"},{"id":"T234","span":{"begin":2869,"end":2878},"obj":"SP_7"},{"id":"T235","span":{"begin":2895,"end":2901},"obj":"SO:0000204"},{"id":"T236","span":{"begin":2906,"end":2909},"obj":"SO:0000205"},{"id":"T237","span":{"begin":2909,"end":2912},"obj":"SO:0000203"},{"id":"T238","span":{"begin":2914,"end":2926},"obj":"GO:0006412;SO:0000203"},{"id":"T239","span":{"begin":2927,"end":2934},"obj":"SO:0000203"},{"id":"T240","span":{"begin":2942,"end":2944},"obj":"SO:0000204"},{"id":"T241","span":{"begin":2996,"end":3000},"obj":"SO:0000028"},{"id":"T242","span":{"begin":3039,"end":3047},"obj":"SP_10"},{"id":"T243","span":{"begin":3049,"end":3057},"obj":"SP_9"},{"id":"T244","span":{"begin":3063,"end":3072},"obj":"SP_7"},{"id":"T245","span":{"begin":3128,"end":3133},"obj":"SP_6;NCBITaxon:9606"},{"id":"T246","span":{"begin":3157,"end":3162},"obj":"NCBITaxon:10239"},{"id":"T247","span":{"begin":3232,"end":3237},"obj":"NCBITaxon:10239;GO:0006260"},{"id":"T248","span":{"begin":3238,"end":3253},"obj":"GO:0039703"},{"id":"T249","span":{"begin":3372,"end":3379},"obj":"NCBITaxon:10239"},{"id":"T250","span":{"begin":3393,"end":3406},"obj":"UBERON:0002405"},{"id":"T251","span":{"begin":3506,"end":3511},"obj":"NCBITaxon:10239"},{"id":"T252","span":{"begin":3684,"end":3689},"obj":"NCBITaxon:10239;GO:0019068"},{"id":"T253","span":{"begin":3690,"end":3698},"obj":"GO:0019068"},{"id":"T254","span":{"begin":3804,"end":3813},"obj":"PG_1"},{"id":"T255","span":{"begin":3817,"end":3822},"obj":"SP_6;NCBITaxon:9606"},{"id":"T256","span":{"begin":3837,"end":3842},"obj":"NCBITaxon:10239;GO:0035376"},{"id":"T257","span":{"begin":3843,"end":3848},"obj":"GO:0035376"},{"id":"T258","span":{"begin":3880,"end":3888},"obj":"GO:0016020;GO:0061025"},{"id":"T259","span":{"begin":3889,"end":3895},"obj":"GO:0061025"},{"id":"T260","span":{"begin":3906,"end":3911},"obj":"NCBITaxon:10239"},{"id":"T261","span":{"begin":3963,"end":3972},"obj":"PG_1"},{"id":"T262","span":{"begin":3986,"end":3991},"obj":"NCBITaxon:10239"},{"id":"T263","span":{"begin":4014,"end":4021},"obj":"MOP:0000780"},{"id":"T264","span":{"begin":4151,"end":4156},"obj":"NCBITaxon:10239"},{"id":"T265","span":{"begin":4223,"end":4228},"obj":"NCBITaxon:10239"},{"id":"T266","span":{"begin":4229,"end":4233},"obj":"GO:0018995"},{"id":"T267","span":{"begin":4234,"end":4242},"obj":"GO:0016020;GO:0061025"},{"id":"T268","span":{"begin":4243,"end":4249},"obj":"GO:0061025"},{"id":"T269","span":{"begin":4313,"end":4320},"obj":"SO:0000417"},{"id":"T270","span":{"begin":4336,"end":4342},"obj":"SO:0000417"},{"id":"T271","span":{"begin":4366,"end":4372},"obj":"SO:0000417"},{"id":"T272","span":{"begin":4669,"end":4673},"obj":"GO:0018995"},{"id":"T273","span":{"begin":4716,"end":4723},"obj":"SO:0000417"},{"id":"T274","span":{"begin":4785,"end":4790},"obj":"SP_6;NCBITaxon:9606"},{"id":"T275","span":{"begin":4830,"end":4835},"obj":"NCBITaxon:10239"},{"id":"T276","span":{"begin":4840,"end":4857},"obj":"GO:0005886"},{"id":"T277","span":{"begin":4930,"end":4942},"obj":"GO:0009986"},{"id":"T278","span":{"begin":5026,"end":5035},"obj":"PG_1"},{"id":"T279","span":{"begin":5082,"end":5090},"obj":"GO:0016020;GO:0061025"},{"id":"T280","span":{"begin":5091,"end":5097},"obj":"GO:0061025"},{"id":"T281","span":{"begin":5173,"end":5178},"obj":"NCBITaxon:10239"},{"id":"T282","span":{"begin":5179,"end":5186},"obj":"SO:0000704"},{"id":"T283","span":{"begin":5329,"end":5337},"obj":"SP_10"},{"id":"T284","span":{"begin":5403,"end":5434},"obj":"PG_10;PR:000003622"},{"id":"T285","span":{"begin":5436,"end":5440},"obj":"G_3;PG_10;PR:000003622"},{"id":"T286","span":{"begin":5578,"end":5587},"obj":"PG_1"},{"id":"T287","span":{"begin":5589,"end":5593},"obj":"G_3;PG_10;PR:000003622"},{"id":"T288","span":{"begin":5743,"end":5748},"obj":"SP_6;NCBITaxon:9606;PG_10"},{"id":"T289","span":{"begin":5749,"end":5753},"obj":"PG_10;PR:000003622"},{"id":"T290","span":{"begin":5755,"end":5763},"obj":"SP_10"},{"id":"T291","span":{"begin":5764,"end":5773},"obj":"PG_1"},{"id":"T292","span":{"begin":5793,"end":5803},"obj":"SP_5"},{"id":"T293","span":{"begin":5808,"end":5813},"obj":"NCBITaxon:10088"},{"id":"T294","span":{"begin":5814,"end":5819},"obj":"G_3;PG_10;PR:000003622"},{"id":"T295","span":{"begin":5907,"end":5914},"obj":"NCBITaxon:species"},{"id":"T296","span":{"begin":5931,"end":5939},"obj":"SP_10"},{"id":"T297","span":{"begin":5984,"end":5987},"obj":"SP_2"},{"id":"T298","span":{"begin":6048,"end":6053},"obj":"SP_6;NCBITaxon:9606;PG_10"},{"id":"T299","span":{"begin":6054,"end":6058},"obj":"PG_10;PR:000003622"},{"id":"T300","span":{"begin":6155,"end":6163},"obj":"SP_10"},{"id":"T301","span":{"begin":6196,"end":6200},"obj":"G_3;PG_10;PR:000003622"},{"id":"T302","span":{"begin":6251,"end":6259},"obj":"SP_9"},{"id":"T303","span":{"begin":6306,"end":6314},"obj":"SO:0000853"},{"id":"T304","span":{"begin":6315,"end":6321},"obj":"SO:0000417"},{"id":"T305","span":{"begin":6325,"end":6333},"obj":"SP_10"},{"id":"T306","span":{"begin":6364,"end":6374},"obj":"GO:0042571"},{"id":"T307","span":{"begin":6386,"end":6394},"obj":"SP_10"},{"id":"T308","span":{"begin":6462,"end":6470},"obj":"SP_9"},{"id":"T309","span":{"begin":6501,"end":6509},"obj":"SP_9"},{"id":"T310","span":{"begin":6530,"end":6541},"obj":"PR:000012546"},{"id":"T311","span":{"begin":6543,"end":6547},"obj":"PR:000001341;PG_7"},{"id":"T312","span":{"begin":6639,"end":6647},"obj":"SP_10"},{"id":"T313","span":{"begin":6657,"end":6661},"obj":"PR:000003622;G_3;PG_10"},{"id":"T314","span":{"begin":6692,"end":6700},"obj":"SP_10"},{"id":"T315","span":{"begin":6705,"end":6713},"obj":"SP_9"},{"id":"T316","span":{"begin":6845,"end":6853},"obj":"SP_9"},{"id":"T317","span":{"begin":6909,"end":6917},"obj":"SP_10"},{"id":"T318","span":{"begin":6982,"end":6990},"obj":"SP_9"},{"id":"T319","span":{"begin":7005,"end":7009},"obj":"PG_7;PR:000001341"},{"id":"T320","span":{"begin":7085,"end":7093},"obj":"SP_9"},{"id":"T321","span":{"begin":7226,"end":7231},"obj":"SP_6;NCBITaxon:9606"},{"id":"T322","span":{"begin":7232,"end":7236},"obj":"PG_7;PR:000001341"},{"id":"T323","span":{"begin":7288,"end":7292},"obj":"SP_9"},{"id":"T324","span":{"begin":7345,"end":7349},"obj":"SP_2;NCBITaxon:8342"},{"id":"T325","span":{"begin":7365,"end":7370},"obj":"SP_6;NCBITaxon:9606"},{"id":"T326","span":{"begin":7371,"end":7375},"obj":"PG_7;PR:000001341"},{"id":"T327","span":{"begin":7376,"end":7386},"obj":"GO:0010467"},{"id":"T328","span":{"begin":7500,"end":7505},"obj":"SP_6;NCBITaxon:9606"},{"id":"T329","span":{"begin":7548,"end":7555},"obj":"NCBITaxon:species"},{"id":"T330","span":{"begin":7574,"end":7582},"obj":"SP_9"},{"id":"T331","span":{"begin":7583,"end":7592},"obj":"PG_1"},{"id":"T332","span":{"begin":7611,"end":7620},"obj":"GO:0032991"},{"id":"T333","span":{"begin":7637,"end":7641},"obj":"PG_7;PR:000001341"},{"id":"T334","span":{"begin":7733,"end":7738},"obj":"SP_6;NCBITaxon:9606"},{"id":"T335","span":{"begin":7744,"end":7753},"obj":"SP_7"},{"id":"T336","span":{"begin":7776,"end":7783},"obj":"NCBITaxon:species"},{"id":"T337","span":{"begin":7845,"end":7853},"obj":"SP_10"},{"id":"T338","span":{"begin":7879,"end":7885},"obj":"SO:0001026"},{"id":"T339","span":{"begin":7889,"end":7898},"obj":"SP_7"},{"id":"T340","span":{"begin":7907,"end":7912},"obj":"PG_15"},{"id":"T341","span":{"begin":7914,"end":7924},"obj":"GO:0006412"},{"id":"T342","span":{"begin":7930,"end":7933},"obj":"SO:0000236"},{"id":"T343","span":{"begin":8058,"end":8066},"obj":"SP_10"},{"id":"T344","span":{"begin":8071,"end":8079},"obj":"SP_9"},{"id":"T345","span":{"begin":8081,"end":8090},"obj":"SP_7"},{"id":"T346","span":{"begin":8113,"end":8117},"obj":"SO:0000704"},{"id":"T347","span":{"begin":8133,"end":8142},"obj":"SP_7"},{"id":"T348","span":{"begin":8176,"end":8184},"obj":"SP_10"},{"id":"T349","span":{"begin":8189,"end":8197},"obj":"SP_9"},{"id":"T350","span":{"begin":8216,"end":8224},"obj":"SP_10"},{"id":"T351","span":{"begin":8226,"end":8235},"obj":"SP_7"},{"id":"T352","span":{"begin":8246,"end":8250},"obj":"G_3;PG_10;PR:000003622"},{"id":"T353","span":{"begin":8345,"end":8354},"obj":"SP_7"},{"id":"T354","span":{"begin":8387,"end":8392},"obj":"NCBITaxon:10239"},{"id":"T6954","span":{"begin":0,"end":6},"obj":"SO:0001026"},{"id":"T10468","span":{"begin":19,"end":24},"obj":"SP_6;NCBITaxon:9606"},{"id":"T36779","span":{"begin":25,"end":38},"obj":"NCBITaxon:11118"},{"id":"T76228","span":{"begin":100,"end":105},"obj":"SP_6;NCBITaxon:9606"},{"id":"T44525","span":{"begin":125,"end":132},"obj":"NCBITaxon:10239"},{"id":"T14620","span":{"begin":156,"end":171},"obj":"SO:0000984"},{"id":"T64880","span":{"begin":176,"end":182},"obj":"SO:0001026"},{"id":"T58645","span":{"begin":233,"end":238},"obj":"SP_6;NCBITaxon:9606"},{"id":"T9835","span":{"begin":264,"end":269},"obj":"NCBITaxon:10239"},{"id":"T64010","span":{"begin":270,"end":276},"obj":"SO:0001026"},{"id":"T10101","span":{"begin":293,"end":298},"obj":"SO:0000028"},{"id":"T99663","span":{"begin":319,"end":326},"obj":"NCBITaxon:10239"},{"id":"T65268","span":{"begin":351,"end":357},"obj":"SO:0001026"},{"id":"T42666","span":{"begin":419,"end":438},"obj":"SO:0000236"},{"id":"T74127","span":{"begin":440,"end":444},"obj":"SO:0000236"},{"id":"T74246","span":{"begin":447,"end":450},"obj":"SO:0000236"},{"id":"T85903","span":{"begin":458,"end":461},"obj":"SO:0000236"},{"id":"T51104","span":{"begin":491,"end":497},"obj":"SO:0001026"},{"id":"T62201","span":{"begin":537,"end":543},"obj":"SO:0001026"},{"id":"T69414","span":{"begin":610,"end":614},"obj":"SO:0000704"},{"id":"T1371","span":{"begin":649,"end":657},"obj":"GO:0016020"},{"id":"T82583","span":{"begin":728,"end":731},"obj":"SO:0000236"},{"id":"T81801","span":{"begin":751,"end":755},"obj":"SO:0000704"},{"id":"T70907","span":{"begin":782,"end":786},"obj":"PG_16;PR:000003209"},{"id":"T25954","span":{"begin":792,"end":796},"obj":"PR:000013103"},{"id":"T4713","span":{"begin":815,"end":822},"obj":"MOP:0000780"},{"id":"T4703","span":{"begin":859,"end":863},"obj":"PR:000004677"},{"id":"T36339","span":{"begin":864,"end":869},"obj":"PR:000000125"},{"id":"T30074","span":{"begin":873,"end":877},"obj":"PR:000014815"},{"id":"T32869","span":{"begin":878,"end":883},"obj":"PR:000000125"},{"id":"T28347","span":{"begin":957,"end":961},"obj":"PR:000014816"},{"id":"T17264","span":{"begin":1077,"end":1082},"obj":"SO:0000704"},{"id":"T14205","span":{"begin":1127,"end":1132},"obj":"SO:0000704"},{"id":"T82252","span":{"begin":1329,"end":1333},"obj":"SO:0000704"},{"id":"T33553","span":{"begin":1350,"end":1353},"obj":"SO:0000236"},{"id":"T22432","span":{"begin":1364,"end":1368},"obj":"SO:0000704"},{"id":"T3462","span":{"begin":1501,"end":1506},"obj":"SP_6;NCBITaxon:9606"},{"id":"T56651","span":{"begin":1513,"end":1521},"obj":"SP_10"},{"id":"T46590","span":{"begin":1541,"end":1547},"obj":"SO:0001026"},{"id":"T11916","span":{"begin":1563,"end":1567},"obj":"PG_16;PR:000003209"},{"id":"T17349","span":{"begin":1572,"end":1577},"obj":"PG_15;PR:000001623"},{"id":"T83873","span":{"begin":1759,"end":1767},"obj":"SP_9"},{"id":"T94441","span":{"begin":1768,"end":1774},"obj":"SO:0001026"},{"id":"T90837","span":{"begin":1812,"end":1816},"obj":"PG_16;PR:000003209"},{"id":"T8929","span":{"begin":1818,"end":1823},"obj":"PG_15;PR:000001623"},{"id":"T57362","span":{"begin":1964,"end":1971},"obj":"SO:0001026"},{"id":"T16622","span":{"begin":1979,"end":1987},"obj":"SP_10"},{"id":"T8486","span":{"begin":1992,"end":2000},"obj":"SP_9"},{"id":"T50696","span":{"begin":2039,"end":2048},"obj":"PG_4"},{"id":"T94952","span":{"begin":2090,"end":2098},"obj":"GO:0031975"},{"id":"T36645","span":{"begin":2145,"end":2150},"obj":"SO:0000704"},{"id":"T7283","span":{"begin":2236,"end":2244},"obj":"SP_10"},{"id":"T14445","span":{"begin":2249,"end":2257},"obj":"SP_9"},{"id":"T79858","span":{"begin":2284,"end":2288},"obj":"SO:0000704"},{"id":"T68938","span":{"begin":2362,"end":2370},"obj":"SP_10"},{"id":"T14236","span":{"begin":2372,"end":2380},"obj":"SP_9"},{"id":"T4340","span":{"begin":2386,"end":2395},"obj":"SP_7"},{"id":"T76523","span":{"begin":2422,"end":2429},"obj":"SO:0001026"},{"id":"T22537","span":{"begin":2446,"end":2454},"obj":"SP_10"},{"id":"T91705","span":{"begin":2456,"end":2464},"obj":"SP_9"},{"id":"T26104","span":{"begin":2470,"end":2479},"obj":"SP_7"},{"id":"T79836","span":{"begin":2485,"end":2492},"obj":"SO:0001026"},{"id":"T84618","span":{"begin":2504,"end":2523},"obj":"SO:0000236"},{"id":"T19063","span":{"begin":2525,"end":2529},"obj":"SO:0000236"},{"id":"T98118","span":{"begin":2587,"end":2595},"obj":"GO:0031975"},{"id":"T3600","span":{"begin":2601,"end":2609},"obj":"GO:0016020"},{"id":"T24611","span":{"begin":2618,"end":2630},"obj":"GO:0000786"},{"id":"T62198","span":{"begin":2680,"end":2690},"obj":"GO:0006412"},{"id":"T42846","span":{"begin":2696,"end":2699},"obj":"SO:0000236"},{"id":"T50694","span":{"begin":2707,"end":2710},"obj":"SO:0000236"},{"id":"T80205","span":{"begin":2787,"end":2795},"obj":"SP_10"},{"id":"T50023","span":{"begin":2824,"end":2832},"obj":"SP_9"},{"id":"T9028","span":{"begin":2869,"end":2878},"obj":"SP_7"},{"id":"T26735","span":{"begin":2895,"end":2901},"obj":"SO:0000204"},{"id":"T15719","span":{"begin":2906,"end":2909},"obj":"SO:0000205"},{"id":"T80430","span":{"begin":2909,"end":2912},"obj":"SO:0000203"},{"id":"T73692","span":{"begin":2914,"end":2926},"obj":"GO:0006412;SO:0000203"},{"id":"T18701","span":{"begin":2927,"end":2934},"obj":"SO:0000203"},{"id":"T25285","span":{"begin":2942,"end":2944},"obj":"SO:0000204"},{"id":"T15989","span":{"begin":2996,"end":3000},"obj":"SO:0000028"},{"id":"T64360","span":{"begin":3039,"end":3047},"obj":"SP_10"},{"id":"T58689","span":{"begin":3049,"end":3057},"obj":"SP_9"},{"id":"T91484","span":{"begin":3063,"end":3072},"obj":"SP_7"},{"id":"T1968","span":{"begin":3128,"end":3133},"obj":"SP_6;NCBITaxon:9606"},{"id":"T57613","span":{"begin":3157,"end":3162},"obj":"NCBITaxon:10239"},{"id":"T21257","span":{"begin":3232,"end":3237},"obj":"NCBITaxon:10239;GO:0006260"},{"id":"T77300","span":{"begin":3238,"end":3253},"obj":"GO:0039703"},{"id":"T62214","span":{"begin":3372,"end":3379},"obj":"NCBITaxon:10239"},{"id":"T56006","span":{"begin":3393,"end":3406},"obj":"UBERON:0002405"},{"id":"T42960","span":{"begin":3506,"end":3511},"obj":"NCBITaxon:10239"},{"id":"T69860","span":{"begin":3684,"end":3689},"obj":"NCBITaxon:10239;GO:0019068"},{"id":"T45894","span":{"begin":3690,"end":3698},"obj":"GO:0019068"},{"id":"T71239","span":{"begin":3804,"end":3813},"obj":"PG_1"},{"id":"T4932","span":{"begin":3817,"end":3822},"obj":"SP_6;NCBITaxon:9606"},{"id":"T74967","span":{"begin":3837,"end":3842},"obj":"NCBITaxon:10239;GO:0035376"},{"id":"T69577","span":{"begin":3843,"end":3848},"obj":"GO:0035376"},{"id":"T53606","span":{"begin":3880,"end":3888},"obj":"GO:0016020;GO:0061025"},{"id":"T3981","span":{"begin":3889,"end":3895},"obj":"GO:0061025"},{"id":"T89765","span":{"begin":3906,"end":3911},"obj":"NCBITaxon:10239"},{"id":"T70942","span":{"begin":3963,"end":3972},"obj":"PG_1"},{"id":"T47357","span":{"begin":3986,"end":3991},"obj":"NCBITaxon:10239"},{"id":"T8900","span":{"begin":4014,"end":4021},"obj":"MOP:0000780"},{"id":"T43016","span":{"begin":4151,"end":4156},"obj":"NCBITaxon:10239"},{"id":"T88882","span":{"begin":4223,"end":4228},"obj":"NCBITaxon:10239"},{"id":"T28289","span":{"begin":4229,"end":4233},"obj":"GO:0018995"},{"id":"T68398","span":{"begin":4234,"end":4242},"obj":"GO:0016020;GO:0061025"},{"id":"T65073","span":{"begin":4243,"end":4249},"obj":"GO:0061025"},{"id":"T84986","span":{"begin":4313,"end":4320},"obj":"SO:0000417"},{"id":"T14910","span":{"begin":4336,"end":4342},"obj":"SO:0000417"},{"id":"T57910","span":{"begin":4366,"end":4372},"obj":"SO:0000417"},{"id":"T80490","span":{"begin":4669,"end":4673},"obj":"GO:0018995"},{"id":"T96176","span":{"begin":4716,"end":4723},"obj":"SO:0000417"},{"id":"T32496","span":{"begin":4785,"end":4790},"obj":"SP_6;NCBITaxon:9606"},{"id":"T21394","span":{"begin":4830,"end":4835},"obj":"NCBITaxon:10239"},{"id":"T3686","span":{"begin":4840,"end":4857},"obj":"GO:0005886"},{"id":"T93872","span":{"begin":4930,"end":4942},"obj":"GO:0009986"},{"id":"T87352","span":{"begin":5026,"end":5035},"obj":"PG_1"},{"id":"T53956","span":{"begin":5082,"end":5090},"obj":"GO:0016020;GO:0061025"},{"id":"T75162","span":{"begin":5091,"end":5097},"obj":"GO:0061025"},{"id":"T1039","span":{"begin":5173,"end":5178},"obj":"NCBITaxon:10239"},{"id":"T29793","span":{"begin":5179,"end":5186},"obj":"SO:0000704"},{"id":"T75904","span":{"begin":5329,"end":5337},"obj":"SP_10"},{"id":"T49457","span":{"begin":5403,"end":5434},"obj":"PG_10;PR:000003622"},{"id":"T35911","span":{"begin":5436,"end":5440},"obj":"G_3;PG_10;PR:000003622"},{"id":"T70634","span":{"begin":5578,"end":5587},"obj":"PG_1"},{"id":"T11366","span":{"begin":5589,"end":5593},"obj":"G_3;PG_10;PR:000003622"},{"id":"T55512","span":{"begin":5743,"end":5748},"obj":"SP_6;NCBITaxon:9606;PG_10"},{"id":"T44473","span":{"begin":5749,"end":5753},"obj":"PG_10;PR:000003622"},{"id":"T82875","span":{"begin":5755,"end":5763},"obj":"SP_10"},{"id":"T71569","span":{"begin":5764,"end":5773},"obj":"PG_1"},{"id":"T13510","span":{"begin":5793,"end":5803},"obj":"SP_5"},{"id":"T75313","span":{"begin":5808,"end":5813},"obj":"NCBITaxon:10088"},{"id":"T61463","span":{"begin":5814,"end":5819},"obj":"G_3;PG_10;PR:000003622"},{"id":"T80982","span":{"begin":5907,"end":5914},"obj":"NCBITaxon:species"},{"id":"T18234","span":{"begin":5931,"end":5939},"obj":"SP_10"},{"id":"T61271","span":{"begin":5984,"end":5987},"obj":"SP_2"},{"id":"T69856","span":{"begin":6048,"end":6053},"obj":"SP_6;NCBITaxon:9606;PG_10"},{"id":"T52038","span":{"begin":6054,"end":6058},"obj":"PG_10;PR:000003622"},{"id":"T61108","span":{"begin":6155,"end":6163},"obj":"SP_10"},{"id":"T50014","span":{"begin":6196,"end":6200},"obj":"G_3;PG_10;PR:000003622"},{"id":"T29983","span":{"begin":6251,"end":6259},"obj":"SP_9"},{"id":"T17057","span":{"begin":6306,"end":6314},"obj":"SO:0000853"},{"id":"T32008","span":{"begin":6315,"end":6321},"obj":"SO:0000417"},{"id":"T56589","span":{"begin":6325,"end":6333},"obj":"SP_10"},{"id":"T57877","span":{"begin":6364,"end":6374},"obj":"GO:0042571"},{"id":"T96396","span":{"begin":6386,"end":6394},"obj":"SP_10"},{"id":"T84761","span":{"begin":6462,"end":6470},"obj":"SP_9"},{"id":"T4692","span":{"begin":6501,"end":6509},"obj":"SP_9"},{"id":"T69481","span":{"begin":6530,"end":6541},"obj":"PR:000012546"},{"id":"T91688","span":{"begin":6543,"end":6547},"obj":"PR:000001341;PG_7"},{"id":"T56470","span":{"begin":6639,"end":6647},"obj":"SP_10"},{"id":"T53532","span":{"begin":6657,"end":6661},"obj":"PR:000003622;G_3;PG_10"},{"id":"T12186","span":{"begin":6692,"end":6700},"obj":"SP_10"},{"id":"T61058","span":{"begin":6705,"end":6713},"obj":"SP_9"},{"id":"T96781","span":{"begin":6845,"end":6853},"obj":"SP_9"},{"id":"T61210","span":{"begin":6909,"end":6917},"obj":"SP_10"},{"id":"T31903","span":{"begin":6982,"end":6990},"obj":"SP_9"},{"id":"T62435","span":{"begin":7005,"end":7009},"obj":"PG_7;PR:000001341"},{"id":"T27691","span":{"begin":7085,"end":7093},"obj":"SP_9"},{"id":"T96680","span":{"begin":7226,"end":7231},"obj":"SP_6;NCBITaxon:9606"},{"id":"T87087","span":{"begin":7232,"end":7236},"obj":"PG_7;PR:000001341"},{"id":"T52550","span":{"begin":7288,"end":7292},"obj":"SP_9"},{"id":"T37578","span":{"begin":7345,"end":7349},"obj":"SP_2;NCBITaxon:8342"},{"id":"T90449","span":{"begin":7365,"end":7370},"obj":"SP_6;NCBITaxon:9606"},{"id":"T19922","span":{"begin":7371,"end":7375},"obj":"PG_7;PR:000001341"},{"id":"T67319","span":{"begin":7376,"end":7386},"obj":"GO:0010467"},{"id":"T99843","span":{"begin":7500,"end":7505},"obj":"SP_6;NCBITaxon:9606"},{"id":"T78701","span":{"begin":7548,"end":7555},"obj":"NCBITaxon:species"},{"id":"T14528","span":{"begin":7574,"end":7582},"obj":"SP_9"},{"id":"T7216","span":{"begin":7583,"end":7592},"obj":"PG_1"},{"id":"T77879","span":{"begin":7611,"end":7620},"obj":"GO:0032991"},{"id":"T55323","span":{"begin":7637,"end":7641},"obj":"PG_7;PR:000001341"},{"id":"T1965","span":{"begin":7733,"end":7738},"obj":"SP_6;NCBITaxon:9606"},{"id":"T85882","span":{"begin":7744,"end":7753},"obj":"SP_7"},{"id":"T71678","span":{"begin":7776,"end":7783},"obj":"NCBITaxon:species"},{"id":"T10379","span":{"begin":7845,"end":7853},"obj":"SP_10"},{"id":"T31660","span":{"begin":7879,"end":7885},"obj":"SO:0001026"},{"id":"T21405","span":{"begin":7889,"end":7898},"obj":"SP_7"},{"id":"T59197","span":{"begin":7907,"end":7912},"obj":"PG_15"},{"id":"T93875","span":{"begin":7914,"end":7924},"obj":"GO:0006412"},{"id":"T21842","span":{"begin":7930,"end":7933},"obj":"SO:0000236"},{"id":"T77041","span":{"begin":8058,"end":8066},"obj":"SP_10"},{"id":"T8743","span":{"begin":8071,"end":8079},"obj":"SP_9"},{"id":"T42284","span":{"begin":8081,"end":8090},"obj":"SP_7"},{"id":"T61203","span":{"begin":8113,"end":8117},"obj":"SO:0000704"},{"id":"T64022","span":{"begin":8133,"end":8142},"obj":"SP_7"},{"id":"T97146","span":{"begin":8176,"end":8184},"obj":"SP_10"},{"id":"T89407","span":{"begin":8189,"end":8197},"obj":"SP_9"},{"id":"T3948","span":{"begin":8216,"end":8224},"obj":"SP_10"},{"id":"T28360","span":{"begin":8226,"end":8235},"obj":"SP_7"},{"id":"T33721","span":{"begin":8246,"end":8250},"obj":"G_3;PG_10;PR:000003622"},{"id":"T49760","span":{"begin":8345,"end":8354},"obj":"SP_7"},{"id":"T92299","span":{"begin":8387,"end":8392},"obj":"NCBITaxon:10239"}],"text":"Genome of Emerging Human Coronaviruses, as Well as Structure and Function of Their Key Proteins\nThe human CoVs are enveloped viruses with a positive-sense, single-stranded RNA genome. They are 80–160 nm in diameter. Like other CoVs, human CoVs contain the largest viral genome [27–32 kilobase pairs (kb)] among the RNA viruses, and they share similar genome organization (Fehr and Perlman, 2015). Two large overlapping open reading frames (ORFs), ORF 1a and ORF 1b, occupy two-thirds of the genome at the 5′-terminus, and a third of the genome at the 3′-terminus encodes four common structural proteins in the gene order of spike (S), envelope (E), membrane (M), and nucleocapsid (N) (5′–3′) (Fehr and Perlman, 2015). The large ORF 1ab is a replicase gene encoding polyproteins 1a (pp1a) and pp1b/1ab, which can be cleaved into 15–16 non-structural proteins (nsp2-nsp16 or nsp1-nsp16) by 3C-like proteinase (3CLpro, nsp5) and papain-like proteinase (PLpro, nsp3) (Bailey-Elkin et al., 2014; Fehr and Perlman, 2015; Tomar et al., 2015; Snijder et al., 2016). In addition to the genes encoding the above structural proteins, the genes encoding accessory proteins have also been detected in the 3′ region between S–E–M–N (Fehr and Perlman, 2015). Some beta-CoVs, such as HCoV-OC43 and HCoV-HKU1, contain hemagglutinin-esterase (HE) gene located between ORF 1ab and S gene encoding an additional structural protein, HE (De Groot et al., 2011; Desforges et al., 2013; Huang et al., 2015). Similar to other human CoVs, SARS-CoV possesses a ∼29-kb genome, which encodes pp1a and pp1ab, four main structural proteins (S, E, M, and N), and eight accessory proteins, such as 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (Figure 2A) (Marra et al., 2003; Snijder et al., 2003). The MERS-CoV genome is about 30 kb in length and encodes pp1a, pp1ab, four structural proteins (S, E, M, and N), and five accessory proteins (3, 4a, 4b, 5, and 8b) (Figure 2A) (van Boheemen et al., 2012). The genomic RNA of SARS-CoV and MERS-CoV is packed inside capsid formed by the N protein, while the M, E, and S proteins form the envelope surrounding the capsid (Figure 2B). Accessory genes may incorporate into virions at low levels (Liu et al., 2014). Nevertheless, neither SARS-CoV nor MERS-CoV appears to contain the HE gene (Rota et al., 2003; Zaki et al., 2012).\nFIGURE 2 Schematic structure of SARS-CoV, MERS-CoV, and 2019-nCoV. (A) Schematic diagram of genomic organization of SARS-CoV, MERS-CoV, and 2019-nCoV. The genomic regions or open-reading frames (ORFs) are compared. Structural proteins, including spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins, as well as non-structural proteins translated from ORF 1a and ORF 1b and accessory proteins, including 3a, 3b, 6, 7a, 7b, 8a, 8b, and 9b (for SARS-CoV), 3, 4a, 4b, 5, and 8b (for MERS-CoV), and 3a, 6, 7a, 7b, 8, and 10 (for 2019-nCoV) are indicated. 5′-UTR and 3′-UTR, untranslated regions at the N- and C-terminal regions, respectively. Kb, kilobase pair. (B) Schematic structure of virion of SARS-CoV, MERS-CoV, and 2019-nCoV and its major structural proteins. Several proteins of human CoVs are important for viral infection and/or pathogenesis. For example, most nsps participate in viral RNA replication and/or transcription (Snijder et al., 2016). The accessory proteins interact with host cells, potentially helping the viruses to evade the immune system and increase their virulence (Menachery et al., 2017). The HE protein assists in the attachment of virus–host cells, thus playing a key role in the production of infectious virions, as in the case of HCoV-OC43 (Desforges et al., 2013). The M and E proteins are responsible for virus assembly or promote virulence (Scobey et al., 2013; DeDiego et al., 2014). Different from the above proteins, the S protein of human CoVs mediates viral entry into host cells and subsequent membrane fusion, enabling viral infection (Du et al., 2009a; Lu et al., 2014). The S protein is a class I viral protein, which can be cleaved into two functional subunits, an amino-terminal S1 subunit and a carboxyl-terminal S2 subunit. The S1 subunit is responsible for virus–host cell receptor binding, whereas the S2 subunit is involved in virus–host membrane fusion (Li et al., 2005a; Lu et al., 2014). The S1 contains two major domains, an N-terminal domain (NTD) and a C-terminal domain (CTD). In general, NTDs mediate sugar binding (Li, 2016; Li et al., 2017; Ou et al., 2017; Hulswit et al., 2019; Tortorici et al., 2019), whereas CTDs facilitate protein receptor recognition (Wong et al., 2004; Hofmann et al., 2006; Lu et al., 2013). The NTDs and CTDs of the S1 subunit can bind host receptors or function as receptor-binding domains (RBDs) (Lu et al., 2013; Hulswit et al., 2019). The entry of human CoVs relies on the interaction between viral and cellular membrane proteins. Recognition of S1 subunit with a receptor and/or sugar on the cell surface initiates the infection (Li, 2015). After the initial recognition and binding, the S protein undergoes conformational changes, followed by membrane fusion through the S2 region (Li, 2015, 2016; Du et al., 2017). Consequently, the viral genetic materials are delivered into the host cell through the fusion core (Du et al., 2009a).\nSimilar to HCoV-NL63 (Hofmann et al., 2005; Li, 2015), SARS-CoV recognizes, through the RBD in the CTD region of its S1 subunit, angiotensin-converting enzyme 2 (ACE2) as the receptor on the target cell (Li et al., 2003). The RBD (CTD) in two states (standing or lying) has been observed in the trimeric S protein. ACE2 binds to standing RBD, specifically in the receptor-binding motif (RBM), keeping the RBD in the “standing” state (Yuan et al., 2017). In addition to human ACE2, SARS-CoV S protein could also bind to palm civet and mouse ACE2s (Li et al., 2004; Li, 2008). Mutations in the RBD of S1 subunit are required for cross-species transmission of SARS-CoV (Li et al., 2005c; Li, 2008, 2016). Several bat SARSr-CoVs have been identified, and these CoVs can utilize human ACE2 as their receptor to bind the target cells (Ge et al., 2013; Hu et al., 2017). The structure of SARS-CoV S trimer and RBD binding to the ACE2 receptor is shown in Supplementary Figures S1A,B.\nMERS-CoV RBD shares a structure similar to that of the homology domain of SARS-CoV (Wang et al., 2013). However, antibodies induced by SARS-CoV RBD have no cross-reactivity and/or cross-neutralizing activity to MERS-CoV (Du et al., 2013b). Moreover, MERS-CoV utilizes dipeptidyl peptidase 4 (DPP4) as a receptor through the RBD (CTD) region (Raj et al., 2013), which is distinct from the SARS-CoV receptor ACE2. Although the core regions of SARS-CoV and MERS-CoV RBDs are similar, their RBM regions are significantly different, which explains why they recognize different receptors (Li, 2015). MERS-CoV S trimer also maintains a structure similar to that of SARS-CoV S trimer. Both standing and lying states can be detected in the MERS-CoV RBDs, whereas DPP4 only binds to the standing RBD (Pallesen et al., 2017; Yuan et al., 2017). MERS-CoV is clustered with Ty-BatCoV HKU4 and Pi-BatCoV HKU5 in the subgenus Merbecovirus. Ty-BatCoV HKU4, but not Pi-BatCoV HKU5, could use human DPP4 as a receptor (Yang et al., 2014b). Recently, some MERS-related CoVs (MERSr-CoVs) have been discovered from bats that can enter human DPP4-expressing cells (Lau et al., 2018; Luo et al., 2018a). These findings suggest that the emergence of MERSr-CoV may threaten human health owing to their potential for cross-species transmission. The MERS-CoV S protein and RBD and their complexes, along with the DPP4 receptor, are shown in Supplementary Figures S1C,D.\nRecent studies have found that the new human CoV, 2019-nCoV, which belongs to the species of SARSr-CoV, shares high sequence identify (about 79.5%) to SARS-CoV (Zhou et al., 2020). The genome of 2019-nCoV encodes pp1ab (translated from ORF 1ab), four structural proteins (S, E, M, and N), and six accessory proteins (3a, 6, 7a, 7b, 8, and 10) (Figure 2A). Same as SARS-CoV and MERS-CoV, 2019-nCoV appears to have no HE gene. The virion of 2019-nCoV consists of similar structure as SARS-CoV and MERS-CoV (Figure 2B). Like SARS-CoV, 2019-nCoV also uses ACE2 as its cellular receptor to enter host cells (Zhou et al., 2020). Currently, the structure of 2019-nCoV RBD and/or its binding with the viral receptor has not yet been available."}