PMC:7405836 / 9342-22626 JSONTXT

Annnotations TAB JSON ListView MergeView

    LitCovid-PD-FMA-UBERON

    {"project":"LitCovid-PD-FMA-UBERON","denotations":[{"id":"T97561","span":{"begin":56,"end":59},"obj":"Body_part"},{"id":"T3215","span":{"begin":1292,"end":1296},"obj":"Body_part"},{"id":"T55185","span":{"begin":1392,"end":1395},"obj":"Body_part"},{"id":"T57154","span":{"begin":1396,"end":1402},"obj":"Body_part"},{"id":"T31557","span":{"begin":1471,"end":1477},"obj":"Body_part"},{"id":"T58680","span":{"begin":1734,"end":1737},"obj":"Body_part"},{"id":"T56769","span":{"begin":2006,"end":2012},"obj":"Body_part"},{"id":"T67784","span":{"begin":2061,"end":2077},"obj":"Body_part"},{"id":"T16392","span":{"begin":2177,"end":2181},"obj":"Body_part"},{"id":"T43992","span":{"begin":2224,"end":2240},"obj":"Body_part"},{"id":"T67915","span":{"begin":2333,"end":2341},"obj":"Body_part"},{"id":"T16102","span":{"begin":2444,"end":2450},"obj":"Body_part"},{"id":"T48415","span":{"begin":2473,"end":2477},"obj":"Body_part"},{"id":"T409","span":{"begin":2639,"end":2647},"obj":"Body_part"},{"id":"T15201","span":{"begin":2683,"end":2687},"obj":"Body_part"},{"id":"T17186","span":{"begin":3238,"end":3248},"obj":"Body_part"},{"id":"T4307","span":{"begin":3665,"end":3681},"obj":"Body_part"},{"id":"T53065","span":{"begin":4122,"end":4129},"obj":"Body_part"},{"id":"T20479","span":{"begin":4733,"end":4740},"obj":"Body_part"},{"id":"T27420","span":{"begin":4774,"end":4781},"obj":"Body_part"},{"id":"T61724","span":{"begin":4796,"end":4801},"obj":"Body_part"},{"id":"T16194","span":{"begin":4899,"end":4907},"obj":"Body_part"},{"id":"T52269","span":{"begin":5018,"end":5030},"obj":"Body_part"},{"id":"T90910","span":{"begin":5045,"end":5052},"obj":"Body_part"},{"id":"T6390","span":{"begin":5095,"end":5116},"obj":"Body_part"},{"id":"T55301","span":{"begin":5146,"end":5153},"obj":"Body_part"},{"id":"T95276","span":{"begin":5172,"end":5183},"obj":"Body_part"},{"id":"T81422","span":{"begin":5240,"end":5251},"obj":"Body_part"},{"id":"T95553","span":{"begin":5467,"end":5471},"obj":"Body_part"},{"id":"T65392","span":{"begin":5581,"end":5587},"obj":"Body_part"},{"id":"T45119","span":{"begin":5649,"end":5656},"obj":"Body_part"},{"id":"T63666","span":{"begin":5692,"end":5700},"obj":"Body_part"},{"id":"T95143","span":{"begin":5787,"end":5795},"obj":"Body_part"},{"id":"T95436","span":{"begin":6289,"end":6296},"obj":"Body_part"},{"id":"T39694","span":{"begin":6406,"end":6414},"obj":"Body_part"},{"id":"T21074","span":{"begin":6444,"end":6454},"obj":"Body_part"},{"id":"T80948","span":{"begin":6484,"end":6494},"obj":"Body_part"},{"id":"T68680","span":{"begin":6551,"end":6562},"obj":"Body_part"},{"id":"T78938","span":{"begin":6662,"end":6672},"obj":"Body_part"},{"id":"T89404","span":{"begin":7079,"end":7085},"obj":"Body_part"},{"id":"T52477","span":{"begin":7169,"end":7173},"obj":"Body_part"},{"id":"T55542","span":{"begin":7736,"end":7743},"obj":"Body_part"},{"id":"T74523","span":{"begin":7750,"end":7757},"obj":"Body_part"},{"id":"T18598","span":{"begin":7785,"end":7792},"obj":"Body_part"},{"id":"T37585","span":{"begin":7982,"end":7990},"obj":"Body_part"},{"id":"T68033","span":{"begin":8013,"end":8023},"obj":"Body_part"},{"id":"T24078","span":{"begin":8112,"end":8119},"obj":"Body_part"},{"id":"T42949","span":{"begin":8335,"end":8342},"obj":"Body_part"},{"id":"T9662","span":{"begin":8374,"end":8384},"obj":"Body_part"},{"id":"T92572","span":{"begin":8436,"end":8443},"obj":"Body_part"},{"id":"T6573","span":{"begin":8462,"end":8469},"obj":"Body_part"},{"id":"T81130","span":{"begin":8529,"end":8537},"obj":"Body_part"},{"id":"T17307","span":{"begin":8763,"end":8770},"obj":"Body_part"},{"id":"T3919","span":{"begin":8877,"end":8884},"obj":"Body_part"},{"id":"T27311","span":{"begin":9056,"end":9063},"obj":"Body_part"},{"id":"T30130","span":{"begin":9082,"end":9092},"obj":"Body_part"},{"id":"T92154","span":{"begin":9140,"end":9147},"obj":"Body_part"},{"id":"T13665","span":{"begin":9154,"end":9161},"obj":"Body_part"},{"id":"T82837","span":{"begin":9271,"end":9277},"obj":"Body_part"},{"id":"T92091","span":{"begin":9293,"end":9300},"obj":"Body_part"},{"id":"T46746","span":{"begin":9485,"end":9488},"obj":"Body_part"},{"id":"T31431","span":{"begin":9589,"end":9595},"obj":"Body_part"},{"id":"T26009","span":{"begin":9713,"end":9719},"obj":"Body_part"},{"id":"T86388","span":{"begin":9724,"end":9732},"obj":"Body_part"},{"id":"T64065","span":{"begin":9767,"end":9773},"obj":"Body_part"},{"id":"T38101","span":{"begin":9778,"end":9786},"obj":"Body_part"},{"id":"T98601","span":{"begin":9856,"end":9859},"obj":"Body_part"},{"id":"T63939","span":{"begin":9895,"end":9899},"obj":"Body_part"},{"id":"T65258","span":{"begin":10026,"end":10029},"obj":"Body_part"},{"id":"T26171","span":{"begin":10085,"end":10092},"obj":"Body_part"},{"id":"T21006","span":{"begin":10120,"end":10130},"obj":"Body_part"},{"id":"T79154","span":{"begin":10327,"end":10335},"obj":"Body_part"},{"id":"T94033","span":{"begin":10369,"end":10377},"obj":"Body_part"},{"id":"T32909","span":{"begin":10394,"end":10400},"obj":"Body_part"},{"id":"T45672","span":{"begin":10469,"end":10477},"obj":"Body_part"},{"id":"T74876","span":{"begin":10534,"end":10542},"obj":"Body_part"},{"id":"T46810","span":{"begin":10612,"end":10620},"obj":"Body_part"},{"id":"T13888","span":{"begin":10665,"end":10672},"obj":"Body_part"},{"id":"T63748","span":{"begin":10708,"end":10715},"obj":"Body_part"},{"id":"T48025","span":{"begin":10781,"end":10789},"obj":"Body_part"},{"id":"T28902","span":{"begin":10822,"end":10832},"obj":"Body_part"},{"id":"T58100","span":{"begin":11012,"end":11024},"obj":"Body_part"},{"id":"T29616","span":{"begin":11025,"end":11029},"obj":"Body_part"},{"id":"T60900","span":{"begin":11094,"end":11104},"obj":"Body_part"},{"id":"T37693","span":{"begin":11382,"end":11392},"obj":"Body_part"},{"id":"T55073","span":{"begin":11534,"end":11544},"obj":"Body_part"},{"id":"T3191","span":{"begin":11565,"end":11575},"obj":"Body_part"},{"id":"T29957","span":{"begin":12480,"end":12487},"obj":"Body_part"}],"attributes":[{"id":"A61488","pred":"fma_id","subj":"T97561","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A75368","pred":"fma_id","subj":"T3215","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A60426","pred":"fma_id","subj":"T55185","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A44963","pred":"fma_id","subj":"T57154","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A65908","pred":"fma_id","subj":"T31557","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A52196","pred":"fma_id","subj":"T58680","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A94854","pred":"fma_id","subj":"T56769","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A5460","pred":"fma_id","subj":"T67784","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A98164","pred":"fma_id","subj":"T16392","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A53494","pred":"fma_id","subj":"T43992","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A76568","pred":"fma_id","subj":"T67915","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A19598","pred":"fma_id","subj":"T16102","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A69985","pred":"fma_id","subj":"T48415","obj":"http://purl.org/sig/ont/fma/fma67122"},{"id":"A44429","pred":"fma_id","subj":"T409","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A33626","pred":"fma_id","subj":"T15201","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A62310","pred":"fma_id","subj":"T17186","obj":"http://purl.org/sig/ont/fma/fma82740"},{"id":"A85895","pred":"fma_id","subj":"T4307","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A40140","pred":"fma_id","subj":"T53065","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A34445","pred":"fma_id","subj":"T20479","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A32054","pred":"fma_id","subj":"T27420","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A36531","pred":"fma_id","subj":"T61724","obj":"http://purl.org/sig/ont/fma/fma60992"},{"id":"A13979","pred":"fma_id","subj":"T16194","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A59712","pred":"fma_id","subj":"T52269","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A97258","pred":"fma_id","subj":"T90910","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A19848","pred":"fma_id","subj":"T6390","obj":"http://purl.org/sig/ont/fma/fma67858"},{"id":"A5795","pred":"fma_id","subj":"T55301","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A58590","pred":"fma_id","subj":"T95276","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A31430","pred":"fma_id","subj":"T81422","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A67130","pred":"fma_id","subj":"T95553","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A45418","pred":"fma_id","subj":"T65392","obj":"http://purl.org/sig/ont/fma/fma9637"},{"id":"A95878","pred":"fma_id","subj":"T45119","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A15109","pred":"fma_id","subj":"T63666","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A18979","pred":"fma_id","subj":"T95143","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A37937","pred":"fma_id","subj":"T95436","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A42637","pred":"fma_id","subj":"T39694","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A59769","pred":"fma_id","subj":"T21074","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A87136","pred":"fma_id","subj":"T80948","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A93295","pred":"fma_id","subj":"T68680","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A95166","pred":"fma_id","subj":"T78938","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A71745","pred":"fma_id","subj":"T89404","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A10476","pred":"fma_id","subj":"T52477","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A99669","pred":"fma_id","subj":"T55542","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A85503","pred":"fma_id","subj":"T74523","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A67681","pred":"fma_id","subj":"T18598","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A35732","pred":"fma_id","subj":"T37585","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A90479","pred":"fma_id","subj":"T68033","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A16756","pred":"fma_id","subj":"T24078","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A34514","pred":"fma_id","subj":"T42949","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A7398","pred":"fma_id","subj":"T9662","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A2641","pred":"fma_id","subj":"T92572","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A48030","pred":"fma_id","subj":"T6573","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A33927","pred":"fma_id","subj":"T81130","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A96902","pred":"fma_id","subj":"T17307","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A25365","pred":"fma_id","subj":"T3919","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A14003","pred":"fma_id","subj":"T27311","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A88575","pred":"fma_id","subj":"T30130","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A40983","pred":"fma_id","subj":"T92154","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A83356","pred":"fma_id","subj":"T13665","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A80388","pred":"fma_id","subj":"T82837","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A1172","pred":"fma_id","subj":"T92091","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A39080","pred":"fma_id","subj":"T46746","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A66045","pred":"fma_id","subj":"T31431","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A97858","pred":"fma_id","subj":"T26009","obj":"http://purl.org/sig/ont/fma/fma82764"},{"id":"A23006","pred":"fma_id","subj":"T86388","obj":"http://purl.org/sig/ont/fma/fma82763"},{"id":"A12365","pred":"fma_id","subj":"T64065","obj":"http://purl.org/sig/ont/fma/fma82764"},{"id":"A59463","pred":"fma_id","subj":"T38101","obj":"http://purl.org/sig/ont/fma/fma82763"},{"id":"A12798","pred":"fma_id","subj":"T98601","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A96844","pred":"fma_id","subj":"T63939","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A88864","pred":"fma_id","subj":"T65258","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A8617","pred":"fma_id","subj":"T26171","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A12726","pred":"fma_id","subj":"T21006","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A11108","pred":"fma_id","subj":"T79154","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A43658","pred":"fma_id","subj":"T94033","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A74554","pred":"fma_id","subj":"T32909","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A42287","pred":"fma_id","subj":"T45672","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A52495","pred":"fma_id","subj":"T74876","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A92423","pred":"fma_id","subj":"T46810","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A22357","pred":"fma_id","subj":"T13888","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A87648","pred":"fma_id","subj":"T63748","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A89545","pred":"fma_id","subj":"T48025","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A36374","pred":"fma_id","subj":"T28902","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A79520","pred":"fma_id","subj":"T58100","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A73275","pred":"fma_id","subj":"T29616","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A59699","pred":"fma_id","subj":"T60900","obj":"http://purl.org/sig/ont/fma/fma82740"},{"id":"A62534","pred":"fma_id","subj":"T37693","obj":"http://purl.org/sig/ont/fma/fma82740"},{"id":"A16879","pred":"fma_id","subj":"T55073","obj":"http://purl.org/sig/ont/fma/fma82740"},{"id":"A61817","pred":"fma_id","subj":"T3191","obj":"http://purl.org/sig/ont/fma/fma82740"},{"id":"A99683","pred":"fma_id","subj":"T29957","obj":"http://purl.org/sig/ont/fma/fma67257"}],"text":"THE VIRUS (SARS-CoV-2)\nCoronaviruses are positive-sense RNA viruses having an extensive and promiscuous range of natural hosts and affect multiple systems (23, 24). Coronaviruses can cause clinical diseases in humans that may extend from the common cold to more severe respiratory diseases like SARS and MERS (17, 279). The recently emerging SARS-CoV-2 has wrought havoc in China and caused a pandemic situation in the worldwide population, leading to disease outbreaks that have not been controlled to date, although extensive efforts are being put in place to counter this virus (25). This virus has been proposed to be designated/named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses (ICTV), which determined the virus belongs to the Severe acute respiratory syndrome-related coronavirus category and found this virus is related to SARS-CoVs (26). SARS-CoV-2 is a member of the order Nidovirales, family Coronaviridae, subfamily Orthocoronavirinae, which is subdivided into four genera, viz., Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus (3, 27). The genera Alphacoronavirus and Betacoronavirus originate from bats, while Gammacoronavirus and Deltacoronavirus have evolved from bird and swine gene pools (24, 28, 29, 275).\nCoronaviruses possess an unsegmented, single-stranded, positive-sense RNA genome of around 30 kb, enclosed by a 5′-cap and 3′-poly(A) tail (30). The genome of SARS-CoV-2 is 29,891 bp long, with a G+C content of 38% (31). These viruses are encircled with an envelope containing viral nucleocapsid. The nucleocapsids in CoVs are arranged in helical symmetry, which reflects an atypical attribute in positive-sense RNA viruses (30). The electron micrographs of SARS-CoV-2 revealed a diverging spherical outline with some degree of pleomorphism, virion diameters varying from 60 to 140 nm, and distinct spikes of 9 to 12 nm, giving the virus the appearance of a solar corona (3). The CoV genome is arranged linearly as 5′-leader-UTR-replicase-structural genes (S-E-M-N)-3′ UTR-poly(A) (32). Accessory genes, such as 3a/b, 4a/b, and the hemagglutinin-esterase gene (HE), are also seen intermingled with the structural genes (30). SARS-CoV-2 has also been found to be arranged similarly and encodes several accessory proteins, although it lacks the HE, which is characteristic of some betacoronaviruses (31). The positive-sense genome of CoVs serves as the mRNA and is translated to polyprotein 1a/1ab (pp1a/1ab) (33). A replication-transcription complex (RTC) is formed in double-membrane vesicles (DMVs) by nonstructural proteins (nsps), encoded by the polyprotein gene (34). Subsequently, the RTC synthesizes a nested set of subgenomic RNAs (sgRNAs) via discontinuous transcription (35).\nBased on molecular characterization, SARS-CoV-2 is considered a new Betacoronavirus belonging to the subgenus Sarbecovirus (3). A few other critical zoonotic viruses (MERS-related CoV and SARS-related CoV) belong to the same genus. However, SARS-CoV-2 was identified as a distinct virus based on the percent identity with other Betacoronavirus; conserved open reading frame 1a/b (ORF1a/b) is below 90% identity (3). An overall 80% nucleotide identity was observed between SARS-CoV-2 and the original SARS-CoV, along with 89% identity with ZC45 and ZXC21 SARS-related CoVs of bats (2, 31, 36). In addition, 82% identity has been observed between SARS-CoV-2 and human SARS-CoV Tor2 and human SARS-CoV BJ01 2003 (31). A sequence identity of only 51.8% was observed between MERS-related CoV and the recently emerged SARS-CoV-2 (37). Phylogenetic analysis of the structural genes also revealed that SARS-CoV-2 is closer to bat SARS-related CoV. Therefore, SARS-CoV-2 might have originated from bats, while other amplifier hosts might have played a role in disease transmission to humans (31). Of note, the other two zoonotic CoVs (MERS-related CoV and SARS-related CoV) also originated from bats (38, 39). Nevertheless, for SARS and MERS, civet cat and camels, respectively, act as amplifier hosts (40, 41).\nCoronavirus genomes and subgenomes encode six ORFs (31). The majority of the 5′ end is occupied by ORF1a/b, which produces 16 nsps. The two polyproteins, pp1a and pp1ab, are initially produced from ORF1a/b by a −1 frameshift between ORF1a and ORF1b (32). The virus-encoded proteases cleave polyproteins into individual nsps (main protease [Mpro], chymotrypsin-like protease [3CLpro], and papain-like proteases [PLPs]) (42). SARS-CoV-2 also encodes these nsps, and their functions have been elucidated recently (31). Remarkably, a difference between SARS-CoV-2 and other CoVs is the identification of a novel short putative protein within the ORF3 band, a secreted protein with an alpha helix and beta-sheet with six strands encoded by ORF8 (31).\nCoronaviruses encode four major structural proteins, namely, spike (S), membrane (M), envelope (E), and nucleocapsid (N), which are described in detail below.\n\nS Glycoprotein\nCoronavirus S protein is a large, multifunctional class I viral transmembrane protein. The size of this abundant S protein varies from 1,160 amino acids (IBV, infectious bronchitis virus, in poultry) to 1,400 amino acids (FCoV, feline coronavirus) (43). It lies in a trimer on the virion surface, giving the virion a corona or crown-like appearance. Functionally it is required for the entry of the infectious virion particles into the cell through interaction with various host cellular receptors (44).\nFurthermore, it acts as a critical factor for tissue tropism and the determination of host range (45). Notably, S protein is one of the vital immunodominant proteins of CoVs capable of inducing host immune responses (45). The ectodomains in all CoVs S proteins have similar domain organizations, divided into two subunits, S1 and S2 (43). The first one, S1, helps in host receptor binding, while the second one, S2, accounts for fusion. The former (S1) is further divided into two subdomains, namely, the N-terminal domain (NTD) and C-terminal domain (CTD). Both of these subdomains act as receptor-binding domains, interacting efficiently with various host receptors (45). The S1 CTD contains the receptor-binding motif (RBM). In each coronavirus spike protein, the trimeric S1 locates itself on top of the trimeric S2 stalk (45). Recently, structural analyses of the S proteins of COVID-19 have revealed 27 amino acid substitutions within a 1,273-amino-acid stretch (16). Six substitutions are located in the RBD (amino acids 357 to 528), while four substitutions are in the RBM at the CTD of the S1 domain (16). Of note, no amino acid change is seen in the RBM, which binds directly to the angiotensin-converting enzyme-2 (ACE2) receptor in SARS-CoV (16, 46). At present, the main emphasis is knowing how many differences would be required to change the host tropism. Sequence comparison revealed 17 nonsynonymous changes between the early sequence of SARS-CoV-2 and the later isolates of SARS-CoV. The changes were found scattered over the genome of the virus, with nine substitutions in ORF1ab, ORF8 (4 substitutions), the spike gene (3 substitutions), and ORF7a (single substitution) (4). Notably, the same nonsynonymous changes were found in a familial cluster, indicating that the viral evolution happened during person-to-person transmission (4, 47). Such adaptive evolution events are frequent and constitute a constantly ongoing process once the virus spreads among new hosts (47). Even though no functional changes occur in the virus associated with this adaptive evolution, close monitoring of the viral mutations that occur during subsequent human-to-human transmission is warranted.\n\nM Protein\nThe M protein is the most abundant viral protein present in the virion particle, giving a definite shape to the viral envelope (48). It binds to the nucleocapsid and acts as a central organizer of coronavirus assembly (49). Coronavirus M proteins are highly diverse in amino acid contents but maintain overall structural similarity within different genera (50). The M protein has three transmembrane domains, flanked by a short amino terminus outside the virion and a long carboxy terminus inside the virion (50). Overall, the viral scaffold is maintained by M-M interaction. Of note, the M protein of SARS-CoV-2 does not have an amino acid substitution compared to that of SARS-CoV (16).\n\nE Protein\nThe coronavirus E protein is the most enigmatic and smallest of the major structural proteins (51). It plays a multifunctional role in the pathogenesis, assembly, and release of the virus (52). It is a small integral membrane polypeptide that acts as a viroporin (ion channel) (53). The inactivation or absence of this protein is related to the altered virulence of coronaviruses due to changes in morphology and tropism (54). The E protein consists of three domains, namely, a short hydrophilic amino terminal, a large hydrophobic transmembrane domain, and an efficient C-terminal domain (51). The SARS-CoV-2 E protein reveals a similar amino acid constitution without any substitution (16).\n\nN Protein\nThe N protein of coronavirus is multipurpose. Among several functions, it plays a role in complex formation with the viral genome, facilitates M protein interaction needed during virion assembly, and enhances the transcription efficiency of the virus (55, 56). It contains three highly conserved and distinct domains, namely, an NTD, an RNA-binding domain or a linker region (LKR), and a CTD (57). The NTD binds with the 3′ end of the viral genome, perhaps via electrostatic interactions, and is highly diverged both in length and sequence (58). The charged LKR is serine and arginine rich and is also known as the SR (serine and arginine) domain (59). The LKR is capable of direct interaction with in vitro RNA interaction and is responsible for cell signaling (60, 61). It also modulates the antiviral response of the host by working as an antagonist for interferon (IFN) and RNA interference (62). Compared to that of SARS-CoV, the N protein of SARS-CoV-2 possess five amino acid mutations, where two are in the intrinsically dispersed region (IDR; positions 25 and 26), one each in the NTD (position 103), LKR (position 217), and CTD (position 334) (16).\n\nnsps and Accessory Proteins\nBesides the important structural proteins, the SARS-CoV-2 genome contains 15 nsps, nsp1 to nsp10 and nsp12 to nsp16, and 8 accessory proteins (3a, 3b, p6, 7a, 7b, 8b, 9b, and ORF14) (16). All these proteins play a specific role in viral replication (27). Unlike the accessory proteins of SARS-CoV, SARS-CoV-2 does not contain 8a protein and has a longer 8b and shorter 3b protein (16). The nsp7, nsp13, envelope, matrix, and p6 and 8b accessory proteins have not been detected with any amino acid substitutions compared to the sequences of other coronaviruses (16).\nThe virus structure of SARS-CoV-2 is depicted in Fig. 2.\nFIG 2 SARS-CoV-2 virus structure.\n\nSARS-CoV-2 Spike Glycoprotein Gene Analysis\n\nSequence percent similarity analysis.\nWe assessed the nucleotide percent similarity using the MegAlign software program, where the similarity between the novel SARS-CoV-2 isolates was in the range of 99.4% to 100%. Among the other Serbecovirus CoV sequences, the novel SARS-CoV-2 sequences revealed the highest similarity to bat-SL-CoV, with nucleotide percent identity ranges between 88.12 and 89.65%. Meanwhile, earlier reported SARS-CoVs showed 70.6 to 74.9% similarity to SARS-CoV-2 at the nucleotide level. Further, the nucleotide percent similarity was 55.4%, 45.5% to 47.9%, 46.2% to 46.6%, and 45.0% to 46.3% to the other four subgenera, namely, Hibecovirus, Nobecovirus, Merbecovirus, and Embecovirus, respectively. The percent similarity index of current outbreak isolates indicates a close relationship between SARS-CoV-2 isolates and bat-SL-CoV, indicating a common origin. However, particular pieces of evidence based on further complete genomic analysis of current isolates are necessary to draw any conclusions, although it was ascertained that the current novel SARS-CoV-2 isolates belong to the subgenus Sarbecovirus in the diverse range of betacoronaviruses. Their possible ancestor was hypothesized to be from bat CoV strains, wherein bats might have played a crucial role in harboring this class of viruses.\n\nSplitsTree phylogeny analysis.\nIn the unrooted phylogenetic tree of different betacoronaviruses based on the S protein, virus sequences from different subgenera grouped into separate clusters. SARS-CoV-2 sequences from Wuhan and other countries exhibited a close relationship and appeared in a single cluster (Fig. 1). The CoVs from the subgenus Sarbecovirus appeared jointly in SplitsTree and divided into three subclusters, namely, SARS-CoV-2, bat-SARS-like-CoV (bat-SL-CoV), and SARS-CoV (Fig. 1). In the case of other subgenera, like Merbecovirus, all of the sequences grouped in a single cluster, whereas in Embecovirus, different species, comprised of canine respiratory CoVs, bovine CoVs, equine CoVs, and human CoV strain (OC43), grouped in a common cluster. Isolates in the subgenera Nobecovorus and Hibecovirus were found to be placed separately away from other reported SARS-CoVs but shared a bat origin."}

    LitCovid-PD-UBERON

    {"project":"LitCovid-PD-UBERON","denotations":[{"id":"T6","span":{"begin":1456,"end":1460},"obj":"Body_part"},{"id":"T7","span":{"begin":4796,"end":4801},"obj":"Body_part"},{"id":"T8","span":{"begin":5581,"end":5587},"obj":"Body_part"}],"attributes":[{"id":"A6","pred":"uberon_id","subj":"T6","obj":"http://purl.obolibrary.org/obo/UBERON_0002415"},{"id":"A7","pred":"uberon_id","subj":"T7","obj":"http://purl.obolibrary.org/obo/UBERON_0002488"},{"id":"A8","pred":"uberon_id","subj":"T8","obj":"http://purl.obolibrary.org/obo/UBERON_0000479"}],"text":"THE VIRUS (SARS-CoV-2)\nCoronaviruses are positive-sense RNA viruses having an extensive and promiscuous range of natural hosts and affect multiple systems (23, 24). Coronaviruses can cause clinical diseases in humans that may extend from the common cold to more severe respiratory diseases like SARS and MERS (17, 279). The recently emerging SARS-CoV-2 has wrought havoc in China and caused a pandemic situation in the worldwide population, leading to disease outbreaks that have not been controlled to date, although extensive efforts are being put in place to counter this virus (25). This virus has been proposed to be designated/named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses (ICTV), which determined the virus belongs to the Severe acute respiratory syndrome-related coronavirus category and found this virus is related to SARS-CoVs (26). SARS-CoV-2 is a member of the order Nidovirales, family Coronaviridae, subfamily Orthocoronavirinae, which is subdivided into four genera, viz., Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus (3, 27). The genera Alphacoronavirus and Betacoronavirus originate from bats, while Gammacoronavirus and Deltacoronavirus have evolved from bird and swine gene pools (24, 28, 29, 275).\nCoronaviruses possess an unsegmented, single-stranded, positive-sense RNA genome of around 30 kb, enclosed by a 5′-cap and 3′-poly(A) tail (30). The genome of SARS-CoV-2 is 29,891 bp long, with a G+C content of 38% (31). These viruses are encircled with an envelope containing viral nucleocapsid. The nucleocapsids in CoVs are arranged in helical symmetry, which reflects an atypical attribute in positive-sense RNA viruses (30). The electron micrographs of SARS-CoV-2 revealed a diverging spherical outline with some degree of pleomorphism, virion diameters varying from 60 to 140 nm, and distinct spikes of 9 to 12 nm, giving the virus the appearance of a solar corona (3). The CoV genome is arranged linearly as 5′-leader-UTR-replicase-structural genes (S-E-M-N)-3′ UTR-poly(A) (32). Accessory genes, such as 3a/b, 4a/b, and the hemagglutinin-esterase gene (HE), are also seen intermingled with the structural genes (30). SARS-CoV-2 has also been found to be arranged similarly and encodes several accessory proteins, although it lacks the HE, which is characteristic of some betacoronaviruses (31). The positive-sense genome of CoVs serves as the mRNA and is translated to polyprotein 1a/1ab (pp1a/1ab) (33). A replication-transcription complex (RTC) is formed in double-membrane vesicles (DMVs) by nonstructural proteins (nsps), encoded by the polyprotein gene (34). Subsequently, the RTC synthesizes a nested set of subgenomic RNAs (sgRNAs) via discontinuous transcription (35).\nBased on molecular characterization, SARS-CoV-2 is considered a new Betacoronavirus belonging to the subgenus Sarbecovirus (3). A few other critical zoonotic viruses (MERS-related CoV and SARS-related CoV) belong to the same genus. However, SARS-CoV-2 was identified as a distinct virus based on the percent identity with other Betacoronavirus; conserved open reading frame 1a/b (ORF1a/b) is below 90% identity (3). An overall 80% nucleotide identity was observed between SARS-CoV-2 and the original SARS-CoV, along with 89% identity with ZC45 and ZXC21 SARS-related CoVs of bats (2, 31, 36). In addition, 82% identity has been observed between SARS-CoV-2 and human SARS-CoV Tor2 and human SARS-CoV BJ01 2003 (31). A sequence identity of only 51.8% was observed between MERS-related CoV and the recently emerged SARS-CoV-2 (37). Phylogenetic analysis of the structural genes also revealed that SARS-CoV-2 is closer to bat SARS-related CoV. Therefore, SARS-CoV-2 might have originated from bats, while other amplifier hosts might have played a role in disease transmission to humans (31). Of note, the other two zoonotic CoVs (MERS-related CoV and SARS-related CoV) also originated from bats (38, 39). Nevertheless, for SARS and MERS, civet cat and camels, respectively, act as amplifier hosts (40, 41).\nCoronavirus genomes and subgenomes encode six ORFs (31). The majority of the 5′ end is occupied by ORF1a/b, which produces 16 nsps. The two polyproteins, pp1a and pp1ab, are initially produced from ORF1a/b by a −1 frameshift between ORF1a and ORF1b (32). The virus-encoded proteases cleave polyproteins into individual nsps (main protease [Mpro], chymotrypsin-like protease [3CLpro], and papain-like proteases [PLPs]) (42). SARS-CoV-2 also encodes these nsps, and their functions have been elucidated recently (31). Remarkably, a difference between SARS-CoV-2 and other CoVs is the identification of a novel short putative protein within the ORF3 band, a secreted protein with an alpha helix and beta-sheet with six strands encoded by ORF8 (31).\nCoronaviruses encode four major structural proteins, namely, spike (S), membrane (M), envelope (E), and nucleocapsid (N), which are described in detail below.\n\nS Glycoprotein\nCoronavirus S protein is a large, multifunctional class I viral transmembrane protein. The size of this abundant S protein varies from 1,160 amino acids (IBV, infectious bronchitis virus, in poultry) to 1,400 amino acids (FCoV, feline coronavirus) (43). It lies in a trimer on the virion surface, giving the virion a corona or crown-like appearance. Functionally it is required for the entry of the infectious virion particles into the cell through interaction with various host cellular receptors (44).\nFurthermore, it acts as a critical factor for tissue tropism and the determination of host range (45). Notably, S protein is one of the vital immunodominant proteins of CoVs capable of inducing host immune responses (45). The ectodomains in all CoVs S proteins have similar domain organizations, divided into two subunits, S1 and S2 (43). The first one, S1, helps in host receptor binding, while the second one, S2, accounts for fusion. The former (S1) is further divided into two subdomains, namely, the N-terminal domain (NTD) and C-terminal domain (CTD). Both of these subdomains act as receptor-binding domains, interacting efficiently with various host receptors (45). The S1 CTD contains the receptor-binding motif (RBM). In each coronavirus spike protein, the trimeric S1 locates itself on top of the trimeric S2 stalk (45). Recently, structural analyses of the S proteins of COVID-19 have revealed 27 amino acid substitutions within a 1,273-amino-acid stretch (16). Six substitutions are located in the RBD (amino acids 357 to 528), while four substitutions are in the RBM at the CTD of the S1 domain (16). Of note, no amino acid change is seen in the RBM, which binds directly to the angiotensin-converting enzyme-2 (ACE2) receptor in SARS-CoV (16, 46). At present, the main emphasis is knowing how many differences would be required to change the host tropism. Sequence comparison revealed 17 nonsynonymous changes between the early sequence of SARS-CoV-2 and the later isolates of SARS-CoV. The changes were found scattered over the genome of the virus, with nine substitutions in ORF1ab, ORF8 (4 substitutions), the spike gene (3 substitutions), and ORF7a (single substitution) (4). Notably, the same nonsynonymous changes were found in a familial cluster, indicating that the viral evolution happened during person-to-person transmission (4, 47). Such adaptive evolution events are frequent and constitute a constantly ongoing process once the virus spreads among new hosts (47). Even though no functional changes occur in the virus associated with this adaptive evolution, close monitoring of the viral mutations that occur during subsequent human-to-human transmission is warranted.\n\nM Protein\nThe M protein is the most abundant viral protein present in the virion particle, giving a definite shape to the viral envelope (48). It binds to the nucleocapsid and acts as a central organizer of coronavirus assembly (49). Coronavirus M proteins are highly diverse in amino acid contents but maintain overall structural similarity within different genera (50). The M protein has three transmembrane domains, flanked by a short amino terminus outside the virion and a long carboxy terminus inside the virion (50). Overall, the viral scaffold is maintained by M-M interaction. Of note, the M protein of SARS-CoV-2 does not have an amino acid substitution compared to that of SARS-CoV (16).\n\nE Protein\nThe coronavirus E protein is the most enigmatic and smallest of the major structural proteins (51). It plays a multifunctional role in the pathogenesis, assembly, and release of the virus (52). It is a small integral membrane polypeptide that acts as a viroporin (ion channel) (53). The inactivation or absence of this protein is related to the altered virulence of coronaviruses due to changes in morphology and tropism (54). The E protein consists of three domains, namely, a short hydrophilic amino terminal, a large hydrophobic transmembrane domain, and an efficient C-terminal domain (51). The SARS-CoV-2 E protein reveals a similar amino acid constitution without any substitution (16).\n\nN Protein\nThe N protein of coronavirus is multipurpose. Among several functions, it plays a role in complex formation with the viral genome, facilitates M protein interaction needed during virion assembly, and enhances the transcription efficiency of the virus (55, 56). It contains three highly conserved and distinct domains, namely, an NTD, an RNA-binding domain or a linker region (LKR), and a CTD (57). The NTD binds with the 3′ end of the viral genome, perhaps via electrostatic interactions, and is highly diverged both in length and sequence (58). The charged LKR is serine and arginine rich and is also known as the SR (serine and arginine) domain (59). The LKR is capable of direct interaction with in vitro RNA interaction and is responsible for cell signaling (60, 61). It also modulates the antiviral response of the host by working as an antagonist for interferon (IFN) and RNA interference (62). Compared to that of SARS-CoV, the N protein of SARS-CoV-2 possess five amino acid mutations, where two are in the intrinsically dispersed region (IDR; positions 25 and 26), one each in the NTD (position 103), LKR (position 217), and CTD (position 334) (16).\n\nnsps and Accessory Proteins\nBesides the important structural proteins, the SARS-CoV-2 genome contains 15 nsps, nsp1 to nsp10 and nsp12 to nsp16, and 8 accessory proteins (3a, 3b, p6, 7a, 7b, 8b, 9b, and ORF14) (16). All these proteins play a specific role in viral replication (27). Unlike the accessory proteins of SARS-CoV, SARS-CoV-2 does not contain 8a protein and has a longer 8b and shorter 3b protein (16). The nsp7, nsp13, envelope, matrix, and p6 and 8b accessory proteins have not been detected with any amino acid substitutions compared to the sequences of other coronaviruses (16).\nThe virus structure of SARS-CoV-2 is depicted in Fig. 2.\nFIG 2 SARS-CoV-2 virus structure.\n\nSARS-CoV-2 Spike Glycoprotein Gene Analysis\n\nSequence percent similarity analysis.\nWe assessed the nucleotide percent similarity using the MegAlign software program, where the similarity between the novel SARS-CoV-2 isolates was in the range of 99.4% to 100%. Among the other Serbecovirus CoV sequences, the novel SARS-CoV-2 sequences revealed the highest similarity to bat-SL-CoV, with nucleotide percent identity ranges between 88.12 and 89.65%. Meanwhile, earlier reported SARS-CoVs showed 70.6 to 74.9% similarity to SARS-CoV-2 at the nucleotide level. Further, the nucleotide percent similarity was 55.4%, 45.5% to 47.9%, 46.2% to 46.6%, and 45.0% to 46.3% to the other four subgenera, namely, Hibecovirus, Nobecovirus, Merbecovirus, and Embecovirus, respectively. The percent similarity index of current outbreak isolates indicates a close relationship between SARS-CoV-2 isolates and bat-SL-CoV, indicating a common origin. However, particular pieces of evidence based on further complete genomic analysis of current isolates are necessary to draw any conclusions, although it was ascertained that the current novel SARS-CoV-2 isolates belong to the subgenus Sarbecovirus in the diverse range of betacoronaviruses. Their possible ancestor was hypothesized to be from bat CoV strains, wherein bats might have played a crucial role in harboring this class of viruses.\n\nSplitsTree phylogeny analysis.\nIn the unrooted phylogenetic tree of different betacoronaviruses based on the S protein, virus sequences from different subgenera grouped into separate clusters. SARS-CoV-2 sequences from Wuhan and other countries exhibited a close relationship and appeared in a single cluster (Fig. 1). The CoVs from the subgenus Sarbecovirus appeared jointly in SplitsTree and divided into three subclusters, namely, SARS-CoV-2, bat-SARS-like-CoV (bat-SL-CoV), and SARS-CoV (Fig. 1). In the case of other subgenera, like Merbecovirus, all of the sequences grouped in a single cluster, whereas in Embecovirus, different species, comprised of canine respiratory CoVs, bovine CoVs, equine CoVs, and human CoV strain (OC43), grouped in a common cluster. Isolates in the subgenera Nobecovorus and Hibecovirus were found to be placed separately away from other reported SARS-CoVs but shared a bat origin."}

    LitCovid-PD-MONDO

    {"project":"LitCovid-PD-MONDO","denotations":[{"id":"T98","span":{"begin":11,"end":19},"obj":"Disease"},{"id":"T99","span":{"begin":11,"end":15},"obj":"Disease"},{"id":"T100","span":{"begin":242,"end":253},"obj":"Disease"},{"id":"T101","span":{"begin":269,"end":289},"obj":"Disease"},{"id":"T102","span":{"begin":295,"end":299},"obj":"Disease"},{"id":"T103","span":{"begin":342,"end":350},"obj":"Disease"},{"id":"T104","span":{"begin":342,"end":346},"obj":"Disease"},{"id":"T105","span":{"begin":639,"end":686},"obj":"Disease"},{"id":"T106","span":{"begin":639,"end":672},"obj":"Disease"},{"id":"T107","span":{"begin":688,"end":696},"obj":"Disease"},{"id":"T108","span":{"begin":688,"end":692},"obj":"Disease"},{"id":"T109","span":{"begin":804,"end":837},"obj":"Disease"},{"id":"T110","span":{"begin":902,"end":906},"obj":"Disease"},{"id":"T111","span":{"begin":918,"end":926},"obj":"Disease"},{"id":"T112","span":{"begin":918,"end":922},"obj":"Disease"},{"id":"T113","span":{"begin":1481,"end":1489},"obj":"Disease"},{"id":"T114","span":{"begin":1481,"end":1485},"obj":"Disease"},{"id":"T115","span":{"begin":1780,"end":1788},"obj":"Disease"},{"id":"T116","span":{"begin":1780,"end":1784},"obj":"Disease"},{"id":"T117","span":{"begin":2247,"end":2255},"obj":"Disease"},{"id":"T118","span":{"begin":2247,"end":2251},"obj":"Disease"},{"id":"T119","span":{"begin":2844,"end":2852},"obj":"Disease"},{"id":"T120","span":{"begin":2844,"end":2848},"obj":"Disease"},{"id":"T121","span":{"begin":2995,"end":2999},"obj":"Disease"},{"id":"T122","span":{"begin":3048,"end":3056},"obj":"Disease"},{"id":"T123","span":{"begin":3048,"end":3052},"obj":"Disease"},{"id":"T124","span":{"begin":3279,"end":3287},"obj":"Disease"},{"id":"T125","span":{"begin":3279,"end":3283},"obj":"Disease"},{"id":"T126","span":{"begin":3307,"end":3315},"obj":"Disease"},{"id":"T127","span":{"begin":3307,"end":3311},"obj":"Disease"},{"id":"T128","span":{"begin":3361,"end":3365},"obj":"Disease"},{"id":"T129","span":{"begin":3452,"end":3460},"obj":"Disease"},{"id":"T130","span":{"begin":3452,"end":3456},"obj":"Disease"},{"id":"T131","span":{"begin":3473,"end":3481},"obj":"Disease"},{"id":"T132","span":{"begin":3473,"end":3477},"obj":"Disease"},{"id":"T133","span":{"begin":3497,"end":3505},"obj":"Disease"},{"id":"T134","span":{"begin":3497,"end":3501},"obj":"Disease"},{"id":"T135","span":{"begin":3619,"end":3627},"obj":"Disease"},{"id":"T136","span":{"begin":3619,"end":3623},"obj":"Disease"},{"id":"T137","span":{"begin":3701,"end":3709},"obj":"Disease"},{"id":"T138","span":{"begin":3701,"end":3705},"obj":"Disease"},{"id":"T139","span":{"begin":3729,"end":3733},"obj":"Disease"},{"id":"T140","span":{"begin":3758,"end":3766},"obj":"Disease"},{"id":"T141","span":{"begin":3758,"end":3762},"obj":"Disease"},{"id":"T142","span":{"begin":3954,"end":3958},"obj":"Disease"},{"id":"T143","span":{"begin":4026,"end":4030},"obj":"Disease"},{"id":"T144","span":{"begin":4534,"end":4542},"obj":"Disease"},{"id":"T145","span":{"begin":4534,"end":4538},"obj":"Disease"},{"id":"T146","span":{"begin":4659,"end":4667},"obj":"Disease"},{"id":"T147","span":{"begin":4659,"end":4663},"obj":"Disease"},{"id":"T148","span":{"begin":5190,"end":5200},"obj":"Disease"},{"id":"T149","span":{"begin":5201,"end":5211},"obj":"Disease"},{"id":"T150","span":{"begin":5430,"end":5440},"obj":"Disease"},{"id":"T151","span":{"begin":6059,"end":6062},"obj":"Disease"},{"id":"T153","span":{"begin":6418,"end":6426},"obj":"Disease"},{"id":"T154","span":{"begin":6779,"end":6787},"obj":"Disease"},{"id":"T155","span":{"begin":6779,"end":6783},"obj":"Disease"},{"id":"T156","span":{"begin":6990,"end":6998},"obj":"Disease"},{"id":"T157","span":{"begin":6990,"end":6994},"obj":"Disease"},{"id":"T158","span":{"begin":7027,"end":7035},"obj":"Disease"},{"id":"T159","span":{"begin":7027,"end":7031},"obj":"Disease"},{"id":"T160","span":{"begin":8346,"end":8354},"obj":"Disease"},{"id":"T161","span":{"begin":8346,"end":8350},"obj":"Disease"},{"id":"T162","span":{"begin":8418,"end":8426},"obj":"Disease"},{"id":"T163","span":{"begin":8418,"end":8422},"obj":"Disease"},{"id":"T164","span":{"begin":9043,"end":9051},"obj":"Disease"},{"id":"T165","span":{"begin":9043,"end":9047},"obj":"Disease"},{"id":"T166","span":{"begin":9477,"end":9480},"obj":"Disease"},{"id":"T168","span":{"begin":9550,"end":9553},"obj":"Disease"},{"id":"T170","span":{"begin":10069,"end":10077},"obj":"Disease"},{"id":"T171","span":{"begin":10069,"end":10073},"obj":"Disease"},{"id":"T172","span":{"begin":10096,"end":10104},"obj":"Disease"},{"id":"T173","span":{"begin":10096,"end":10100},"obj":"Disease"},{"id":"T174","span":{"begin":10238,"end":10241},"obj":"Disease"},{"id":"T176","span":{"begin":10383,"end":10391},"obj":"Disease"},{"id":"T177","span":{"begin":10383,"end":10387},"obj":"Disease"},{"id":"T178","span":{"begin":10624,"end":10632},"obj":"Disease"},{"id":"T179","span":{"begin":10624,"end":10628},"obj":"Disease"},{"id":"T180","span":{"begin":10634,"end":10642},"obj":"Disease"},{"id":"T181","span":{"begin":10634,"end":10638},"obj":"Disease"},{"id":"T182","span":{"begin":10925,"end":10933},"obj":"Disease"},{"id":"T183","span":{"begin":10925,"end":10929},"obj":"Disease"},{"id":"T184","span":{"begin":10966,"end":10974},"obj":"Disease"},{"id":"T185","span":{"begin":10966,"end":10970},"obj":"Disease"},{"id":"T186","span":{"begin":10995,"end":11003},"obj":"Disease"},{"id":"T187","span":{"begin":10995,"end":10999},"obj":"Disease"},{"id":"T188","span":{"begin":11200,"end":11208},"obj":"Disease"},{"id":"T189","span":{"begin":11200,"end":11204},"obj":"Disease"},{"id":"T190","span":{"begin":11309,"end":11317},"obj":"Disease"},{"id":"T191","span":{"begin":11309,"end":11313},"obj":"Disease"},{"id":"T192","span":{"begin":11471,"end":11475},"obj":"Disease"},{"id":"T193","span":{"begin":11516,"end":11524},"obj":"Disease"},{"id":"T194","span":{"begin":11516,"end":11520},"obj":"Disease"},{"id":"T195","span":{"begin":11862,"end":11870},"obj":"Disease"},{"id":"T196","span":{"begin":11862,"end":11866},"obj":"Disease"},{"id":"T197","span":{"begin":12118,"end":12126},"obj":"Disease"},{"id":"T198","span":{"begin":12118,"end":12122},"obj":"Disease"},{"id":"T199","span":{"begin":12562,"end":12570},"obj":"Disease"},{"id":"T200","span":{"begin":12562,"end":12566},"obj":"Disease"},{"id":"T201","span":{"begin":12803,"end":12811},"obj":"Disease"},{"id":"T202","span":{"begin":12803,"end":12807},"obj":"Disease"},{"id":"T203","span":{"begin":12819,"end":12823},"obj":"Disease"},{"id":"T204","span":{"begin":12851,"end":12859},"obj":"Disease"},{"id":"T205","span":{"begin":12851,"end":12855},"obj":"Disease"},{"id":"T206","span":{"begin":13250,"end":13254},"obj":"Disease"}],"attributes":[{"id":"A98","pred":"mondo_id","subj":"T98","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A99","pred":"mondo_id","subj":"T99","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A100","pred":"mondo_id","subj":"T100","obj":"http://purl.obolibrary.org/obo/MONDO_0005709"},{"id":"A101","pred":"mondo_id","subj":"T101","obj":"http://purl.obolibrary.org/obo/MONDO_0005087"},{"id":"A102","pred":"mondo_id","subj":"T102","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A103","pred":"mondo_id","subj":"T103","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A104","pred":"mondo_id","subj":"T104","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A105","pred":"mondo_id","subj":"T105","obj":"http://purl.obolibrary.org/obo/MONDO_0100096"},{"id":"A106","pred":"mondo_id","subj":"T106","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A107","pred":"mondo_id","subj":"T107","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A108","pred":"mondo_id","subj":"T108","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A109","pred":"mondo_id","subj":"T109","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A110","pred":"mondo_id","subj":"T110","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A111","pred":"mondo_id","subj":"T111","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A112","pred":"mondo_id","subj":"T112","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A113","pred":"mondo_id","subj":"T113","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A114","pred":"mondo_id","subj":"T114","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A115","pred":"mondo_id","subj":"T115","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A116","pred":"mondo_id","subj":"T116","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A117","pred":"mondo_id","subj":"T117","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A118","pred":"mondo_id","subj":"T118","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A119","pred":"mondo_id","subj":"T119","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A120","pred":"mondo_id","subj":"T120","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A121","pred":"mondo_id","subj":"T121","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A122","pred":"mondo_id","subj":"T122","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A123","pred":"mondo_id","subj":"T123","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A124","pred":"mondo_id","subj":"T124","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A125","pred":"mondo_id","subj":"T125","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A126","pred":"mondo_id","subj":"T126","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A127","pred":"mondo_id","subj":"T127","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A128","pred":"mondo_id","subj":"T128","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A129","pred":"mondo_id","subj":"T129","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A130","pred":"mondo_id","subj":"T130","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A131","pred":"mondo_id","subj":"T131","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A132","pred":"mondo_id","subj":"T132","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A133","pred":"mondo_id","subj":"T133","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A134","pred":"mondo_id","subj":"T134","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A135","pred":"mondo_id","subj":"T135","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A136","pred":"mondo_id","subj":"T136","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A137","pred":"mondo_id","subj":"T137","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A138","pred":"mondo_id","subj":"T138","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A139","pred":"mondo_id","subj":"T139","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A140","pred":"mondo_id","subj":"T140","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A141","pred":"mondo_id","subj":"T141","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A142","pred":"mondo_id","subj":"T142","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A143","pred":"mondo_id","subj":"T143","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A144","pred":"mondo_id","subj":"T144","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A145","pred":"mondo_id","subj":"T145","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A146","pred":"mondo_id","subj":"T146","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A147","pred":"mondo_id","subj":"T147","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A148","pred":"mondo_id","subj":"T148","obj":"http://purl.obolibrary.org/obo/MONDO_0005550"},{"id":"A149","pred":"mondo_id","subj":"T149","obj":"http://purl.obolibrary.org/obo/MONDO_0003781"},{"id":"A150","pred":"mondo_id","subj":"T150","obj":"http://purl.obolibrary.org/obo/MONDO_0005550"},{"id":"A151","pred":"mondo_id","subj":"T151","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A152","pred":"mondo_id","subj":"T151","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A153","pred":"mondo_id","subj":"T153","obj":"http://purl.obolibrary.org/obo/MONDO_0100096"},{"id":"A154","pred":"mondo_id","subj":"T154","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A155","pred":"mondo_id","subj":"T155","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A156","pred":"mondo_id","subj":"T156","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A157","pred":"mondo_id","subj":"T157","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A158","pred":"mondo_id","subj":"T158","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A159","pred":"mondo_id","subj":"T159","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A160","pred":"mondo_id","subj":"T160","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A161","pred":"mondo_id","subj":"T161","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A162","pred":"mondo_id","subj":"T162","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A163","pred":"mondo_id","subj":"T163","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A164","pred":"mondo_id","subj":"T164","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A165","pred":"mondo_id","subj":"T165","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A166","pred":"mondo_id","subj":"T166","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A167","pred":"mondo_id","subj":"T166","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A168","pred":"mondo_id","subj":"T168","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A169","pred":"mondo_id","subj":"T168","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A170","pred":"mondo_id","subj":"T170","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A171","pred":"mondo_id","subj":"T171","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A172","pred":"mondo_id","subj":"T172","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A173","pred":"mondo_id","subj":"T173","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A174","pred":"mondo_id","subj":"T174","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A175","pred":"mondo_id","subj":"T174","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A176","pred":"mondo_id","subj":"T176","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A177","pred":"mondo_id","subj":"T177","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A178","pred":"mondo_id","subj":"T178","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A179","pred":"mondo_id","subj":"T179","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A180","pred":"mondo_id","subj":"T180","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A181","pred":"mondo_id","subj":"T181","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A182","pred":"mondo_id","subj":"T182","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A183","pred":"mondo_id","subj":"T183","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A184","pred":"mondo_id","subj":"T184","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A185","pred":"mondo_id","subj":"T185","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A186","pred":"mondo_id","subj":"T186","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A187","pred":"mondo_id","subj":"T187","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A188","pred":"mondo_id","subj":"T188","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A189","pred":"mondo_id","subj":"T189","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A190","pred":"mondo_id","subj":"T190","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A191","pred":"mondo_id","subj":"T191","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A192","pred":"mondo_id","subj":"T192","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A193","pred":"mondo_id","subj":"T193","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A194","pred":"mondo_id","subj":"T194","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A195","pred":"mondo_id","subj":"T195","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A196","pred":"mondo_id","subj":"T196","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A197","pred":"mondo_id","subj":"T197","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A198","pred":"mondo_id","subj":"T198","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A199","pred":"mondo_id","subj":"T199","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A200","pred":"mondo_id","subj":"T200","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A201","pred":"mondo_id","subj":"T201","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A202","pred":"mondo_id","subj":"T202","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A203","pred":"mondo_id","subj":"T203","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A204","pred":"mondo_id","subj":"T204","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A205","pred":"mondo_id","subj":"T205","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A206","pred":"mondo_id","subj":"T206","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"}],"text":"THE VIRUS (SARS-CoV-2)\nCoronaviruses are positive-sense RNA viruses having an extensive and promiscuous range of natural hosts and affect multiple systems (23, 24). Coronaviruses can cause clinical diseases in humans that may extend from the common cold to more severe respiratory diseases like SARS and MERS (17, 279). The recently emerging SARS-CoV-2 has wrought havoc in China and caused a pandemic situation in the worldwide population, leading to disease outbreaks that have not been controlled to date, although extensive efforts are being put in place to counter this virus (25). This virus has been proposed to be designated/named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses (ICTV), which determined the virus belongs to the Severe acute respiratory syndrome-related coronavirus category and found this virus is related to SARS-CoVs (26). SARS-CoV-2 is a member of the order Nidovirales, family Coronaviridae, subfamily Orthocoronavirinae, which is subdivided into four genera, viz., Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus (3, 27). The genera Alphacoronavirus and Betacoronavirus originate from bats, while Gammacoronavirus and Deltacoronavirus have evolved from bird and swine gene pools (24, 28, 29, 275).\nCoronaviruses possess an unsegmented, single-stranded, positive-sense RNA genome of around 30 kb, enclosed by a 5′-cap and 3′-poly(A) tail (30). The genome of SARS-CoV-2 is 29,891 bp long, with a G+C content of 38% (31). These viruses are encircled with an envelope containing viral nucleocapsid. The nucleocapsids in CoVs are arranged in helical symmetry, which reflects an atypical attribute in positive-sense RNA viruses (30). The electron micrographs of SARS-CoV-2 revealed a diverging spherical outline with some degree of pleomorphism, virion diameters varying from 60 to 140 nm, and distinct spikes of 9 to 12 nm, giving the virus the appearance of a solar corona (3). The CoV genome is arranged linearly as 5′-leader-UTR-replicase-structural genes (S-E-M-N)-3′ UTR-poly(A) (32). Accessory genes, such as 3a/b, 4a/b, and the hemagglutinin-esterase gene (HE), are also seen intermingled with the structural genes (30). SARS-CoV-2 has also been found to be arranged similarly and encodes several accessory proteins, although it lacks the HE, which is characteristic of some betacoronaviruses (31). The positive-sense genome of CoVs serves as the mRNA and is translated to polyprotein 1a/1ab (pp1a/1ab) (33). A replication-transcription complex (RTC) is formed in double-membrane vesicles (DMVs) by nonstructural proteins (nsps), encoded by the polyprotein gene (34). Subsequently, the RTC synthesizes a nested set of subgenomic RNAs (sgRNAs) via discontinuous transcription (35).\nBased on molecular characterization, SARS-CoV-2 is considered a new Betacoronavirus belonging to the subgenus Sarbecovirus (3). A few other critical zoonotic viruses (MERS-related CoV and SARS-related CoV) belong to the same genus. However, SARS-CoV-2 was identified as a distinct virus based on the percent identity with other Betacoronavirus; conserved open reading frame 1a/b (ORF1a/b) is below 90% identity (3). An overall 80% nucleotide identity was observed between SARS-CoV-2 and the original SARS-CoV, along with 89% identity with ZC45 and ZXC21 SARS-related CoVs of bats (2, 31, 36). In addition, 82% identity has been observed between SARS-CoV-2 and human SARS-CoV Tor2 and human SARS-CoV BJ01 2003 (31). A sequence identity of only 51.8% was observed between MERS-related CoV and the recently emerged SARS-CoV-2 (37). Phylogenetic analysis of the structural genes also revealed that SARS-CoV-2 is closer to bat SARS-related CoV. Therefore, SARS-CoV-2 might have originated from bats, while other amplifier hosts might have played a role in disease transmission to humans (31). Of note, the other two zoonotic CoVs (MERS-related CoV and SARS-related CoV) also originated from bats (38, 39). Nevertheless, for SARS and MERS, civet cat and camels, respectively, act as amplifier hosts (40, 41).\nCoronavirus genomes and subgenomes encode six ORFs (31). The majority of the 5′ end is occupied by ORF1a/b, which produces 16 nsps. The two polyproteins, pp1a and pp1ab, are initially produced from ORF1a/b by a −1 frameshift between ORF1a and ORF1b (32). The virus-encoded proteases cleave polyproteins into individual nsps (main protease [Mpro], chymotrypsin-like protease [3CLpro], and papain-like proteases [PLPs]) (42). SARS-CoV-2 also encodes these nsps, and their functions have been elucidated recently (31). Remarkably, a difference between SARS-CoV-2 and other CoVs is the identification of a novel short putative protein within the ORF3 band, a secreted protein with an alpha helix and beta-sheet with six strands encoded by ORF8 (31).\nCoronaviruses encode four major structural proteins, namely, spike (S), membrane (M), envelope (E), and nucleocapsid (N), which are described in detail below.\n\nS Glycoprotein\nCoronavirus S protein is a large, multifunctional class I viral transmembrane protein. The size of this abundant S protein varies from 1,160 amino acids (IBV, infectious bronchitis virus, in poultry) to 1,400 amino acids (FCoV, feline coronavirus) (43). It lies in a trimer on the virion surface, giving the virion a corona or crown-like appearance. Functionally it is required for the entry of the infectious virion particles into the cell through interaction with various host cellular receptors (44).\nFurthermore, it acts as a critical factor for tissue tropism and the determination of host range (45). Notably, S protein is one of the vital immunodominant proteins of CoVs capable of inducing host immune responses (45). The ectodomains in all CoVs S proteins have similar domain organizations, divided into two subunits, S1 and S2 (43). The first one, S1, helps in host receptor binding, while the second one, S2, accounts for fusion. The former (S1) is further divided into two subdomains, namely, the N-terminal domain (NTD) and C-terminal domain (CTD). Both of these subdomains act as receptor-binding domains, interacting efficiently with various host receptors (45). The S1 CTD contains the receptor-binding motif (RBM). In each coronavirus spike protein, the trimeric S1 locates itself on top of the trimeric S2 stalk (45). Recently, structural analyses of the S proteins of COVID-19 have revealed 27 amino acid substitutions within a 1,273-amino-acid stretch (16). Six substitutions are located in the RBD (amino acids 357 to 528), while four substitutions are in the RBM at the CTD of the S1 domain (16). Of note, no amino acid change is seen in the RBM, which binds directly to the angiotensin-converting enzyme-2 (ACE2) receptor in SARS-CoV (16, 46). At present, the main emphasis is knowing how many differences would be required to change the host tropism. Sequence comparison revealed 17 nonsynonymous changes between the early sequence of SARS-CoV-2 and the later isolates of SARS-CoV. The changes were found scattered over the genome of the virus, with nine substitutions in ORF1ab, ORF8 (4 substitutions), the spike gene (3 substitutions), and ORF7a (single substitution) (4). Notably, the same nonsynonymous changes were found in a familial cluster, indicating that the viral evolution happened during person-to-person transmission (4, 47). Such adaptive evolution events are frequent and constitute a constantly ongoing process once the virus spreads among new hosts (47). Even though no functional changes occur in the virus associated with this adaptive evolution, close monitoring of the viral mutations that occur during subsequent human-to-human transmission is warranted.\n\nM Protein\nThe M protein is the most abundant viral protein present in the virion particle, giving a definite shape to the viral envelope (48). It binds to the nucleocapsid and acts as a central organizer of coronavirus assembly (49). Coronavirus M proteins are highly diverse in amino acid contents but maintain overall structural similarity within different genera (50). The M protein has three transmembrane domains, flanked by a short amino terminus outside the virion and a long carboxy terminus inside the virion (50). Overall, the viral scaffold is maintained by M-M interaction. Of note, the M protein of SARS-CoV-2 does not have an amino acid substitution compared to that of SARS-CoV (16).\n\nE Protein\nThe coronavirus E protein is the most enigmatic and smallest of the major structural proteins (51). It plays a multifunctional role in the pathogenesis, assembly, and release of the virus (52). It is a small integral membrane polypeptide that acts as a viroporin (ion channel) (53). The inactivation or absence of this protein is related to the altered virulence of coronaviruses due to changes in morphology and tropism (54). The E protein consists of three domains, namely, a short hydrophilic amino terminal, a large hydrophobic transmembrane domain, and an efficient C-terminal domain (51). The SARS-CoV-2 E protein reveals a similar amino acid constitution without any substitution (16).\n\nN Protein\nThe N protein of coronavirus is multipurpose. Among several functions, it plays a role in complex formation with the viral genome, facilitates M protein interaction needed during virion assembly, and enhances the transcription efficiency of the virus (55, 56). It contains three highly conserved and distinct domains, namely, an NTD, an RNA-binding domain or a linker region (LKR), and a CTD (57). The NTD binds with the 3′ end of the viral genome, perhaps via electrostatic interactions, and is highly diverged both in length and sequence (58). The charged LKR is serine and arginine rich and is also known as the SR (serine and arginine) domain (59). The LKR is capable of direct interaction with in vitro RNA interaction and is responsible for cell signaling (60, 61). It also modulates the antiviral response of the host by working as an antagonist for interferon (IFN) and RNA interference (62). Compared to that of SARS-CoV, the N protein of SARS-CoV-2 possess five amino acid mutations, where two are in the intrinsically dispersed region (IDR; positions 25 and 26), one each in the NTD (position 103), LKR (position 217), and CTD (position 334) (16).\n\nnsps and Accessory Proteins\nBesides the important structural proteins, the SARS-CoV-2 genome contains 15 nsps, nsp1 to nsp10 and nsp12 to nsp16, and 8 accessory proteins (3a, 3b, p6, 7a, 7b, 8b, 9b, and ORF14) (16). All these proteins play a specific role in viral replication (27). Unlike the accessory proteins of SARS-CoV, SARS-CoV-2 does not contain 8a protein and has a longer 8b and shorter 3b protein (16). The nsp7, nsp13, envelope, matrix, and p6 and 8b accessory proteins have not been detected with any amino acid substitutions compared to the sequences of other coronaviruses (16).\nThe virus structure of SARS-CoV-2 is depicted in Fig. 2.\nFIG 2 SARS-CoV-2 virus structure.\n\nSARS-CoV-2 Spike Glycoprotein Gene Analysis\n\nSequence percent similarity analysis.\nWe assessed the nucleotide percent similarity using the MegAlign software program, where the similarity between the novel SARS-CoV-2 isolates was in the range of 99.4% to 100%. Among the other Serbecovirus CoV sequences, the novel SARS-CoV-2 sequences revealed the highest similarity to bat-SL-CoV, with nucleotide percent identity ranges between 88.12 and 89.65%. Meanwhile, earlier reported SARS-CoVs showed 70.6 to 74.9% similarity to SARS-CoV-2 at the nucleotide level. Further, the nucleotide percent similarity was 55.4%, 45.5% to 47.9%, 46.2% to 46.6%, and 45.0% to 46.3% to the other four subgenera, namely, Hibecovirus, Nobecovirus, Merbecovirus, and Embecovirus, respectively. The percent similarity index of current outbreak isolates indicates a close relationship between SARS-CoV-2 isolates and bat-SL-CoV, indicating a common origin. However, particular pieces of evidence based on further complete genomic analysis of current isolates are necessary to draw any conclusions, although it was ascertained that the current novel SARS-CoV-2 isolates belong to the subgenus Sarbecovirus in the diverse range of betacoronaviruses. Their possible ancestor was hypothesized to be from bat CoV strains, wherein bats might have played a crucial role in harboring this class of viruses.\n\nSplitsTree phylogeny analysis.\nIn the unrooted phylogenetic tree of different betacoronaviruses based on the S protein, virus sequences from different subgenera grouped into separate clusters. SARS-CoV-2 sequences from Wuhan and other countries exhibited a close relationship and appeared in a single cluster (Fig. 1). The CoVs from the subgenus Sarbecovirus appeared jointly in SplitsTree and divided into three subclusters, namely, SARS-CoV-2, bat-SARS-like-CoV (bat-SL-CoV), and SARS-CoV (Fig. 1). In the case of other subgenera, like Merbecovirus, all of the sequences grouped in a single cluster, whereas in Embecovirus, different species, comprised of canine respiratory CoVs, bovine CoVs, equine CoVs, and human CoV strain (OC43), grouped in a common cluster. Isolates in the subgenera Nobecovorus and Hibecovirus were found to be placed separately away from other reported SARS-CoVs but shared a bat origin."}

    LitCovid-PD-CLO

    {"project":"LitCovid-PD-CLO","denotations":[{"id":"T35545","span":{"begin":4,"end":9},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T88144","span":{"begin":60,"end":67},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T18398","span":{"begin":210,"end":216},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T3660","span":{"begin":353,"end":356},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T46247","span":{"begin":391,"end":392},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T44089","span":{"begin":575,"end":580},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T93862","span":{"begin":592,"end":597},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T57907","span":{"begin":598,"end":601},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T63589","span":{"begin":746,"end":753},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T12099","span":{"begin":783,"end":788},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T16241","span":{"begin":882,"end":887},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T2621","span":{"begin":932,"end":933},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T86871","span":{"begin":1141,"end":1143},"obj":"http://purl.obolibrary.org/obo/CLO_0050509"},{"id":"T16373","span":{"begin":1209,"end":1213},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T19657","span":{"begin":1292,"end":1296},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T66275","span":{"begin":1416,"end":1418},"obj":"http://purl.obolibrary.org/obo/CLO_0007074"},{"id":"T45005","span":{"begin":1416,"end":1418},"obj":"http://purl.obolibrary.org/obo/CLO_0051988"},{"id":"T35656","span":{"begin":1432,"end":1433},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T69516","span":{"begin":1453,"end":1454},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T24441","span":{"begin":1456,"end":1460},"obj":"http://purl.obolibrary.org/obo/UBERON_0002415"},{"id":"T92697","span":{"begin":1516,"end":1517},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T85017","span":{"begin":1549,"end":1556},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T51486","span":{"begin":1738,"end":1745},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T62699","span":{"begin":1800,"end":1801},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T25841","span":{"begin":1954,"end":1959},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T76364","span":{"begin":1978,"end":1979},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T53349","span":{"begin":2072,"end":2077},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T30689","span":{"begin":2100,"end":2101},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T70922","span":{"begin":2119,"end":2124},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T59932","span":{"begin":2137,"end":2138},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T56925","span":{"begin":2143,"end":2144},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T49383","span":{"begin":2177,"end":2181},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T5940","span":{"begin":2235,"end":2240},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T86758","span":{"begin":2258,"end":2261},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T94196","span":{"begin":2535,"end":2536},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T62424","span":{"begin":2597,"end":2605},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T58532","span":{"begin":2683,"end":2687},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T71201","span":{"begin":2689,"end":2691},"obj":"http://purl.obolibrary.org/obo/CLO_0001302"},{"id":"T81895","span":{"begin":2728,"end":2729},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T67572","span":{"begin":2802,"end":2804},"obj":"http://purl.obolibrary.org/obo/CLO_0001000"},{"id":"T55965","span":{"begin":2869,"end":2870},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T82265","span":{"begin":2935,"end":2936},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T8748","span":{"begin":2965,"end":2972},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T22810","span":{"begin":3077,"end":3078},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T93066","span":{"begin":3088,"end":3093},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T53444","span":{"begin":3184,"end":3185},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T16126","span":{"begin":3193,"end":3194},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T52861","span":{"begin":3382,"end":3386},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T50725","span":{"begin":3395,"end":3397},"obj":"http://purl.obolibrary.org/obo/CLO_0001313"},{"id":"T29361","span":{"begin":3426,"end":3429},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T18080","span":{"begin":3467,"end":3472},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T54872","span":{"begin":3491,"end":3496},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T85624","span":{"begin":3522,"end":3523},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T67299","span":{"begin":3676,"end":3681},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T65625","span":{"begin":3725,"end":3728},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T5893","span":{"begin":3796,"end":3800},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T34134","span":{"begin":3848,"end":3849},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T69248","span":{"begin":3882,"end":3888},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T25458","span":{"begin":3993,"end":3997},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T92446","span":{"begin":4055,"end":4061},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9837"},{"id":"T2505","span":{"begin":4105,"end":4107},"obj":"http://purl.obolibrary.org/obo/CLO_0053794"},{"id":"T82397","span":{"begin":4215,"end":4216},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T87267","span":{"begin":4314,"end":4315},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T86333","span":{"begin":4319,"end":4320},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T61879","span":{"begin":4369,"end":4374},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T69107","span":{"begin":4638,"end":4639},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T67548","span":{"begin":4710,"end":4711},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T10115","span":{"begin":4763,"end":4764},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T91278","span":{"begin":4928,"end":4936},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T71854","span":{"begin":5056,"end":5057},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T97323","span":{"begin":5212,"end":5217},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T49651","span":{"begin":5296,"end":5297},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T12838","span":{"begin":5346,"end":5347},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T50646","span":{"begin":5467,"end":5471},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T22585","span":{"begin":5559,"end":5560},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T63337","span":{"begin":5633,"end":5635},"obj":"http://purl.obolibrary.org/obo/CLO_0053799"},{"id":"T7231","span":{"begin":5752,"end":5754},"obj":"http://purl.obolibrary.org/obo/CLO_0053799"},{"id":"T89969","span":{"begin":5816,"end":5829},"obj":"http://purl.obolibrary.org/obo/OBI_0000245"},{"id":"T64278","span":{"begin":5858,"end":5860},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T31333","span":{"begin":5865,"end":5867},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T93842","span":{"begin":5865,"end":5867},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T90518","span":{"begin":5889,"end":5891},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T33265","span":{"begin":5947,"end":5949},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T55619","span":{"begin":5947,"end":5949},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T40645","span":{"begin":5984,"end":5986},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T45986","span":{"begin":6204,"end":6206},"obj":"http://purl.obolibrary.org/obo/CLO_0053799"},{"id":"T84021","span":{"begin":6213,"end":6215},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T85083","span":{"begin":6311,"end":6313},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T99435","span":{"begin":6352,"end":6354},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T97652","span":{"begin":6352,"end":6354},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T1834","span":{"begin":6362,"end":6364},"obj":"http://purl.obolibrary.org/obo/CLO_0053799"},{"id":"T3578","span":{"begin":6441,"end":6443},"obj":"http://purl.obolibrary.org/obo/CLO_0050509"},{"id":"T8835","span":{"begin":6476,"end":6477},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T73505","span":{"begin":6570,"end":6573},"obj":"http://purl.obolibrary.org/obo/CLO_0001409"},{"id":"T58583","span":{"begin":6634,"end":6636},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T23988","span":{"begin":7093,"end":7098},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T13810","span":{"begin":7169,"end":7173},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T33275","span":{"begin":7284,"end":7285},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T63507","span":{"begin":7454,"end":7455},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T5066","span":{"begin":7492,"end":7497},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T10671","span":{"begin":7575,"end":7580},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T97239","span":{"begin":7691,"end":7696},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T25808","span":{"begin":7700,"end":7705},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T78760","span":{"begin":7832,"end":7833},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T15183","span":{"begin":7872,"end":7874},"obj":"http://purl.obolibrary.org/obo/CLO_0001382"},{"id":"T23598","span":{"begin":7918,"end":7919},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T6527","span":{"begin":7928,"end":7937},"obj":"http://purl.obolibrary.org/obo/OBI_0000245"},{"id":"T12003","span":{"begin":8120,"end":8123},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T4121","span":{"begin":8164,"end":8165},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T47456","span":{"begin":8210,"end":8211},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T5417","span":{"begin":8553,"end":8554},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T59324","span":{"begin":8626,"end":8631},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T37985","span":{"begin":8633,"end":8635},"obj":"http://purl.obolibrary.org/obo/CLO_0001407"},{"id":"T91644","span":{"begin":8644,"end":8645},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T87895","span":{"begin":8661,"end":8669},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T30412","span":{"begin":8695,"end":8696},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T61081","span":{"begin":8920,"end":8921},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T87404","span":{"begin":8956,"end":8957},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T9709","span":{"begin":9072,"end":9073},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T81401","span":{"begin":9228,"end":9229},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T90573","span":{"begin":9393,"end":9398},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T13118","span":{"begin":9507,"end":9508},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T74119","span":{"begin":9534,"end":9535},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T77281","span":{"begin":9763,"end":9765},"obj":"http://purl.obolibrary.org/obo/CLO_0009126"},{"id":"T2595","span":{"begin":9895,"end":9899},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T61322","span":{"begin":9900,"end":9909},"obj":"http://purl.obolibrary.org/obo/SO_0000418"},{"id":"T45950","span":{"begin":10487,"end":10489},"obj":"http://purl.obolibrary.org/obo/CLO_0008339"},{"id":"T38116","span":{"begin":10548,"end":10549},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T3669","span":{"begin":10586,"end":10588},"obj":"http://purl.obolibrary.org/obo/CLO_0050509"},{"id":"T74427","span":{"begin":10677,"end":10680},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T62291","span":{"begin":10681,"end":10682},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T80886","span":{"begin":10761,"end":10763},"obj":"http://purl.obolibrary.org/obo/CLO_0008339"},{"id":"T63741","span":{"begin":10906,"end":10911},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T90002","span":{"begin":10977,"end":10982},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T3134","span":{"begin":11025,"end":11029},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T62091","span":{"begin":11365,"end":11368},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T91591","span":{"begin":11833,"end":11834},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T79207","span":{"begin":11886,"end":11889},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T70060","span":{"begin":11909,"end":11910},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T23742","span":{"begin":12269,"end":12272},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T96383","span":{"begin":12294,"end":12298},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T48063","span":{"begin":12317,"end":12318},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T27036","span":{"begin":12359,"end":12366},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T38832","span":{"begin":12489,"end":12494},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T40021","span":{"begin":12624,"end":12625},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T1642","span":{"begin":12661,"end":12662},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T41371","span":{"begin":12737,"end":12744},"obj":"http://purl.obolibrary.org/obo/UBERON_0000982"},{"id":"T81447","span":{"begin":12737,"end":12744},"obj":"http://purl.obolibrary.org/obo/UBERON_0004905"},{"id":"T30367","span":{"begin":12815,"end":12818},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T50486","span":{"begin":12834,"end":12837},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"},{"id":"T74741","span":{"begin":12953,"end":12954},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T94159","span":{"begin":13065,"end":13071},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9796"},{"id":"T22104","span":{"begin":13082,"end":13087},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T51739","span":{"begin":13118,"end":13119},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T22238","span":{"begin":13271,"end":13272},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T23524","span":{"begin":13273,"end":13276},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9397"}],"text":"THE VIRUS (SARS-CoV-2)\nCoronaviruses are positive-sense RNA viruses having an extensive and promiscuous range of natural hosts and affect multiple systems (23, 24). Coronaviruses can cause clinical diseases in humans that may extend from the common cold to more severe respiratory diseases like SARS and MERS (17, 279). The recently emerging SARS-CoV-2 has wrought havoc in China and caused a pandemic situation in the worldwide population, leading to disease outbreaks that have not been controlled to date, although extensive efforts are being put in place to counter this virus (25). This virus has been proposed to be designated/named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses (ICTV), which determined the virus belongs to the Severe acute respiratory syndrome-related coronavirus category and found this virus is related to SARS-CoVs (26). SARS-CoV-2 is a member of the order Nidovirales, family Coronaviridae, subfamily Orthocoronavirinae, which is subdivided into four genera, viz., Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus (3, 27). The genera Alphacoronavirus and Betacoronavirus originate from bats, while Gammacoronavirus and Deltacoronavirus have evolved from bird and swine gene pools (24, 28, 29, 275).\nCoronaviruses possess an unsegmented, single-stranded, positive-sense RNA genome of around 30 kb, enclosed by a 5′-cap and 3′-poly(A) tail (30). The genome of SARS-CoV-2 is 29,891 bp long, with a G+C content of 38% (31). These viruses are encircled with an envelope containing viral nucleocapsid. The nucleocapsids in CoVs are arranged in helical symmetry, which reflects an atypical attribute in positive-sense RNA viruses (30). The electron micrographs of SARS-CoV-2 revealed a diverging spherical outline with some degree of pleomorphism, virion diameters varying from 60 to 140 nm, and distinct spikes of 9 to 12 nm, giving the virus the appearance of a solar corona (3). The CoV genome is arranged linearly as 5′-leader-UTR-replicase-structural genes (S-E-M-N)-3′ UTR-poly(A) (32). Accessory genes, such as 3a/b, 4a/b, and the hemagglutinin-esterase gene (HE), are also seen intermingled with the structural genes (30). SARS-CoV-2 has also been found to be arranged similarly and encodes several accessory proteins, although it lacks the HE, which is characteristic of some betacoronaviruses (31). The positive-sense genome of CoVs serves as the mRNA and is translated to polyprotein 1a/1ab (pp1a/1ab) (33). A replication-transcription complex (RTC) is formed in double-membrane vesicles (DMVs) by nonstructural proteins (nsps), encoded by the polyprotein gene (34). Subsequently, the RTC synthesizes a nested set of subgenomic RNAs (sgRNAs) via discontinuous transcription (35).\nBased on molecular characterization, SARS-CoV-2 is considered a new Betacoronavirus belonging to the subgenus Sarbecovirus (3). A few other critical zoonotic viruses (MERS-related CoV and SARS-related CoV) belong to the same genus. However, SARS-CoV-2 was identified as a distinct virus based on the percent identity with other Betacoronavirus; conserved open reading frame 1a/b (ORF1a/b) is below 90% identity (3). An overall 80% nucleotide identity was observed between SARS-CoV-2 and the original SARS-CoV, along with 89% identity with ZC45 and ZXC21 SARS-related CoVs of bats (2, 31, 36). In addition, 82% identity has been observed between SARS-CoV-2 and human SARS-CoV Tor2 and human SARS-CoV BJ01 2003 (31). A sequence identity of only 51.8% was observed between MERS-related CoV and the recently emerged SARS-CoV-2 (37). Phylogenetic analysis of the structural genes also revealed that SARS-CoV-2 is closer to bat SARS-related CoV. Therefore, SARS-CoV-2 might have originated from bats, while other amplifier hosts might have played a role in disease transmission to humans (31). Of note, the other two zoonotic CoVs (MERS-related CoV and SARS-related CoV) also originated from bats (38, 39). Nevertheless, for SARS and MERS, civet cat and camels, respectively, act as amplifier hosts (40, 41).\nCoronavirus genomes and subgenomes encode six ORFs (31). The majority of the 5′ end is occupied by ORF1a/b, which produces 16 nsps. The two polyproteins, pp1a and pp1ab, are initially produced from ORF1a/b by a −1 frameshift between ORF1a and ORF1b (32). The virus-encoded proteases cleave polyproteins into individual nsps (main protease [Mpro], chymotrypsin-like protease [3CLpro], and papain-like proteases [PLPs]) (42). SARS-CoV-2 also encodes these nsps, and their functions have been elucidated recently (31). Remarkably, a difference between SARS-CoV-2 and other CoVs is the identification of a novel short putative protein within the ORF3 band, a secreted protein with an alpha helix and beta-sheet with six strands encoded by ORF8 (31).\nCoronaviruses encode four major structural proteins, namely, spike (S), membrane (M), envelope (E), and nucleocapsid (N), which are described in detail below.\n\nS Glycoprotein\nCoronavirus S protein is a large, multifunctional class I viral transmembrane protein. The size of this abundant S protein varies from 1,160 amino acids (IBV, infectious bronchitis virus, in poultry) to 1,400 amino acids (FCoV, feline coronavirus) (43). It lies in a trimer on the virion surface, giving the virion a corona or crown-like appearance. Functionally it is required for the entry of the infectious virion particles into the cell through interaction with various host cellular receptors (44).\nFurthermore, it acts as a critical factor for tissue tropism and the determination of host range (45). Notably, S protein is one of the vital immunodominant proteins of CoVs capable of inducing host immune responses (45). The ectodomains in all CoVs S proteins have similar domain organizations, divided into two subunits, S1 and S2 (43). The first one, S1, helps in host receptor binding, while the second one, S2, accounts for fusion. The former (S1) is further divided into two subdomains, namely, the N-terminal domain (NTD) and C-terminal domain (CTD). Both of these subdomains act as receptor-binding domains, interacting efficiently with various host receptors (45). The S1 CTD contains the receptor-binding motif (RBM). In each coronavirus spike protein, the trimeric S1 locates itself on top of the trimeric S2 stalk (45). Recently, structural analyses of the S proteins of COVID-19 have revealed 27 amino acid substitutions within a 1,273-amino-acid stretch (16). Six substitutions are located in the RBD (amino acids 357 to 528), while four substitutions are in the RBM at the CTD of the S1 domain (16). Of note, no amino acid change is seen in the RBM, which binds directly to the angiotensin-converting enzyme-2 (ACE2) receptor in SARS-CoV (16, 46). At present, the main emphasis is knowing how many differences would be required to change the host tropism. Sequence comparison revealed 17 nonsynonymous changes between the early sequence of SARS-CoV-2 and the later isolates of SARS-CoV. The changes were found scattered over the genome of the virus, with nine substitutions in ORF1ab, ORF8 (4 substitutions), the spike gene (3 substitutions), and ORF7a (single substitution) (4). Notably, the same nonsynonymous changes were found in a familial cluster, indicating that the viral evolution happened during person-to-person transmission (4, 47). Such adaptive evolution events are frequent and constitute a constantly ongoing process once the virus spreads among new hosts (47). Even though no functional changes occur in the virus associated with this adaptive evolution, close monitoring of the viral mutations that occur during subsequent human-to-human transmission is warranted.\n\nM Protein\nThe M protein is the most abundant viral protein present in the virion particle, giving a definite shape to the viral envelope (48). It binds to the nucleocapsid and acts as a central organizer of coronavirus assembly (49). Coronavirus M proteins are highly diverse in amino acid contents but maintain overall structural similarity within different genera (50). The M protein has three transmembrane domains, flanked by a short amino terminus outside the virion and a long carboxy terminus inside the virion (50). Overall, the viral scaffold is maintained by M-M interaction. Of note, the M protein of SARS-CoV-2 does not have an amino acid substitution compared to that of SARS-CoV (16).\n\nE Protein\nThe coronavirus E protein is the most enigmatic and smallest of the major structural proteins (51). It plays a multifunctional role in the pathogenesis, assembly, and release of the virus (52). It is a small integral membrane polypeptide that acts as a viroporin (ion channel) (53). The inactivation or absence of this protein is related to the altered virulence of coronaviruses due to changes in morphology and tropism (54). The E protein consists of three domains, namely, a short hydrophilic amino terminal, a large hydrophobic transmembrane domain, and an efficient C-terminal domain (51). The SARS-CoV-2 E protein reveals a similar amino acid constitution without any substitution (16).\n\nN Protein\nThe N protein of coronavirus is multipurpose. Among several functions, it plays a role in complex formation with the viral genome, facilitates M protein interaction needed during virion assembly, and enhances the transcription efficiency of the virus (55, 56). It contains three highly conserved and distinct domains, namely, an NTD, an RNA-binding domain or a linker region (LKR), and a CTD (57). The NTD binds with the 3′ end of the viral genome, perhaps via electrostatic interactions, and is highly diverged both in length and sequence (58). The charged LKR is serine and arginine rich and is also known as the SR (serine and arginine) domain (59). The LKR is capable of direct interaction with in vitro RNA interaction and is responsible for cell signaling (60, 61). It also modulates the antiviral response of the host by working as an antagonist for interferon (IFN) and RNA interference (62). Compared to that of SARS-CoV, the N protein of SARS-CoV-2 possess five amino acid mutations, where two are in the intrinsically dispersed region (IDR; positions 25 and 26), one each in the NTD (position 103), LKR (position 217), and CTD (position 334) (16).\n\nnsps and Accessory Proteins\nBesides the important structural proteins, the SARS-CoV-2 genome contains 15 nsps, nsp1 to nsp10 and nsp12 to nsp16, and 8 accessory proteins (3a, 3b, p6, 7a, 7b, 8b, 9b, and ORF14) (16). All these proteins play a specific role in viral replication (27). Unlike the accessory proteins of SARS-CoV, SARS-CoV-2 does not contain 8a protein and has a longer 8b and shorter 3b protein (16). The nsp7, nsp13, envelope, matrix, and p6 and 8b accessory proteins have not been detected with any amino acid substitutions compared to the sequences of other coronaviruses (16).\nThe virus structure of SARS-CoV-2 is depicted in Fig. 2.\nFIG 2 SARS-CoV-2 virus structure.\n\nSARS-CoV-2 Spike Glycoprotein Gene Analysis\n\nSequence percent similarity analysis.\nWe assessed the nucleotide percent similarity using the MegAlign software program, where the similarity between the novel SARS-CoV-2 isolates was in the range of 99.4% to 100%. Among the other Serbecovirus CoV sequences, the novel SARS-CoV-2 sequences revealed the highest similarity to bat-SL-CoV, with nucleotide percent identity ranges between 88.12 and 89.65%. Meanwhile, earlier reported SARS-CoVs showed 70.6 to 74.9% similarity to SARS-CoV-2 at the nucleotide level. Further, the nucleotide percent similarity was 55.4%, 45.5% to 47.9%, 46.2% to 46.6%, and 45.0% to 46.3% to the other four subgenera, namely, Hibecovirus, Nobecovirus, Merbecovirus, and Embecovirus, respectively. The percent similarity index of current outbreak isolates indicates a close relationship between SARS-CoV-2 isolates and bat-SL-CoV, indicating a common origin. However, particular pieces of evidence based on further complete genomic analysis of current isolates are necessary to draw any conclusions, although it was ascertained that the current novel SARS-CoV-2 isolates belong to the subgenus Sarbecovirus in the diverse range of betacoronaviruses. Their possible ancestor was hypothesized to be from bat CoV strains, wherein bats might have played a crucial role in harboring this class of viruses.\n\nSplitsTree phylogeny analysis.\nIn the unrooted phylogenetic tree of different betacoronaviruses based on the S protein, virus sequences from different subgenera grouped into separate clusters. SARS-CoV-2 sequences from Wuhan and other countries exhibited a close relationship and appeared in a single cluster (Fig. 1). The CoVs from the subgenus Sarbecovirus appeared jointly in SplitsTree and divided into three subclusters, namely, SARS-CoV-2, bat-SARS-like-CoV (bat-SL-CoV), and SARS-CoV (Fig. 1). In the case of other subgenera, like Merbecovirus, all of the sequences grouped in a single cluster, whereas in Embecovirus, different species, comprised of canine respiratory CoVs, bovine CoVs, equine CoVs, and human CoV strain (OC43), grouped in a common cluster. Isolates in the subgenera Nobecovorus and Hibecovirus were found to be placed separately away from other reported SARS-CoVs but shared a bat origin."}

    LitCovid-PD-CHEBI

    {"project":"LitCovid-PD-CHEBI","denotations":[{"id":"T21","span":{"begin":1756,"end":1764},"obj":"Chemical"},{"id":"T22","span":{"begin":1986,"end":1992},"obj":"Chemical"},{"id":"T23","span":{"begin":2081,"end":2084},"obj":"Chemical"},{"id":"T24","span":{"begin":2333,"end":2341},"obj":"Chemical"},{"id":"T25","span":{"begin":2639,"end":2647},"obj":"Chemical"},{"id":"T26","span":{"begin":3238,"end":3248},"obj":"Chemical"},{"id":"T27","span":{"begin":4047,"end":4050},"obj":"Chemical"},{"id":"T28","span":{"begin":4733,"end":4740},"obj":"Chemical"},{"id":"T29","span":{"begin":4774,"end":4781},"obj":"Chemical"},{"id":"T30","span":{"begin":4790,"end":4795},"obj":"Chemical"},{"id":"T31","span":{"begin":4806,"end":4810},"obj":"Chemical"},{"id":"T32","span":{"begin":4899,"end":4907},"obj":"Chemical"},{"id":"T33","span":{"begin":5018,"end":5030},"obj":"Chemical"},{"id":"T34","span":{"begin":5045,"end":5052},"obj":"Chemical"},{"id":"T35","span":{"begin":5109,"end":5116},"obj":"Chemical"},{"id":"T36","span":{"begin":5146,"end":5153},"obj":"Chemical"},{"id":"T37","span":{"begin":5172,"end":5183},"obj":"Chemical"},{"id":"T38","span":{"begin":5172,"end":5177},"obj":"Chemical"},{"id":"T39","span":{"begin":5178,"end":5183},"obj":"Chemical"},{"id":"T40","span":{"begin":5240,"end":5251},"obj":"Chemical"},{"id":"T41","span":{"begin":5240,"end":5245},"obj":"Chemical"},{"id":"T42","span":{"begin":5246,"end":5251},"obj":"Chemical"},{"id":"T43","span":{"begin":5348,"end":5354},"obj":"Chemical"},{"id":"T44","span":{"begin":5358,"end":5363},"obj":"Chemical"},{"id":"T45","span":{"begin":5649,"end":5656},"obj":"Chemical"},{"id":"T46","span":{"begin":5692,"end":5700},"obj":"Chemical"},{"id":"T47","span":{"begin":5787,"end":5795},"obj":"Chemical"},{"id":"T48","span":{"begin":5865,"end":5867},"obj":"Chemical"},{"id":"T49","span":{"begin":5947,"end":5949},"obj":"Chemical"},{"id":"T50","span":{"begin":6289,"end":6296},"obj":"Chemical"},{"id":"T51","span":{"begin":6352,"end":6354},"obj":"Chemical"},{"id":"T52","span":{"begin":6406,"end":6414},"obj":"Chemical"},{"id":"T53","span":{"begin":6444,"end":6454},"obj":"Chemical"},{"id":"T54","span":{"begin":6444,"end":6449},"obj":"Chemical"},{"id":"T55","span":{"begin":6450,"end":6454},"obj":"Chemical"},{"id":"T56","span":{"begin":6484,"end":6489},"obj":"Chemical"},{"id":"T57","span":{"begin":6490,"end":6494},"obj":"Chemical"},{"id":"T58","span":{"begin":6551,"end":6562},"obj":"Chemical"},{"id":"T59","span":{"begin":6551,"end":6556},"obj":"Chemical"},{"id":"T60","span":{"begin":6557,"end":6562},"obj":"Chemical"},{"id":"T61","span":{"begin":6662,"end":6672},"obj":"Chemical"},{"id":"T62","span":{"begin":6662,"end":6667},"obj":"Chemical"},{"id":"T63","span":{"begin":6668,"end":6672},"obj":"Chemical"},{"id":"T64","span":{"begin":6728,"end":6739},"obj":"Chemical"},{"id":"T65","span":{"begin":7736,"end":7743},"obj":"Chemical"},{"id":"T66","span":{"begin":7750,"end":7757},"obj":"Chemical"},{"id":"T67","span":{"begin":7785,"end":7792},"obj":"Chemical"},{"id":"T68","span":{"begin":7982,"end":7990},"obj":"Chemical"},{"id":"T69","span":{"begin":8013,"end":8023},"obj":"Chemical"},{"id":"T70","span":{"begin":8013,"end":8018},"obj":"Chemical"},{"id":"T71","span":{"begin":8019,"end":8023},"obj":"Chemical"},{"id":"T72","span":{"begin":8112,"end":8119},"obj":"Chemical"},{"id":"T73","span":{"begin":8172,"end":8177},"obj":"Chemical"},{"id":"T74","span":{"begin":8217,"end":8224},"obj":"Chemical"},{"id":"T75","span":{"begin":8335,"end":8342},"obj":"Chemical"},{"id":"T76","span":{"begin":8374,"end":8384},"obj":"Chemical"},{"id":"T77","span":{"begin":8374,"end":8379},"obj":"Chemical"},{"id":"T78","span":{"begin":8380,"end":8384},"obj":"Chemical"},{"id":"T79","span":{"begin":8436,"end":8443},"obj":"Chemical"},{"id":"T80","span":{"begin":8462,"end":8469},"obj":"Chemical"},{"id":"T81","span":{"begin":8529,"end":8537},"obj":"Chemical"},{"id":"T82","span":{"begin":8670,"end":8681},"obj":"Chemical"},{"id":"T83","span":{"begin":8708,"end":8711},"obj":"Chemical"},{"id":"T84","span":{"begin":8763,"end":8770},"obj":"Chemical"},{"id":"T85","span":{"begin":8877,"end":8884},"obj":"Chemical"},{"id":"T86","span":{"begin":8940,"end":8945},"obj":"Chemical"},{"id":"T87","span":{"begin":9056,"end":9063},"obj":"Chemical"},{"id":"T88","span":{"begin":9082,"end":9092},"obj":"Chemical"},{"id":"T89","span":{"begin":9082,"end":9087},"obj":"Chemical"},{"id":"T90","span":{"begin":9088,"end":9092},"obj":"Chemical"},{"id":"T91","span":{"begin":9140,"end":9147},"obj":"Chemical"},{"id":"T92","span":{"begin":9154,"end":9161},"obj":"Chemical"},{"id":"T93","span":{"begin":9293,"end":9300},"obj":"Chemical"},{"id":"T94","span":{"begin":9713,"end":9719},"obj":"Chemical"},{"id":"T95","span":{"begin":9724,"end":9732},"obj":"Chemical"},{"id":"T98","span":{"begin":9767,"end":9773},"obj":"Chemical"},{"id":"T99","span":{"begin":9778,"end":9786},"obj":"Chemical"},{"id":"T102","span":{"begin":9942,"end":9951},"obj":"Chemical"},{"id":"T103","span":{"begin":9990,"end":10000},"obj":"Chemical"},{"id":"T104","span":{"begin":10005,"end":10015},"obj":"Chemical"},{"id":"T105","span":{"begin":10017,"end":10020},"obj":"Chemical"},{"id":"T106","span":{"begin":10085,"end":10092},"obj":"Chemical"},{"id":"T107","span":{"begin":10120,"end":10130},"obj":"Chemical"},{"id":"T108","span":{"begin":10120,"end":10125},"obj":"Chemical"},{"id":"T109","span":{"begin":10126,"end":10130},"obj":"Chemical"},{"id":"T110","span":{"begin":10369,"end":10377},"obj":"Chemical"},{"id":"T111","span":{"begin":10469,"end":10477},"obj":"Chemical"},{"id":"T112","span":{"begin":10534,"end":10542},"obj":"Chemical"},{"id":"T113","span":{"begin":10612,"end":10620},"obj":"Chemical"},{"id":"T114","span":{"begin":10665,"end":10672},"obj":"Chemical"},{"id":"T115","span":{"begin":10708,"end":10715},"obj":"Chemical"},{"id":"T116","span":{"begin":10781,"end":10789},"obj":"Chemical"},{"id":"T117","span":{"begin":10822,"end":10832},"obj":"Chemical"},{"id":"T118","span":{"begin":10822,"end":10827},"obj":"Chemical"},{"id":"T119","span":{"begin":10828,"end":10832},"obj":"Chemical"},{"id":"T120","span":{"begin":11012,"end":11024},"obj":"Chemical"},{"id":"T121","span":{"begin":11094,"end":11104},"obj":"Chemical"},{"id":"T122","span":{"begin":11369,"end":11371},"obj":"Chemical"},{"id":"T123","span":{"begin":11382,"end":11392},"obj":"Chemical"},{"id":"T124","span":{"begin":11534,"end":11544},"obj":"Chemical"},{"id":"T125","span":{"begin":11565,"end":11575},"obj":"Chemical"},{"id":"T126","span":{"begin":11890,"end":11892},"obj":"Chemical"},{"id":"T127","span":{"begin":12480,"end":12487},"obj":"Chemical"},{"id":"T128","span":{"begin":12838,"end":12840},"obj":"Chemical"}],"attributes":[{"id":"A21","pred":"chebi_id","subj":"T21","obj":"http://purl.obolibrary.org/obo/CHEBI_10545"},{"id":"A22","pred":"chebi_id","subj":"T22","obj":"http://purl.obolibrary.org/obo/CHEBI_37409"},{"id":"A23","pred":"chebi_id","subj":"T23","obj":"http://purl.obolibrary.org/obo/CHEBI_73507"},{"id":"A24","pred":"chebi_id","subj":"T24","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A25","pred":"chebi_id","subj":"T25","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A26","pred":"chebi_id","subj":"T26","obj":"http://purl.obolibrary.org/obo/CHEBI_36976"},{"id":"A27","pred":"chebi_id","subj":"T27","obj":"http://purl.obolibrary.org/obo/CHEBI_32402"},{"id":"A28","pred":"chebi_id","subj":"T28","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A29","pred":"chebi_id","subj":"T29","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A30","pred":"chebi_id","subj":"T30","obj":"http://purl.obolibrary.org/obo/CHEBI_30216"},{"id":"A31","pred":"chebi_id","subj":"T31","obj":"http://purl.obolibrary.org/obo/CHEBI_10545"},{"id":"A32","pred":"chebi_id","subj":"T32","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A33","pred":"chebi_id","subj":"T33","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A34","pred":"chebi_id","subj":"T34","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A35","pred":"chebi_id","subj":"T35","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A36","pred":"chebi_id","subj":"T36","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A37","pred":"chebi_id","subj":"T37","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A38","pred":"chebi_id","subj":"T38","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A39","pred":"chebi_id","subj":"T39","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A40","pred":"chebi_id","subj":"T40","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A41","pred":"chebi_id","subj":"T41","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A42","pred":"chebi_id","subj":"T42","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A43","pred":"chebi_id","subj":"T43","obj":"http://purl.obolibrary.org/obo/CHEBI_37409"},{"id":"A44","pred":"chebi_id","subj":"T44","obj":"http://purl.obolibrary.org/obo/CHEBI_37409"},{"id":"A45","pred":"chebi_id","subj":"T45","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A46","pred":"chebi_id","subj":"T46","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A47","pred":"chebi_id","subj":"T47","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A48","pred":"chebi_id","subj":"T48","obj":"http://purl.obolibrary.org/obo/CHEBI_29387"},{"id":"A49","pred":"chebi_id","subj":"T49","obj":"http://purl.obolibrary.org/obo/CHEBI_29387"},{"id":"A50","pred":"chebi_id","subj":"T50","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A51","pred":"chebi_id","subj":"T51","obj":"http://purl.obolibrary.org/obo/CHEBI_29387"},{"id":"A52","pred":"chebi_id","subj":"T52","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A53","pred":"chebi_id","subj":"T53","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A54","pred":"chebi_id","subj":"T54","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A55","pred":"chebi_id","subj":"T55","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A56","pred":"chebi_id","subj":"T56","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A57","pred":"chebi_id","subj":"T57","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A58","pred":"chebi_id","subj":"T58","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A59","pred":"chebi_id","subj":"T59","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A60","pred":"chebi_id","subj":"T60","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A61","pred":"chebi_id","subj":"T61","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A62","pred":"chebi_id","subj":"T62","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A63","pred":"chebi_id","subj":"T63","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A64","pred":"chebi_id","subj":"T64","obj":"http://purl.obolibrary.org/obo/CHEBI_48433"},{"id":"A65","pred":"chebi_id","subj":"T65","obj":"http://purl.obolibrary.org/obo/CHEBI_16541"},{"id":"A66","pred":"chebi_id","subj":"T66","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A67","pred":"chebi_id","subj":"T67","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A68","pred":"chebi_id","subj":"T68","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A69","pred":"chebi_id","subj":"T69","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A70","pred":"chebi_id","subj":"T70","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A71","pred":"chebi_id","subj":"T71","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A72","pred":"chebi_id","subj":"T72","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A73","pred":"chebi_id","subj":"T73","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A74","pred":"chebi_id","subj":"T74","obj":"http://purl.obolibrary.org/obo/CHEBI_46883"},{"id":"A75","pred":"chebi_id","subj":"T75","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A76","pred":"chebi_id","subj":"T76","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A77","pred":"chebi_id","subj":"T77","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A78","pred":"chebi_id","subj":"T78","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A79","pred":"chebi_id","subj":"T79","obj":"http://purl.obolibrary.org/obo/CHEBI_16541"},{"id":"A80","pred":"chebi_id","subj":"T80","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A81","pred":"chebi_id","subj":"T81","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A82","pred":"chebi_id","subj":"T82","obj":"http://purl.obolibrary.org/obo/CHEBI_15841"},{"id":"A83","pred":"chebi_id","subj":"T83","obj":"http://purl.obolibrary.org/obo/CHEBI_24870"},{"id":"A84","pred":"chebi_id","subj":"T84","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A85","pred":"chebi_id","subj":"T85","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A86","pred":"chebi_id","subj":"T86","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A87","pred":"chebi_id","subj":"T87","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A88","pred":"chebi_id","subj":"T88","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A89","pred":"chebi_id","subj":"T89","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A90","pred":"chebi_id","subj":"T90","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A91","pred":"chebi_id","subj":"T91","obj":"http://purl.obolibrary.org/obo/CHEBI_16541"},{"id":"A92","pred":"chebi_id","subj":"T92","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A93","pred":"chebi_id","subj":"T93","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A94","pred":"chebi_id","subj":"T94","obj":"http://purl.obolibrary.org/obo/CHEBI_17822"},{"id":"A95","pred":"chebi_id","subj":"T95","obj":"http://purl.obolibrary.org/obo/CHEBI_16467"},{"id":"A96","pred":"chebi_id","subj":"T95","obj":"http://purl.obolibrary.org/obo/CHEBI_29016"},{"id":"A97","pred":"chebi_id","subj":"T95","obj":"http://purl.obolibrary.org/obo/CHEBI_32696"},{"id":"A98","pred":"chebi_id","subj":"T98","obj":"http://purl.obolibrary.org/obo/CHEBI_17822"},{"id":"A99","pred":"chebi_id","subj":"T99","obj":"http://purl.obolibrary.org/obo/CHEBI_16467"},{"id":"A100","pred":"chebi_id","subj":"T99","obj":"http://purl.obolibrary.org/obo/CHEBI_29016"},{"id":"A101","pred":"chebi_id","subj":"T99","obj":"http://purl.obolibrary.org/obo/CHEBI_32696"},{"id":"A102","pred":"chebi_id","subj":"T102","obj":"http://purl.obolibrary.org/obo/CHEBI_22587"},{"id":"A103","pred":"chebi_id","subj":"T103","obj":"http://purl.obolibrary.org/obo/CHEBI_48706"},{"id":"A104","pred":"chebi_id","subj":"T104","obj":"http://purl.obolibrary.org/obo/CHEBI_52999"},{"id":"A105","pred":"chebi_id","subj":"T105","obj":"http://purl.obolibrary.org/obo/CHEBI_52999"},{"id":"A106","pred":"chebi_id","subj":"T106","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A107","pred":"chebi_id","subj":"T107","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A108","pred":"chebi_id","subj":"T108","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A109","pred":"chebi_id","subj":"T109","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A110","pred":"chebi_id","subj":"T110","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A111","pred":"chebi_id","subj":"T111","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A112","pred":"chebi_id","subj":"T112","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A113","pred":"chebi_id","subj":"T113","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A114","pred":"chebi_id","subj":"T114","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A115","pred":"chebi_id","subj":"T115","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A116","pred":"chebi_id","subj":"T116","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A117","pred":"chebi_id","subj":"T117","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A118","pred":"chebi_id","subj":"T118","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A119","pred":"chebi_id","subj":"T119","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A120","pred":"chebi_id","subj":"T120","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A121","pred":"chebi_id","subj":"T121","obj":"http://purl.obolibrary.org/obo/CHEBI_36976"},{"id":"A122","pred":"chebi_id","subj":"T122","obj":"http://purl.obolibrary.org/obo/CHEBI_74815"},{"id":"A123","pred":"chebi_id","subj":"T123","obj":"http://purl.obolibrary.org/obo/CHEBI_36976"},{"id":"A124","pred":"chebi_id","subj":"T124","obj":"http://purl.obolibrary.org/obo/CHEBI_36976"},{"id":"A125","pred":"chebi_id","subj":"T125","obj":"http://purl.obolibrary.org/obo/CHEBI_36976"},{"id":"A126","pred":"chebi_id","subj":"T126","obj":"http://purl.obolibrary.org/obo/CHEBI_74815"},{"id":"A127","pred":"chebi_id","subj":"T127","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A128","pred":"chebi_id","subj":"T128","obj":"http://purl.obolibrary.org/obo/CHEBI_74815"}],"text":"THE VIRUS (SARS-CoV-2)\nCoronaviruses are positive-sense RNA viruses having an extensive and promiscuous range of natural hosts and affect multiple systems (23, 24). Coronaviruses can cause clinical diseases in humans that may extend from the common cold to more severe respiratory diseases like SARS and MERS (17, 279). The recently emerging SARS-CoV-2 has wrought havoc in China and caused a pandemic situation in the worldwide population, leading to disease outbreaks that have not been controlled to date, although extensive efforts are being put in place to counter this virus (25). This virus has been proposed to be designated/named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses (ICTV), which determined the virus belongs to the Severe acute respiratory syndrome-related coronavirus category and found this virus is related to SARS-CoVs (26). SARS-CoV-2 is a member of the order Nidovirales, family Coronaviridae, subfamily Orthocoronavirinae, which is subdivided into four genera, viz., Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus (3, 27). The genera Alphacoronavirus and Betacoronavirus originate from bats, while Gammacoronavirus and Deltacoronavirus have evolved from bird and swine gene pools (24, 28, 29, 275).\nCoronaviruses possess an unsegmented, single-stranded, positive-sense RNA genome of around 30 kb, enclosed by a 5′-cap and 3′-poly(A) tail (30). The genome of SARS-CoV-2 is 29,891 bp long, with a G+C content of 38% (31). These viruses are encircled with an envelope containing viral nucleocapsid. The nucleocapsids in CoVs are arranged in helical symmetry, which reflects an atypical attribute in positive-sense RNA viruses (30). The electron micrographs of SARS-CoV-2 revealed a diverging spherical outline with some degree of pleomorphism, virion diameters varying from 60 to 140 nm, and distinct spikes of 9 to 12 nm, giving the virus the appearance of a solar corona (3). The CoV genome is arranged linearly as 5′-leader-UTR-replicase-structural genes (S-E-M-N)-3′ UTR-poly(A) (32). Accessory genes, such as 3a/b, 4a/b, and the hemagglutinin-esterase gene (HE), are also seen intermingled with the structural genes (30). SARS-CoV-2 has also been found to be arranged similarly and encodes several accessory proteins, although it lacks the HE, which is characteristic of some betacoronaviruses (31). The positive-sense genome of CoVs serves as the mRNA and is translated to polyprotein 1a/1ab (pp1a/1ab) (33). A replication-transcription complex (RTC) is formed in double-membrane vesicles (DMVs) by nonstructural proteins (nsps), encoded by the polyprotein gene (34). Subsequently, the RTC synthesizes a nested set of subgenomic RNAs (sgRNAs) via discontinuous transcription (35).\nBased on molecular characterization, SARS-CoV-2 is considered a new Betacoronavirus belonging to the subgenus Sarbecovirus (3). A few other critical zoonotic viruses (MERS-related CoV and SARS-related CoV) belong to the same genus. However, SARS-CoV-2 was identified as a distinct virus based on the percent identity with other Betacoronavirus; conserved open reading frame 1a/b (ORF1a/b) is below 90% identity (3). An overall 80% nucleotide identity was observed between SARS-CoV-2 and the original SARS-CoV, along with 89% identity with ZC45 and ZXC21 SARS-related CoVs of bats (2, 31, 36). In addition, 82% identity has been observed between SARS-CoV-2 and human SARS-CoV Tor2 and human SARS-CoV BJ01 2003 (31). A sequence identity of only 51.8% was observed between MERS-related CoV and the recently emerged SARS-CoV-2 (37). Phylogenetic analysis of the structural genes also revealed that SARS-CoV-2 is closer to bat SARS-related CoV. Therefore, SARS-CoV-2 might have originated from bats, while other amplifier hosts might have played a role in disease transmission to humans (31). Of note, the other two zoonotic CoVs (MERS-related CoV and SARS-related CoV) also originated from bats (38, 39). Nevertheless, for SARS and MERS, civet cat and camels, respectively, act as amplifier hosts (40, 41).\nCoronavirus genomes and subgenomes encode six ORFs (31). The majority of the 5′ end is occupied by ORF1a/b, which produces 16 nsps. The two polyproteins, pp1a and pp1ab, are initially produced from ORF1a/b by a −1 frameshift between ORF1a and ORF1b (32). The virus-encoded proteases cleave polyproteins into individual nsps (main protease [Mpro], chymotrypsin-like protease [3CLpro], and papain-like proteases [PLPs]) (42). SARS-CoV-2 also encodes these nsps, and their functions have been elucidated recently (31). Remarkably, a difference between SARS-CoV-2 and other CoVs is the identification of a novel short putative protein within the ORF3 band, a secreted protein with an alpha helix and beta-sheet with six strands encoded by ORF8 (31).\nCoronaviruses encode four major structural proteins, namely, spike (S), membrane (M), envelope (E), and nucleocapsid (N), which are described in detail below.\n\nS Glycoprotein\nCoronavirus S protein is a large, multifunctional class I viral transmembrane protein. The size of this abundant S protein varies from 1,160 amino acids (IBV, infectious bronchitis virus, in poultry) to 1,400 amino acids (FCoV, feline coronavirus) (43). It lies in a trimer on the virion surface, giving the virion a corona or crown-like appearance. Functionally it is required for the entry of the infectious virion particles into the cell through interaction with various host cellular receptors (44).\nFurthermore, it acts as a critical factor for tissue tropism and the determination of host range (45). Notably, S protein is one of the vital immunodominant proteins of CoVs capable of inducing host immune responses (45). The ectodomains in all CoVs S proteins have similar domain organizations, divided into two subunits, S1 and S2 (43). The first one, S1, helps in host receptor binding, while the second one, S2, accounts for fusion. The former (S1) is further divided into two subdomains, namely, the N-terminal domain (NTD) and C-terminal domain (CTD). Both of these subdomains act as receptor-binding domains, interacting efficiently with various host receptors (45). The S1 CTD contains the receptor-binding motif (RBM). In each coronavirus spike protein, the trimeric S1 locates itself on top of the trimeric S2 stalk (45). Recently, structural analyses of the S proteins of COVID-19 have revealed 27 amino acid substitutions within a 1,273-amino-acid stretch (16). Six substitutions are located in the RBD (amino acids 357 to 528), while four substitutions are in the RBM at the CTD of the S1 domain (16). Of note, no amino acid change is seen in the RBM, which binds directly to the angiotensin-converting enzyme-2 (ACE2) receptor in SARS-CoV (16, 46). At present, the main emphasis is knowing how many differences would be required to change the host tropism. Sequence comparison revealed 17 nonsynonymous changes between the early sequence of SARS-CoV-2 and the later isolates of SARS-CoV. The changes were found scattered over the genome of the virus, with nine substitutions in ORF1ab, ORF8 (4 substitutions), the spike gene (3 substitutions), and ORF7a (single substitution) (4). Notably, the same nonsynonymous changes were found in a familial cluster, indicating that the viral evolution happened during person-to-person transmission (4, 47). Such adaptive evolution events are frequent and constitute a constantly ongoing process once the virus spreads among new hosts (47). Even though no functional changes occur in the virus associated with this adaptive evolution, close monitoring of the viral mutations that occur during subsequent human-to-human transmission is warranted.\n\nM Protein\nThe M protein is the most abundant viral protein present in the virion particle, giving a definite shape to the viral envelope (48). It binds to the nucleocapsid and acts as a central organizer of coronavirus assembly (49). Coronavirus M proteins are highly diverse in amino acid contents but maintain overall structural similarity within different genera (50). The M protein has three transmembrane domains, flanked by a short amino terminus outside the virion and a long carboxy terminus inside the virion (50). Overall, the viral scaffold is maintained by M-M interaction. Of note, the M protein of SARS-CoV-2 does not have an amino acid substitution compared to that of SARS-CoV (16).\n\nE Protein\nThe coronavirus E protein is the most enigmatic and smallest of the major structural proteins (51). It plays a multifunctional role in the pathogenesis, assembly, and release of the virus (52). It is a small integral membrane polypeptide that acts as a viroporin (ion channel) (53). The inactivation or absence of this protein is related to the altered virulence of coronaviruses due to changes in morphology and tropism (54). The E protein consists of three domains, namely, a short hydrophilic amino terminal, a large hydrophobic transmembrane domain, and an efficient C-terminal domain (51). The SARS-CoV-2 E protein reveals a similar amino acid constitution without any substitution (16).\n\nN Protein\nThe N protein of coronavirus is multipurpose. Among several functions, it plays a role in complex formation with the viral genome, facilitates M protein interaction needed during virion assembly, and enhances the transcription efficiency of the virus (55, 56). It contains three highly conserved and distinct domains, namely, an NTD, an RNA-binding domain or a linker region (LKR), and a CTD (57). The NTD binds with the 3′ end of the viral genome, perhaps via electrostatic interactions, and is highly diverged both in length and sequence (58). The charged LKR is serine and arginine rich and is also known as the SR (serine and arginine) domain (59). The LKR is capable of direct interaction with in vitro RNA interaction and is responsible for cell signaling (60, 61). It also modulates the antiviral response of the host by working as an antagonist for interferon (IFN) and RNA interference (62). Compared to that of SARS-CoV, the N protein of SARS-CoV-2 possess five amino acid mutations, where two are in the intrinsically dispersed region (IDR; positions 25 and 26), one each in the NTD (position 103), LKR (position 217), and CTD (position 334) (16).\n\nnsps and Accessory Proteins\nBesides the important structural proteins, the SARS-CoV-2 genome contains 15 nsps, nsp1 to nsp10 and nsp12 to nsp16, and 8 accessory proteins (3a, 3b, p6, 7a, 7b, 8b, 9b, and ORF14) (16). All these proteins play a specific role in viral replication (27). Unlike the accessory proteins of SARS-CoV, SARS-CoV-2 does not contain 8a protein and has a longer 8b and shorter 3b protein (16). The nsp7, nsp13, envelope, matrix, and p6 and 8b accessory proteins have not been detected with any amino acid substitutions compared to the sequences of other coronaviruses (16).\nThe virus structure of SARS-CoV-2 is depicted in Fig. 2.\nFIG 2 SARS-CoV-2 virus structure.\n\nSARS-CoV-2 Spike Glycoprotein Gene Analysis\n\nSequence percent similarity analysis.\nWe assessed the nucleotide percent similarity using the MegAlign software program, where the similarity between the novel SARS-CoV-2 isolates was in the range of 99.4% to 100%. Among the other Serbecovirus CoV sequences, the novel SARS-CoV-2 sequences revealed the highest similarity to bat-SL-CoV, with nucleotide percent identity ranges between 88.12 and 89.65%. Meanwhile, earlier reported SARS-CoVs showed 70.6 to 74.9% similarity to SARS-CoV-2 at the nucleotide level. Further, the nucleotide percent similarity was 55.4%, 45.5% to 47.9%, 46.2% to 46.6%, and 45.0% to 46.3% to the other four subgenera, namely, Hibecovirus, Nobecovirus, Merbecovirus, and Embecovirus, respectively. The percent similarity index of current outbreak isolates indicates a close relationship between SARS-CoV-2 isolates and bat-SL-CoV, indicating a common origin. However, particular pieces of evidence based on further complete genomic analysis of current isolates are necessary to draw any conclusions, although it was ascertained that the current novel SARS-CoV-2 isolates belong to the subgenus Sarbecovirus in the diverse range of betacoronaviruses. Their possible ancestor was hypothesized to be from bat CoV strains, wherein bats might have played a crucial role in harboring this class of viruses.\n\nSplitsTree phylogeny analysis.\nIn the unrooted phylogenetic tree of different betacoronaviruses based on the S protein, virus sequences from different subgenera grouped into separate clusters. SARS-CoV-2 sequences from Wuhan and other countries exhibited a close relationship and appeared in a single cluster (Fig. 1). The CoVs from the subgenus Sarbecovirus appeared jointly in SplitsTree and divided into three subclusters, namely, SARS-CoV-2, bat-SARS-like-CoV (bat-SL-CoV), and SARS-CoV (Fig. 1). In the case of other subgenera, like Merbecovirus, all of the sequences grouped in a single cluster, whereas in Embecovirus, different species, comprised of canine respiratory CoVs, bovine CoVs, equine CoVs, and human CoV strain (OC43), grouped in a common cluster. Isolates in the subgenera Nobecovorus and Hibecovirus were found to be placed separately away from other reported SARS-CoVs but shared a bat origin."}

    LitCovid-PD-GO-BP

    {"project":"LitCovid-PD-GO-BP","denotations":[{"id":"T92536","span":{"begin":2549,"end":2562},"obj":"http://purl.obolibrary.org/obo/GO_0006351"},{"id":"T24596","span":{"begin":2787,"end":2800},"obj":"http://purl.obolibrary.org/obo/GO_0006351"},{"id":"T68867","span":{"begin":5588,"end":5595},"obj":"http://purl.obolibrary.org/obo/GO_0009606"},{"id":"T97335","span":{"begin":5734,"end":5750},"obj":"http://purl.obolibrary.org/obo/GO_0006955"},{"id":"T96048","span":{"begin":6897,"end":6904},"obj":"http://purl.obolibrary.org/obo/GO_0009606"},{"id":"T60831","span":{"begin":8583,"end":8595},"obj":"http://purl.obolibrary.org/obo/GO_0009405"},{"id":"T2623","span":{"begin":8697,"end":8706},"obj":"http://purl.obolibrary.org/obo/GO_0039707"},{"id":"T70473","span":{"begin":8708,"end":8719},"obj":"http://purl.obolibrary.org/obo/GO_0022831"},{"id":"T71812","span":{"begin":8797,"end":8806},"obj":"http://purl.obolibrary.org/obo/GO_0016032"},{"id":"T78340","span":{"begin":8797,"end":8806},"obj":"http://purl.obolibrary.org/obo/GO_0009405"},{"id":"T63686","span":{"begin":8857,"end":8864},"obj":"http://purl.obolibrary.org/obo/GO_0009606"},{"id":"T6706","span":{"begin":9246,"end":9255},"obj":"http://purl.obolibrary.org/obo/GO_0009058"},{"id":"T85138","span":{"begin":9327,"end":9342},"obj":"http://purl.obolibrary.org/obo/GO_0019068"},{"id":"T63681","span":{"begin":9361,"end":9374},"obj":"http://purl.obolibrary.org/obo/GO_0006351"},{"id":"T76650","span":{"begin":9900,"end":9909},"obj":"http://purl.obolibrary.org/obo/GO_0023052"},{"id":"T96883","span":{"begin":9942,"end":9960},"obj":"http://purl.obolibrary.org/obo/GO_0051607"},{"id":"T60457","span":{"begin":10026,"end":10042},"obj":"http://purl.obolibrary.org/obo/GO_0016246"},{"id":"T35914","span":{"begin":10567,"end":10584},"obj":"http://purl.obolibrary.org/obo/GO_0019079"},{"id":"T63895","span":{"begin":10567,"end":10584},"obj":"http://purl.obolibrary.org/obo/GO_0019058"}],"text":"THE VIRUS (SARS-CoV-2)\nCoronaviruses are positive-sense RNA viruses having an extensive and promiscuous range of natural hosts and affect multiple systems (23, 24). Coronaviruses can cause clinical diseases in humans that may extend from the common cold to more severe respiratory diseases like SARS and MERS (17, 279). The recently emerging SARS-CoV-2 has wrought havoc in China and caused a pandemic situation in the worldwide population, leading to disease outbreaks that have not been controlled to date, although extensive efforts are being put in place to counter this virus (25). This virus has been proposed to be designated/named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses (ICTV), which determined the virus belongs to the Severe acute respiratory syndrome-related coronavirus category and found this virus is related to SARS-CoVs (26). SARS-CoV-2 is a member of the order Nidovirales, family Coronaviridae, subfamily Orthocoronavirinae, which is subdivided into four genera, viz., Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus (3, 27). The genera Alphacoronavirus and Betacoronavirus originate from bats, while Gammacoronavirus and Deltacoronavirus have evolved from bird and swine gene pools (24, 28, 29, 275).\nCoronaviruses possess an unsegmented, single-stranded, positive-sense RNA genome of around 30 kb, enclosed by a 5′-cap and 3′-poly(A) tail (30). The genome of SARS-CoV-2 is 29,891 bp long, with a G+C content of 38% (31). These viruses are encircled with an envelope containing viral nucleocapsid. The nucleocapsids in CoVs are arranged in helical symmetry, which reflects an atypical attribute in positive-sense RNA viruses (30). The electron micrographs of SARS-CoV-2 revealed a diverging spherical outline with some degree of pleomorphism, virion diameters varying from 60 to 140 nm, and distinct spikes of 9 to 12 nm, giving the virus the appearance of a solar corona (3). The CoV genome is arranged linearly as 5′-leader-UTR-replicase-structural genes (S-E-M-N)-3′ UTR-poly(A) (32). Accessory genes, such as 3a/b, 4a/b, and the hemagglutinin-esterase gene (HE), are also seen intermingled with the structural genes (30). SARS-CoV-2 has also been found to be arranged similarly and encodes several accessory proteins, although it lacks the HE, which is characteristic of some betacoronaviruses (31). The positive-sense genome of CoVs serves as the mRNA and is translated to polyprotein 1a/1ab (pp1a/1ab) (33). A replication-transcription complex (RTC) is formed in double-membrane vesicles (DMVs) by nonstructural proteins (nsps), encoded by the polyprotein gene (34). Subsequently, the RTC synthesizes a nested set of subgenomic RNAs (sgRNAs) via discontinuous transcription (35).\nBased on molecular characterization, SARS-CoV-2 is considered a new Betacoronavirus belonging to the subgenus Sarbecovirus (3). A few other critical zoonotic viruses (MERS-related CoV and SARS-related CoV) belong to the same genus. However, SARS-CoV-2 was identified as a distinct virus based on the percent identity with other Betacoronavirus; conserved open reading frame 1a/b (ORF1a/b) is below 90% identity (3). An overall 80% nucleotide identity was observed between SARS-CoV-2 and the original SARS-CoV, along with 89% identity with ZC45 and ZXC21 SARS-related CoVs of bats (2, 31, 36). In addition, 82% identity has been observed between SARS-CoV-2 and human SARS-CoV Tor2 and human SARS-CoV BJ01 2003 (31). A sequence identity of only 51.8% was observed between MERS-related CoV and the recently emerged SARS-CoV-2 (37). Phylogenetic analysis of the structural genes also revealed that SARS-CoV-2 is closer to bat SARS-related CoV. Therefore, SARS-CoV-2 might have originated from bats, while other amplifier hosts might have played a role in disease transmission to humans (31). Of note, the other two zoonotic CoVs (MERS-related CoV and SARS-related CoV) also originated from bats (38, 39). Nevertheless, for SARS and MERS, civet cat and camels, respectively, act as amplifier hosts (40, 41).\nCoronavirus genomes and subgenomes encode six ORFs (31). The majority of the 5′ end is occupied by ORF1a/b, which produces 16 nsps. The two polyproteins, pp1a and pp1ab, are initially produced from ORF1a/b by a −1 frameshift between ORF1a and ORF1b (32). The virus-encoded proteases cleave polyproteins into individual nsps (main protease [Mpro], chymotrypsin-like protease [3CLpro], and papain-like proteases [PLPs]) (42). SARS-CoV-2 also encodes these nsps, and their functions have been elucidated recently (31). Remarkably, a difference between SARS-CoV-2 and other CoVs is the identification of a novel short putative protein within the ORF3 band, a secreted protein with an alpha helix and beta-sheet with six strands encoded by ORF8 (31).\nCoronaviruses encode four major structural proteins, namely, spike (S), membrane (M), envelope (E), and nucleocapsid (N), which are described in detail below.\n\nS Glycoprotein\nCoronavirus S protein is a large, multifunctional class I viral transmembrane protein. The size of this abundant S protein varies from 1,160 amino acids (IBV, infectious bronchitis virus, in poultry) to 1,400 amino acids (FCoV, feline coronavirus) (43). It lies in a trimer on the virion surface, giving the virion a corona or crown-like appearance. Functionally it is required for the entry of the infectious virion particles into the cell through interaction with various host cellular receptors (44).\nFurthermore, it acts as a critical factor for tissue tropism and the determination of host range (45). Notably, S protein is one of the vital immunodominant proteins of CoVs capable of inducing host immune responses (45). The ectodomains in all CoVs S proteins have similar domain organizations, divided into two subunits, S1 and S2 (43). The first one, S1, helps in host receptor binding, while the second one, S2, accounts for fusion. The former (S1) is further divided into two subdomains, namely, the N-terminal domain (NTD) and C-terminal domain (CTD). Both of these subdomains act as receptor-binding domains, interacting efficiently with various host receptors (45). The S1 CTD contains the receptor-binding motif (RBM). In each coronavirus spike protein, the trimeric S1 locates itself on top of the trimeric S2 stalk (45). Recently, structural analyses of the S proteins of COVID-19 have revealed 27 amino acid substitutions within a 1,273-amino-acid stretch (16). Six substitutions are located in the RBD (amino acids 357 to 528), while four substitutions are in the RBM at the CTD of the S1 domain (16). Of note, no amino acid change is seen in the RBM, which binds directly to the angiotensin-converting enzyme-2 (ACE2) receptor in SARS-CoV (16, 46). At present, the main emphasis is knowing how many differences would be required to change the host tropism. Sequence comparison revealed 17 nonsynonymous changes between the early sequence of SARS-CoV-2 and the later isolates of SARS-CoV. The changes were found scattered over the genome of the virus, with nine substitutions in ORF1ab, ORF8 (4 substitutions), the spike gene (3 substitutions), and ORF7a (single substitution) (4). Notably, the same nonsynonymous changes were found in a familial cluster, indicating that the viral evolution happened during person-to-person transmission (4, 47). Such adaptive evolution events are frequent and constitute a constantly ongoing process once the virus spreads among new hosts (47). Even though no functional changes occur in the virus associated with this adaptive evolution, close monitoring of the viral mutations that occur during subsequent human-to-human transmission is warranted.\n\nM Protein\nThe M protein is the most abundant viral protein present in the virion particle, giving a definite shape to the viral envelope (48). It binds to the nucleocapsid and acts as a central organizer of coronavirus assembly (49). Coronavirus M proteins are highly diverse in amino acid contents but maintain overall structural similarity within different genera (50). The M protein has three transmembrane domains, flanked by a short amino terminus outside the virion and a long carboxy terminus inside the virion (50). Overall, the viral scaffold is maintained by M-M interaction. Of note, the M protein of SARS-CoV-2 does not have an amino acid substitution compared to that of SARS-CoV (16).\n\nE Protein\nThe coronavirus E protein is the most enigmatic and smallest of the major structural proteins (51). It plays a multifunctional role in the pathogenesis, assembly, and release of the virus (52). It is a small integral membrane polypeptide that acts as a viroporin (ion channel) (53). The inactivation or absence of this protein is related to the altered virulence of coronaviruses due to changes in morphology and tropism (54). The E protein consists of three domains, namely, a short hydrophilic amino terminal, a large hydrophobic transmembrane domain, and an efficient C-terminal domain (51). The SARS-CoV-2 E protein reveals a similar amino acid constitution without any substitution (16).\n\nN Protein\nThe N protein of coronavirus is multipurpose. Among several functions, it plays a role in complex formation with the viral genome, facilitates M protein interaction needed during virion assembly, and enhances the transcription efficiency of the virus (55, 56). It contains three highly conserved and distinct domains, namely, an NTD, an RNA-binding domain or a linker region (LKR), and a CTD (57). The NTD binds with the 3′ end of the viral genome, perhaps via electrostatic interactions, and is highly diverged both in length and sequence (58). The charged LKR is serine and arginine rich and is also known as the SR (serine and arginine) domain (59). The LKR is capable of direct interaction with in vitro RNA interaction and is responsible for cell signaling (60, 61). It also modulates the antiviral response of the host by working as an antagonist for interferon (IFN) and RNA interference (62). Compared to that of SARS-CoV, the N protein of SARS-CoV-2 possess five amino acid mutations, where two are in the intrinsically dispersed region (IDR; positions 25 and 26), one each in the NTD (position 103), LKR (position 217), and CTD (position 334) (16).\n\nnsps and Accessory Proteins\nBesides the important structural proteins, the SARS-CoV-2 genome contains 15 nsps, nsp1 to nsp10 and nsp12 to nsp16, and 8 accessory proteins (3a, 3b, p6, 7a, 7b, 8b, 9b, and ORF14) (16). All these proteins play a specific role in viral replication (27). Unlike the accessory proteins of SARS-CoV, SARS-CoV-2 does not contain 8a protein and has a longer 8b and shorter 3b protein (16). The nsp7, nsp13, envelope, matrix, and p6 and 8b accessory proteins have not been detected with any amino acid substitutions compared to the sequences of other coronaviruses (16).\nThe virus structure of SARS-CoV-2 is depicted in Fig. 2.\nFIG 2 SARS-CoV-2 virus structure.\n\nSARS-CoV-2 Spike Glycoprotein Gene Analysis\n\nSequence percent similarity analysis.\nWe assessed the nucleotide percent similarity using the MegAlign software program, where the similarity between the novel SARS-CoV-2 isolates was in the range of 99.4% to 100%. Among the other Serbecovirus CoV sequences, the novel SARS-CoV-2 sequences revealed the highest similarity to bat-SL-CoV, with nucleotide percent identity ranges between 88.12 and 89.65%. Meanwhile, earlier reported SARS-CoVs showed 70.6 to 74.9% similarity to SARS-CoV-2 at the nucleotide level. Further, the nucleotide percent similarity was 55.4%, 45.5% to 47.9%, 46.2% to 46.6%, and 45.0% to 46.3% to the other four subgenera, namely, Hibecovirus, Nobecovirus, Merbecovirus, and Embecovirus, respectively. The percent similarity index of current outbreak isolates indicates a close relationship between SARS-CoV-2 isolates and bat-SL-CoV, indicating a common origin. However, particular pieces of evidence based on further complete genomic analysis of current isolates are necessary to draw any conclusions, although it was ascertained that the current novel SARS-CoV-2 isolates belong to the subgenus Sarbecovirus in the diverse range of betacoronaviruses. Their possible ancestor was hypothesized to be from bat CoV strains, wherein bats might have played a crucial role in harboring this class of viruses.\n\nSplitsTree phylogeny analysis.\nIn the unrooted phylogenetic tree of different betacoronaviruses based on the S protein, virus sequences from different subgenera grouped into separate clusters. SARS-CoV-2 sequences from Wuhan and other countries exhibited a close relationship and appeared in a single cluster (Fig. 1). The CoVs from the subgenus Sarbecovirus appeared jointly in SplitsTree and divided into three subclusters, namely, SARS-CoV-2, bat-SARS-like-CoV (bat-SL-CoV), and SARS-CoV (Fig. 1). In the case of other subgenera, like Merbecovirus, all of the sequences grouped in a single cluster, whereas in Embecovirus, different species, comprised of canine respiratory CoVs, bovine CoVs, equine CoVs, and human CoV strain (OC43), grouped in a common cluster. Isolates in the subgenera Nobecovorus and Hibecovirus were found to be placed separately away from other reported SARS-CoVs but shared a bat origin."}

    LitCovid-sentences

    {"project":"LitCovid-sentences","denotations":[{"id":"T59","span":{"begin":0,"end":22},"obj":"Sentence"},{"id":"T60","span":{"begin":23,"end":164},"obj":"Sentence"},{"id":"T61","span":{"begin":165,"end":319},"obj":"Sentence"},{"id":"T62","span":{"begin":320,"end":586},"obj":"Sentence"},{"id":"T63","span":{"begin":587,"end":917},"obj":"Sentence"},{"id":"T64","span":{"begin":918,"end":1145},"obj":"Sentence"},{"id":"T65","span":{"begin":1146,"end":1321},"obj":"Sentence"},{"id":"T66","span":{"begin":1322,"end":1466},"obj":"Sentence"},{"id":"T67","span":{"begin":1467,"end":1542},"obj":"Sentence"},{"id":"T68","span":{"begin":1543,"end":1618},"obj":"Sentence"},{"id":"T69","span":{"begin":1619,"end":1751},"obj":"Sentence"},{"id":"T70","span":{"begin":1752,"end":1997},"obj":"Sentence"},{"id":"T71","span":{"begin":1998,"end":2108},"obj":"Sentence"},{"id":"T72","span":{"begin":2109,"end":2246},"obj":"Sentence"},{"id":"T73","span":{"begin":2247,"end":2424},"obj":"Sentence"},{"id":"T74","span":{"begin":2425,"end":2534},"obj":"Sentence"},{"id":"T75","span":{"begin":2535,"end":2693},"obj":"Sentence"},{"id":"T76","span":{"begin":2694,"end":2806},"obj":"Sentence"},{"id":"T77","span":{"begin":2807,"end":2934},"obj":"Sentence"},{"id":"T78","span":{"begin":2935,"end":3038},"obj":"Sentence"},{"id":"T79","span":{"begin":3039,"end":3222},"obj":"Sentence"},{"id":"T80","span":{"begin":3223,"end":3399},"obj":"Sentence"},{"id":"T81","span":{"begin":3400,"end":3521},"obj":"Sentence"},{"id":"T82","span":{"begin":3522,"end":3635},"obj":"Sentence"},{"id":"T83","span":{"begin":3636,"end":3746},"obj":"Sentence"},{"id":"T84","span":{"begin":3747,"end":3894},"obj":"Sentence"},{"id":"T85","span":{"begin":3895,"end":4007},"obj":"Sentence"},{"id":"T86","span":{"begin":4008,"end":4109},"obj":"Sentence"},{"id":"T87","span":{"begin":4110,"end":4166},"obj":"Sentence"},{"id":"T88","span":{"begin":4167,"end":4241},"obj":"Sentence"},{"id":"T89","span":{"begin":4242,"end":4364},"obj":"Sentence"},{"id":"T90","span":{"begin":4365,"end":4533},"obj":"Sentence"},{"id":"T91","span":{"begin":4534,"end":4625},"obj":"Sentence"},{"id":"T92","span":{"begin":4626,"end":4855},"obj":"Sentence"},{"id":"T93","span":{"begin":4856,"end":5014},"obj":"Sentence"},{"id":"T94","span":{"begin":5016,"end":5030},"obj":"Sentence"},{"id":"T95","span":{"begin":5031,"end":5117},"obj":"Sentence"},{"id":"T96","span":{"begin":5118,"end":5284},"obj":"Sentence"},{"id":"T97","span":{"begin":5285,"end":5380},"obj":"Sentence"},{"id":"T98","span":{"begin":5381,"end":5534},"obj":"Sentence"},{"id":"T99","span":{"begin":5535,"end":5637},"obj":"Sentence"},{"id":"T100","span":{"begin":5638,"end":5756},"obj":"Sentence"},{"id":"T101","span":{"begin":5757,"end":5873},"obj":"Sentence"},{"id":"T102","span":{"begin":5874,"end":5971},"obj":"Sentence"},{"id":"T103","span":{"begin":5972,"end":6092},"obj":"Sentence"},{"id":"T104","span":{"begin":6093,"end":6208},"obj":"Sentence"},{"id":"T105","span":{"begin":6209,"end":6262},"obj":"Sentence"},{"id":"T106","span":{"begin":6263,"end":6366},"obj":"Sentence"},{"id":"T107","span":{"begin":6367,"end":6508},"obj":"Sentence"},{"id":"T108","span":{"begin":6509,"end":6649},"obj":"Sentence"},{"id":"T109","span":{"begin":6650,"end":6797},"obj":"Sentence"},{"id":"T110","span":{"begin":6798,"end":6905},"obj":"Sentence"},{"id":"T111","span":{"begin":6906,"end":7036},"obj":"Sentence"},{"id":"T112","span":{"begin":7037,"end":7229},"obj":"Sentence"},{"id":"T113","span":{"begin":7230,"end":7394},"obj":"Sentence"},{"id":"T114","span":{"begin":7395,"end":7527},"obj":"Sentence"},{"id":"T115","span":{"begin":7528,"end":7732},"obj":"Sentence"},{"id":"T116","span":{"begin":7734,"end":7743},"obj":"Sentence"},{"id":"T117","span":{"begin":7744,"end":7876},"obj":"Sentence"},{"id":"T118","span":{"begin":7877,"end":7967},"obj":"Sentence"},{"id":"T119","span":{"begin":7968,"end":8105},"obj":"Sentence"},{"id":"T120","span":{"begin":8106,"end":8257},"obj":"Sentence"},{"id":"T121","span":{"begin":8258,"end":8319},"obj":"Sentence"},{"id":"T122","span":{"begin":8320,"end":8432},"obj":"Sentence"},{"id":"T123","span":{"begin":8434,"end":8443},"obj":"Sentence"},{"id":"T124","span":{"begin":8444,"end":8543},"obj":"Sentence"},{"id":"T125","span":{"begin":8544,"end":8637},"obj":"Sentence"},{"id":"T126","span":{"begin":8638,"end":8726},"obj":"Sentence"},{"id":"T127","span":{"begin":8727,"end":8870},"obj":"Sentence"},{"id":"T128","span":{"begin":8871,"end":9038},"obj":"Sentence"},{"id":"T129","span":{"begin":9039,"end":9136},"obj":"Sentence"},{"id":"T130","span":{"begin":9138,"end":9147},"obj":"Sentence"},{"id":"T131","span":{"begin":9148,"end":9193},"obj":"Sentence"},{"id":"T132","span":{"begin":9194,"end":9408},"obj":"Sentence"},{"id":"T133","span":{"begin":9409,"end":9545},"obj":"Sentence"},{"id":"T134","span":{"begin":9546,"end":9693},"obj":"Sentence"},{"id":"T135","span":{"begin":9694,"end":9800},"obj":"Sentence"},{"id":"T136","span":{"begin":9801,"end":9919},"obj":"Sentence"},{"id":"T137","span":{"begin":9920,"end":10048},"obj":"Sentence"},{"id":"T138","span":{"begin":10049,"end":10306},"obj":"Sentence"},{"id":"T139","span":{"begin":10308,"end":10335},"obj":"Sentence"},{"id":"T140","span":{"begin":10336,"end":10523},"obj":"Sentence"},{"id":"T141","span":{"begin":10524,"end":10590},"obj":"Sentence"},{"id":"T142","span":{"begin":10591,"end":10721},"obj":"Sentence"},{"id":"T143","span":{"begin":10722,"end":10901},"obj":"Sentence"},{"id":"T144","span":{"begin":10902,"end":10958},"obj":"Sentence"},{"id":"T145","span":{"begin":10959,"end":10993},"obj":"Sentence"},{"id":"T146","span":{"begin":10995,"end":11038},"obj":"Sentence"},{"id":"T147","span":{"begin":11040,"end":11077},"obj":"Sentence"},{"id":"T148","span":{"begin":11078,"end":11254},"obj":"Sentence"},{"id":"T149","span":{"begin":11255,"end":11442},"obj":"Sentence"},{"id":"T150","span":{"begin":11443,"end":11551},"obj":"Sentence"},{"id":"T151","span":{"begin":11552,"end":11764},"obj":"Sentence"},{"id":"T152","span":{"begin":11765,"end":11925},"obj":"Sentence"},{"id":"T153","span":{"begin":11926,"end":12216},"obj":"Sentence"},{"id":"T154","span":{"begin":12217,"end":12367},"obj":"Sentence"},{"id":"T155","span":{"begin":12369,"end":12399},"obj":"Sentence"},{"id":"T156","span":{"begin":12400,"end":12561},"obj":"Sentence"},{"id":"T157","span":{"begin":12562,"end":12687},"obj":"Sentence"},{"id":"T158","span":{"begin":12688,"end":12869},"obj":"Sentence"},{"id":"T159","span":{"begin":12870,"end":13135},"obj":"Sentence"},{"id":"T160","span":{"begin":13136,"end":13284},"obj":"Sentence"}],"namespaces":[{"prefix":"_base","uri":"http://pubannotation.org/ontology/tao.owl#"}],"text":"THE VIRUS (SARS-CoV-2)\nCoronaviruses are positive-sense RNA viruses having an extensive and promiscuous range of natural hosts and affect multiple systems (23, 24). Coronaviruses can cause clinical diseases in humans that may extend from the common cold to more severe respiratory diseases like SARS and MERS (17, 279). The recently emerging SARS-CoV-2 has wrought havoc in China and caused a pandemic situation in the worldwide population, leading to disease outbreaks that have not been controlled to date, although extensive efforts are being put in place to counter this virus (25). This virus has been proposed to be designated/named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses (ICTV), which determined the virus belongs to the Severe acute respiratory syndrome-related coronavirus category and found this virus is related to SARS-CoVs (26). SARS-CoV-2 is a member of the order Nidovirales, family Coronaviridae, subfamily Orthocoronavirinae, which is subdivided into four genera, viz., Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus (3, 27). The genera Alphacoronavirus and Betacoronavirus originate from bats, while Gammacoronavirus and Deltacoronavirus have evolved from bird and swine gene pools (24, 28, 29, 275).\nCoronaviruses possess an unsegmented, single-stranded, positive-sense RNA genome of around 30 kb, enclosed by a 5′-cap and 3′-poly(A) tail (30). The genome of SARS-CoV-2 is 29,891 bp long, with a G+C content of 38% (31). These viruses are encircled with an envelope containing viral nucleocapsid. The nucleocapsids in CoVs are arranged in helical symmetry, which reflects an atypical attribute in positive-sense RNA viruses (30). The electron micrographs of SARS-CoV-2 revealed a diverging spherical outline with some degree of pleomorphism, virion diameters varying from 60 to 140 nm, and distinct spikes of 9 to 12 nm, giving the virus the appearance of a solar corona (3). The CoV genome is arranged linearly as 5′-leader-UTR-replicase-structural genes (S-E-M-N)-3′ UTR-poly(A) (32). Accessory genes, such as 3a/b, 4a/b, and the hemagglutinin-esterase gene (HE), are also seen intermingled with the structural genes (30). SARS-CoV-2 has also been found to be arranged similarly and encodes several accessory proteins, although it lacks the HE, which is characteristic of some betacoronaviruses (31). The positive-sense genome of CoVs serves as the mRNA and is translated to polyprotein 1a/1ab (pp1a/1ab) (33). A replication-transcription complex (RTC) is formed in double-membrane vesicles (DMVs) by nonstructural proteins (nsps), encoded by the polyprotein gene (34). Subsequently, the RTC synthesizes a nested set of subgenomic RNAs (sgRNAs) via discontinuous transcription (35).\nBased on molecular characterization, SARS-CoV-2 is considered a new Betacoronavirus belonging to the subgenus Sarbecovirus (3). A few other critical zoonotic viruses (MERS-related CoV and SARS-related CoV) belong to the same genus. However, SARS-CoV-2 was identified as a distinct virus based on the percent identity with other Betacoronavirus; conserved open reading frame 1a/b (ORF1a/b) is below 90% identity (3). An overall 80% nucleotide identity was observed between SARS-CoV-2 and the original SARS-CoV, along with 89% identity with ZC45 and ZXC21 SARS-related CoVs of bats (2, 31, 36). In addition, 82% identity has been observed between SARS-CoV-2 and human SARS-CoV Tor2 and human SARS-CoV BJ01 2003 (31). A sequence identity of only 51.8% was observed between MERS-related CoV and the recently emerged SARS-CoV-2 (37). Phylogenetic analysis of the structural genes also revealed that SARS-CoV-2 is closer to bat SARS-related CoV. Therefore, SARS-CoV-2 might have originated from bats, while other amplifier hosts might have played a role in disease transmission to humans (31). Of note, the other two zoonotic CoVs (MERS-related CoV and SARS-related CoV) also originated from bats (38, 39). Nevertheless, for SARS and MERS, civet cat and camels, respectively, act as amplifier hosts (40, 41).\nCoronavirus genomes and subgenomes encode six ORFs (31). The majority of the 5′ end is occupied by ORF1a/b, which produces 16 nsps. The two polyproteins, pp1a and pp1ab, are initially produced from ORF1a/b by a −1 frameshift between ORF1a and ORF1b (32). The virus-encoded proteases cleave polyproteins into individual nsps (main protease [Mpro], chymotrypsin-like protease [3CLpro], and papain-like proteases [PLPs]) (42). SARS-CoV-2 also encodes these nsps, and their functions have been elucidated recently (31). Remarkably, a difference between SARS-CoV-2 and other CoVs is the identification of a novel short putative protein within the ORF3 band, a secreted protein with an alpha helix and beta-sheet with six strands encoded by ORF8 (31).\nCoronaviruses encode four major structural proteins, namely, spike (S), membrane (M), envelope (E), and nucleocapsid (N), which are described in detail below.\n\nS Glycoprotein\nCoronavirus S protein is a large, multifunctional class I viral transmembrane protein. The size of this abundant S protein varies from 1,160 amino acids (IBV, infectious bronchitis virus, in poultry) to 1,400 amino acids (FCoV, feline coronavirus) (43). It lies in a trimer on the virion surface, giving the virion a corona or crown-like appearance. Functionally it is required for the entry of the infectious virion particles into the cell through interaction with various host cellular receptors (44).\nFurthermore, it acts as a critical factor for tissue tropism and the determination of host range (45). Notably, S protein is one of the vital immunodominant proteins of CoVs capable of inducing host immune responses (45). The ectodomains in all CoVs S proteins have similar domain organizations, divided into two subunits, S1 and S2 (43). The first one, S1, helps in host receptor binding, while the second one, S2, accounts for fusion. The former (S1) is further divided into two subdomains, namely, the N-terminal domain (NTD) and C-terminal domain (CTD). Both of these subdomains act as receptor-binding domains, interacting efficiently with various host receptors (45). The S1 CTD contains the receptor-binding motif (RBM). In each coronavirus spike protein, the trimeric S1 locates itself on top of the trimeric S2 stalk (45). Recently, structural analyses of the S proteins of COVID-19 have revealed 27 amino acid substitutions within a 1,273-amino-acid stretch (16). Six substitutions are located in the RBD (amino acids 357 to 528), while four substitutions are in the RBM at the CTD of the S1 domain (16). Of note, no amino acid change is seen in the RBM, which binds directly to the angiotensin-converting enzyme-2 (ACE2) receptor in SARS-CoV (16, 46). At present, the main emphasis is knowing how many differences would be required to change the host tropism. Sequence comparison revealed 17 nonsynonymous changes between the early sequence of SARS-CoV-2 and the later isolates of SARS-CoV. The changes were found scattered over the genome of the virus, with nine substitutions in ORF1ab, ORF8 (4 substitutions), the spike gene (3 substitutions), and ORF7a (single substitution) (4). Notably, the same nonsynonymous changes were found in a familial cluster, indicating that the viral evolution happened during person-to-person transmission (4, 47). Such adaptive evolution events are frequent and constitute a constantly ongoing process once the virus spreads among new hosts (47). Even though no functional changes occur in the virus associated with this adaptive evolution, close monitoring of the viral mutations that occur during subsequent human-to-human transmission is warranted.\n\nM Protein\nThe M protein is the most abundant viral protein present in the virion particle, giving a definite shape to the viral envelope (48). It binds to the nucleocapsid and acts as a central organizer of coronavirus assembly (49). Coronavirus M proteins are highly diverse in amino acid contents but maintain overall structural similarity within different genera (50). The M protein has three transmembrane domains, flanked by a short amino terminus outside the virion and a long carboxy terminus inside the virion (50). Overall, the viral scaffold is maintained by M-M interaction. Of note, the M protein of SARS-CoV-2 does not have an amino acid substitution compared to that of SARS-CoV (16).\n\nE Protein\nThe coronavirus E protein is the most enigmatic and smallest of the major structural proteins (51). It plays a multifunctional role in the pathogenesis, assembly, and release of the virus (52). It is a small integral membrane polypeptide that acts as a viroporin (ion channel) (53). The inactivation or absence of this protein is related to the altered virulence of coronaviruses due to changes in morphology and tropism (54). The E protein consists of three domains, namely, a short hydrophilic amino terminal, a large hydrophobic transmembrane domain, and an efficient C-terminal domain (51). The SARS-CoV-2 E protein reveals a similar amino acid constitution without any substitution (16).\n\nN Protein\nThe N protein of coronavirus is multipurpose. Among several functions, it plays a role in complex formation with the viral genome, facilitates M protein interaction needed during virion assembly, and enhances the transcription efficiency of the virus (55, 56). It contains three highly conserved and distinct domains, namely, an NTD, an RNA-binding domain or a linker region (LKR), and a CTD (57). The NTD binds with the 3′ end of the viral genome, perhaps via electrostatic interactions, and is highly diverged both in length and sequence (58). The charged LKR is serine and arginine rich and is also known as the SR (serine and arginine) domain (59). The LKR is capable of direct interaction with in vitro RNA interaction and is responsible for cell signaling (60, 61). It also modulates the antiviral response of the host by working as an antagonist for interferon (IFN) and RNA interference (62). Compared to that of SARS-CoV, the N protein of SARS-CoV-2 possess five amino acid mutations, where two are in the intrinsically dispersed region (IDR; positions 25 and 26), one each in the NTD (position 103), LKR (position 217), and CTD (position 334) (16).\n\nnsps and Accessory Proteins\nBesides the important structural proteins, the SARS-CoV-2 genome contains 15 nsps, nsp1 to nsp10 and nsp12 to nsp16, and 8 accessory proteins (3a, 3b, p6, 7a, 7b, 8b, 9b, and ORF14) (16). All these proteins play a specific role in viral replication (27). Unlike the accessory proteins of SARS-CoV, SARS-CoV-2 does not contain 8a protein and has a longer 8b and shorter 3b protein (16). The nsp7, nsp13, envelope, matrix, and p6 and 8b accessory proteins have not been detected with any amino acid substitutions compared to the sequences of other coronaviruses (16).\nThe virus structure of SARS-CoV-2 is depicted in Fig. 2.\nFIG 2 SARS-CoV-2 virus structure.\n\nSARS-CoV-2 Spike Glycoprotein Gene Analysis\n\nSequence percent similarity analysis.\nWe assessed the nucleotide percent similarity using the MegAlign software program, where the similarity between the novel SARS-CoV-2 isolates was in the range of 99.4% to 100%. Among the other Serbecovirus CoV sequences, the novel SARS-CoV-2 sequences revealed the highest similarity to bat-SL-CoV, with nucleotide percent identity ranges between 88.12 and 89.65%. Meanwhile, earlier reported SARS-CoVs showed 70.6 to 74.9% similarity to SARS-CoV-2 at the nucleotide level. Further, the nucleotide percent similarity was 55.4%, 45.5% to 47.9%, 46.2% to 46.6%, and 45.0% to 46.3% to the other four subgenera, namely, Hibecovirus, Nobecovirus, Merbecovirus, and Embecovirus, respectively. The percent similarity index of current outbreak isolates indicates a close relationship between SARS-CoV-2 isolates and bat-SL-CoV, indicating a common origin. However, particular pieces of evidence based on further complete genomic analysis of current isolates are necessary to draw any conclusions, although it was ascertained that the current novel SARS-CoV-2 isolates belong to the subgenus Sarbecovirus in the diverse range of betacoronaviruses. Their possible ancestor was hypothesized to be from bat CoV strains, wherein bats might have played a crucial role in harboring this class of viruses.\n\nSplitsTree phylogeny analysis.\nIn the unrooted phylogenetic tree of different betacoronaviruses based on the S protein, virus sequences from different subgenera grouped into separate clusters. SARS-CoV-2 sequences from Wuhan and other countries exhibited a close relationship and appeared in a single cluster (Fig. 1). The CoVs from the subgenus Sarbecovirus appeared jointly in SplitsTree and divided into three subclusters, namely, SARS-CoV-2, bat-SARS-like-CoV (bat-SL-CoV), and SARS-CoV (Fig. 1). In the case of other subgenera, like Merbecovirus, all of the sequences grouped in a single cluster, whereas in Embecovirus, different species, comprised of canine respiratory CoVs, bovine CoVs, equine CoVs, and human CoV strain (OC43), grouped in a common cluster. Isolates in the subgenera Nobecovorus and Hibecovirus were found to be placed separately away from other reported SARS-CoVs but shared a bat origin."}

    LitCovid-PD-HP

    {"project":"LitCovid-PD-HP","denotations":[{"id":"T13","span":{"begin":5201,"end":5211},"obj":"Phenotype"}],"attributes":[{"id":"A13","pred":"hp_id","subj":"T13","obj":"http://purl.obolibrary.org/obo/HP_0012387"}],"text":"THE VIRUS (SARS-CoV-2)\nCoronaviruses are positive-sense RNA viruses having an extensive and promiscuous range of natural hosts and affect multiple systems (23, 24). Coronaviruses can cause clinical diseases in humans that may extend from the common cold to more severe respiratory diseases like SARS and MERS (17, 279). The recently emerging SARS-CoV-2 has wrought havoc in China and caused a pandemic situation in the worldwide population, leading to disease outbreaks that have not been controlled to date, although extensive efforts are being put in place to counter this virus (25). This virus has been proposed to be designated/named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses (ICTV), which determined the virus belongs to the Severe acute respiratory syndrome-related coronavirus category and found this virus is related to SARS-CoVs (26). SARS-CoV-2 is a member of the order Nidovirales, family Coronaviridae, subfamily Orthocoronavirinae, which is subdivided into four genera, viz., Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus (3, 27). The genera Alphacoronavirus and Betacoronavirus originate from bats, while Gammacoronavirus and Deltacoronavirus have evolved from bird and swine gene pools (24, 28, 29, 275).\nCoronaviruses possess an unsegmented, single-stranded, positive-sense RNA genome of around 30 kb, enclosed by a 5′-cap and 3′-poly(A) tail (30). The genome of SARS-CoV-2 is 29,891 bp long, with a G+C content of 38% (31). These viruses are encircled with an envelope containing viral nucleocapsid. The nucleocapsids in CoVs are arranged in helical symmetry, which reflects an atypical attribute in positive-sense RNA viruses (30). The electron micrographs of SARS-CoV-2 revealed a diverging spherical outline with some degree of pleomorphism, virion diameters varying from 60 to 140 nm, and distinct spikes of 9 to 12 nm, giving the virus the appearance of a solar corona (3). The CoV genome is arranged linearly as 5′-leader-UTR-replicase-structural genes (S-E-M-N)-3′ UTR-poly(A) (32). Accessory genes, such as 3a/b, 4a/b, and the hemagglutinin-esterase gene (HE), are also seen intermingled with the structural genes (30). SARS-CoV-2 has also been found to be arranged similarly and encodes several accessory proteins, although it lacks the HE, which is characteristic of some betacoronaviruses (31). The positive-sense genome of CoVs serves as the mRNA and is translated to polyprotein 1a/1ab (pp1a/1ab) (33). A replication-transcription complex (RTC) is formed in double-membrane vesicles (DMVs) by nonstructural proteins (nsps), encoded by the polyprotein gene (34). Subsequently, the RTC synthesizes a nested set of subgenomic RNAs (sgRNAs) via discontinuous transcription (35).\nBased on molecular characterization, SARS-CoV-2 is considered a new Betacoronavirus belonging to the subgenus Sarbecovirus (3). A few other critical zoonotic viruses (MERS-related CoV and SARS-related CoV) belong to the same genus. However, SARS-CoV-2 was identified as a distinct virus based on the percent identity with other Betacoronavirus; conserved open reading frame 1a/b (ORF1a/b) is below 90% identity (3). An overall 80% nucleotide identity was observed between SARS-CoV-2 and the original SARS-CoV, along with 89% identity with ZC45 and ZXC21 SARS-related CoVs of bats (2, 31, 36). In addition, 82% identity has been observed between SARS-CoV-2 and human SARS-CoV Tor2 and human SARS-CoV BJ01 2003 (31). A sequence identity of only 51.8% was observed between MERS-related CoV and the recently emerged SARS-CoV-2 (37). Phylogenetic analysis of the structural genes also revealed that SARS-CoV-2 is closer to bat SARS-related CoV. Therefore, SARS-CoV-2 might have originated from bats, while other amplifier hosts might have played a role in disease transmission to humans (31). Of note, the other two zoonotic CoVs (MERS-related CoV and SARS-related CoV) also originated from bats (38, 39). Nevertheless, for SARS and MERS, civet cat and camels, respectively, act as amplifier hosts (40, 41).\nCoronavirus genomes and subgenomes encode six ORFs (31). The majority of the 5′ end is occupied by ORF1a/b, which produces 16 nsps. The two polyproteins, pp1a and pp1ab, are initially produced from ORF1a/b by a −1 frameshift between ORF1a and ORF1b (32). The virus-encoded proteases cleave polyproteins into individual nsps (main protease [Mpro], chymotrypsin-like protease [3CLpro], and papain-like proteases [PLPs]) (42). SARS-CoV-2 also encodes these nsps, and their functions have been elucidated recently (31). Remarkably, a difference between SARS-CoV-2 and other CoVs is the identification of a novel short putative protein within the ORF3 band, a secreted protein with an alpha helix and beta-sheet with six strands encoded by ORF8 (31).\nCoronaviruses encode four major structural proteins, namely, spike (S), membrane (M), envelope (E), and nucleocapsid (N), which are described in detail below.\n\nS Glycoprotein\nCoronavirus S protein is a large, multifunctional class I viral transmembrane protein. The size of this abundant S protein varies from 1,160 amino acids (IBV, infectious bronchitis virus, in poultry) to 1,400 amino acids (FCoV, feline coronavirus) (43). It lies in a trimer on the virion surface, giving the virion a corona or crown-like appearance. Functionally it is required for the entry of the infectious virion particles into the cell through interaction with various host cellular receptors (44).\nFurthermore, it acts as a critical factor for tissue tropism and the determination of host range (45). Notably, S protein is one of the vital immunodominant proteins of CoVs capable of inducing host immune responses (45). The ectodomains in all CoVs S proteins have similar domain organizations, divided into two subunits, S1 and S2 (43). The first one, S1, helps in host receptor binding, while the second one, S2, accounts for fusion. The former (S1) is further divided into two subdomains, namely, the N-terminal domain (NTD) and C-terminal domain (CTD). Both of these subdomains act as receptor-binding domains, interacting efficiently with various host receptors (45). The S1 CTD contains the receptor-binding motif (RBM). In each coronavirus spike protein, the trimeric S1 locates itself on top of the trimeric S2 stalk (45). Recently, structural analyses of the S proteins of COVID-19 have revealed 27 amino acid substitutions within a 1,273-amino-acid stretch (16). Six substitutions are located in the RBD (amino acids 357 to 528), while four substitutions are in the RBM at the CTD of the S1 domain (16). Of note, no amino acid change is seen in the RBM, which binds directly to the angiotensin-converting enzyme-2 (ACE2) receptor in SARS-CoV (16, 46). At present, the main emphasis is knowing how many differences would be required to change the host tropism. Sequence comparison revealed 17 nonsynonymous changes between the early sequence of SARS-CoV-2 and the later isolates of SARS-CoV. The changes were found scattered over the genome of the virus, with nine substitutions in ORF1ab, ORF8 (4 substitutions), the spike gene (3 substitutions), and ORF7a (single substitution) (4). Notably, the same nonsynonymous changes were found in a familial cluster, indicating that the viral evolution happened during person-to-person transmission (4, 47). Such adaptive evolution events are frequent and constitute a constantly ongoing process once the virus spreads among new hosts (47). Even though no functional changes occur in the virus associated with this adaptive evolution, close monitoring of the viral mutations that occur during subsequent human-to-human transmission is warranted.\n\nM Protein\nThe M protein is the most abundant viral protein present in the virion particle, giving a definite shape to the viral envelope (48). It binds to the nucleocapsid and acts as a central organizer of coronavirus assembly (49). Coronavirus M proteins are highly diverse in amino acid contents but maintain overall structural similarity within different genera (50). The M protein has three transmembrane domains, flanked by a short amino terminus outside the virion and a long carboxy terminus inside the virion (50). Overall, the viral scaffold is maintained by M-M interaction. Of note, the M protein of SARS-CoV-2 does not have an amino acid substitution compared to that of SARS-CoV (16).\n\nE Protein\nThe coronavirus E protein is the most enigmatic and smallest of the major structural proteins (51). It plays a multifunctional role in the pathogenesis, assembly, and release of the virus (52). It is a small integral membrane polypeptide that acts as a viroporin (ion channel) (53). The inactivation or absence of this protein is related to the altered virulence of coronaviruses due to changes in morphology and tropism (54). The E protein consists of three domains, namely, a short hydrophilic amino terminal, a large hydrophobic transmembrane domain, and an efficient C-terminal domain (51). The SARS-CoV-2 E protein reveals a similar amino acid constitution without any substitution (16).\n\nN Protein\nThe N protein of coronavirus is multipurpose. Among several functions, it plays a role in complex formation with the viral genome, facilitates M protein interaction needed during virion assembly, and enhances the transcription efficiency of the virus (55, 56). It contains three highly conserved and distinct domains, namely, an NTD, an RNA-binding domain or a linker region (LKR), and a CTD (57). The NTD binds with the 3′ end of the viral genome, perhaps via electrostatic interactions, and is highly diverged both in length and sequence (58). The charged LKR is serine and arginine rich and is also known as the SR (serine and arginine) domain (59). The LKR is capable of direct interaction with in vitro RNA interaction and is responsible for cell signaling (60, 61). It also modulates the antiviral response of the host by working as an antagonist for interferon (IFN) and RNA interference (62). Compared to that of SARS-CoV, the N protein of SARS-CoV-2 possess five amino acid mutations, where two are in the intrinsically dispersed region (IDR; positions 25 and 26), one each in the NTD (position 103), LKR (position 217), and CTD (position 334) (16).\n\nnsps and Accessory Proteins\nBesides the important structural proteins, the SARS-CoV-2 genome contains 15 nsps, nsp1 to nsp10 and nsp12 to nsp16, and 8 accessory proteins (3a, 3b, p6, 7a, 7b, 8b, 9b, and ORF14) (16). All these proteins play a specific role in viral replication (27). Unlike the accessory proteins of SARS-CoV, SARS-CoV-2 does not contain 8a protein and has a longer 8b and shorter 3b protein (16). The nsp7, nsp13, envelope, matrix, and p6 and 8b accessory proteins have not been detected with any amino acid substitutions compared to the sequences of other coronaviruses (16).\nThe virus structure of SARS-CoV-2 is depicted in Fig. 2.\nFIG 2 SARS-CoV-2 virus structure.\n\nSARS-CoV-2 Spike Glycoprotein Gene Analysis\n\nSequence percent similarity analysis.\nWe assessed the nucleotide percent similarity using the MegAlign software program, where the similarity between the novel SARS-CoV-2 isolates was in the range of 99.4% to 100%. Among the other Serbecovirus CoV sequences, the novel SARS-CoV-2 sequences revealed the highest similarity to bat-SL-CoV, with nucleotide percent identity ranges between 88.12 and 89.65%. Meanwhile, earlier reported SARS-CoVs showed 70.6 to 74.9% similarity to SARS-CoV-2 at the nucleotide level. Further, the nucleotide percent similarity was 55.4%, 45.5% to 47.9%, 46.2% to 46.6%, and 45.0% to 46.3% to the other four subgenera, namely, Hibecovirus, Nobecovirus, Merbecovirus, and Embecovirus, respectively. The percent similarity index of current outbreak isolates indicates a close relationship between SARS-CoV-2 isolates and bat-SL-CoV, indicating a common origin. However, particular pieces of evidence based on further complete genomic analysis of current isolates are necessary to draw any conclusions, although it was ascertained that the current novel SARS-CoV-2 isolates belong to the subgenus Sarbecovirus in the diverse range of betacoronaviruses. Their possible ancestor was hypothesized to be from bat CoV strains, wherein bats might have played a crucial role in harboring this class of viruses.\n\nSplitsTree phylogeny analysis.\nIn the unrooted phylogenetic tree of different betacoronaviruses based on the S protein, virus sequences from different subgenera grouped into separate clusters. SARS-CoV-2 sequences from Wuhan and other countries exhibited a close relationship and appeared in a single cluster (Fig. 1). The CoVs from the subgenus Sarbecovirus appeared jointly in SplitsTree and divided into three subclusters, namely, SARS-CoV-2, bat-SARS-like-CoV (bat-SL-CoV), and SARS-CoV (Fig. 1). In the case of other subgenera, like Merbecovirus, all of the sequences grouped in a single cluster, whereas in Embecovirus, different species, comprised of canine respiratory CoVs, bovine CoVs, equine CoVs, and human CoV strain (OC43), grouped in a common cluster. Isolates in the subgenera Nobecovorus and Hibecovirus were found to be placed separately away from other reported SARS-CoVs but shared a bat origin."}

    LitCovid-PubTator

    {"project":"LitCovid-PubTator","denotations":[{"id":"327","span":{"begin":11,"end":21},"obj":"Species"},{"id":"344","span":{"begin":23,"end":36},"obj":"Species"},{"id":"345","span":{"begin":165,"end":178},"obj":"Species"},{"id":"346","span":{"begin":210,"end":216},"obj":"Species"},{"id":"347","span":{"begin":342,"end":352},"obj":"Species"},{"id":"348","span":{"begin":639,"end":686},"obj":"Species"},{"id":"349","span":{"begin":688,"end":698},"obj":"Species"},{"id":"350","span":{"begin":804,"end":857},"obj":"Species"},{"id":"351","span":{"begin":902,"end":911},"obj":"Species"},{"id":"352","span":{"begin":918,"end":928},"obj":"Species"},{"id":"353","span":{"begin":974,"end":987},"obj":"Species"},{"id":"354","span":{"begin":1081,"end":1096},"obj":"Species"},{"id":"355","span":{"begin":1178,"end":1193},"obj":"Species"},{"id":"356","span":{"begin":1286,"end":1291},"obj":"Species"},{"id":"357","span":{"begin":374,"end":379},"obj":"Species"},{"id":"358","span":{"begin":249,"end":253},"obj":"Disease"},{"id":"359","span":{"begin":269,"end":289},"obj":"Disease"},{"id":"371","span":{"begin":2126,"end":2144},"obj":"Gene"},{"id":"372","span":{"begin":1605,"end":1617},"obj":"Gene"},{"id":"373","span":{"begin":2597,"end":2605},"obj":"Gene"},{"id":"374","span":{"begin":1322,"end":1335},"obj":"Species"},{"id":"375","span":{"begin":1481,"end":1491},"obj":"Species"},{"id":"376","span":{"begin":1640,"end":1644},"obj":"Species"},{"id":"377","span":{"begin":1780,"end":1790},"obj":"Species"},{"id":"378","span":{"begin":2002,"end":2005},"obj":"Species"},{"id":"379","span":{"begin":2247,"end":2257},"obj":"Species"},{"id":"380","span":{"begin":2401,"end":2418},"obj":"Species"},{"id":"381","span":{"begin":2454,"end":2458},"obj":"Species"},{"id":"412","span":{"begin":3162,"end":3185},"obj":"Gene"},{"id":"413","span":{"begin":3187,"end":3194},"obj":"Gene"},{"id":"414","span":{"begin":3725,"end":3728},"obj":"Gene"},{"id":"415","span":{"begin":2844,"end":2854},"obj":"Species"},{"id":"416","span":{"begin":2875,"end":2890},"obj":"Species"},{"id":"417","span":{"begin":2987,"end":2990},"obj":"Species"},{"id":"418","span":{"begin":2995,"end":3011},"obj":"Species"},{"id":"419","span":{"begin":3048,"end":3058},"obj":"Species"},{"id":"420","span":{"begin":3135,"end":3150},"obj":"Species"},{"id":"421","span":{"begin":3279,"end":3289},"obj":"Species"},{"id":"422","span":{"begin":3307,"end":3315},"obj":"Species"},{"id":"423","span":{"begin":3361,"end":3378},"obj":"Species"},{"id":"424","span":{"begin":3452,"end":3462},"obj":"Species"},{"id":"425","span":{"begin":3467,"end":3472},"obj":"Species"},{"id":"426","span":{"begin":3473,"end":3481},"obj":"Species"},{"id":"427","span":{"begin":3491,"end":3496},"obj":"Species"},{"id":"428","span":{"begin":3497,"end":3505},"obj":"Species"},{"id":"429","span":{"begin":3590,"end":3593},"obj":"Species"},{"id":"430","span":{"begin":3619,"end":3629},"obj":"Species"},{"id":"431","span":{"begin":3701,"end":3711},"obj":"Species"},{"id":"432","span":{"begin":3729,"end":3745},"obj":"Species"},{"id":"433","span":{"begin":3758,"end":3768},"obj":"Species"},{"id":"434","span":{"begin":3882,"end":3888},"obj":"Species"},{"id":"435","span":{"begin":3927,"end":3931},"obj":"Species"},{"id":"436","span":{"begin":3946,"end":3949},"obj":"Species"},{"id":"437","span":{"begin":3954,"end":3970},"obj":"Species"},{"id":"438","span":{"begin":4055,"end":4061},"obj":"Species"},{"id":"439","span":{"begin":3482,"end":3486},"obj":"Species"},{"id":"440","span":{"begin":2956,"end":2972},"obj":"Disease"},{"id":"441","span":{"begin":3918,"end":3926},"obj":"Disease"},{"id":"451","span":{"begin":4209,"end":4216},"obj":"Gene"},{"id":"452","span":{"begin":4264,"end":4268},"obj":"Gene"},{"id":"453","span":{"begin":4308,"end":4315},"obj":"Gene"},{"id":"454","span":{"begin":4450,"end":4454},"obj":"Gene"},{"id":"455","span":{"begin":4845,"end":4849},"obj":"Gene"},{"id":"456","span":{"begin":4110,"end":4121},"obj":"Species"},{"id":"457","span":{"begin":4534,"end":4544},"obj":"Species"},{"id":"458","span":{"begin":4659,"end":4669},"obj":"Species"},{"id":"459","span":{"begin":4680,"end":4684},"obj":"Species"},{"id":"468","span":{"begin":4924,"end":4925},"obj":"Gene"},{"id":"469","span":{"begin":4928,"end":4936},"obj":"Gene"},{"id":"470","span":{"begin":4938,"end":4939},"obj":"Gene"},{"id":"471","span":{"begin":4960,"end":4972},"obj":"Gene"},{"id":"472","span":{"begin":4974,"end":4975},"obj":"Gene"},{"id":"473","span":{"begin":4952,"end":4953},"obj":"Gene"},{"id":"474","span":{"begin":4917,"end":4922},"obj":"Gene"},{"id":"475","span":{"begin":4856,"end":4869},"obj":"Species"},{"id":"477","span":{"begin":5016,"end":5017},"obj":"Gene"},{"id":"483","span":{"begin":5043,"end":5044},"obj":"Gene"},{"id":"484","span":{"begin":5031,"end":5042},"obj":"Species"},{"id":"485","span":{"begin":5190,"end":5217},"obj":"Species"},{"id":"486","span":{"begin":5259,"end":5277},"obj":"Species"},{"id":"487","span":{"begin":5430,"end":5440},"obj":"Species"},{"id":"505","span":{"begin":6728,"end":6759},"obj":"Gene"},{"id":"506","span":{"begin":6761,"end":6765},"obj":"Gene"},{"id":"507","span":{"begin":7127,"end":7133},"obj":"Gene"},{"id":"508","span":{"begin":7135,"end":7139},"obj":"Gene"},{"id":"509","span":{"begin":7197,"end":7202},"obj":"Gene"},{"id":"510","span":{"begin":7163,"end":7168},"obj":"Gene"},{"id":"511","span":{"begin":6283,"end":6288},"obj":"Gene"},{"id":"512","span":{"begin":5704,"end":5708},"obj":"Species"},{"id":"513","span":{"begin":5780,"end":5784},"obj":"Species"},{"id":"514","span":{"begin":6271,"end":6282},"obj":"Species"},{"id":"515","span":{"begin":6779,"end":6787},"obj":"Species"},{"id":"516","span":{"begin":6990,"end":7000},"obj":"Species"},{"id":"517","span":{"begin":7027,"end":7035},"obj":"Species"},{"id":"518","span":{"begin":7691,"end":7696},"obj":"Species"},{"id":"519","span":{"begin":7700,"end":7705},"obj":"Species"},{"id":"520","span":{"begin":7295,"end":7302},"obj":"Species"},{"id":"521","span":{"begin":6418,"end":6426},"obj":"Disease"},{"id":"523","span":{"begin":7734,"end":7735},"obj":"Gene"},{"id":"531","span":{"begin":8333,"end":8334},"obj":"Gene"},{"id":"532","span":{"begin":7748,"end":7749},"obj":"Gene"},{"id":"533","span":{"begin":7893,"end":7905},"obj":"Gene"},{"id":"534","span":{"begin":7941,"end":7952},"obj":"Species"},{"id":"535","span":{"begin":7968,"end":7979},"obj":"Species"},{"id":"536","span":{"begin":8346,"end":8356},"obj":"Species"},{"id":"537","span":{"begin":8418,"end":8426},"obj":"Species"},{"id":"539","span":{"begin":8434,"end":8435},"obj":"Gene"},{"id":"545","span":{"begin":9054,"end":9055},"obj":"Gene"},{"id":"546","span":{"begin":8661,"end":8669},"obj":"Gene"},{"id":"547","span":{"begin":8448,"end":8459},"obj":"Species"},{"id":"548","span":{"begin":8810,"end":8823},"obj":"Species"},{"id":"549","span":{"begin":9043,"end":9053},"obj":"Species"},{"id":"551","span":{"begin":9138,"end":9139},"obj":"Gene"},{"id":"561","span":{"begin":9291,"end":9292},"obj":"Gene"},{"id":"562","span":{"begin":9165,"end":9176},"obj":"Species"},{"id":"563","span":{"begin":10069,"end":10077},"obj":"Species"},{"id":"564","span":{"begin":10096,"end":10106},"obj":"Species"},{"id":"565","span":{"begin":9713,"end":9719},"obj":"Chemical"},{"id":"566","span":{"begin":9724,"end":9732},"obj":"Chemical"},{"id":"567","span":{"begin":9763,"end":9765},"obj":"Chemical"},{"id":"568","span":{"begin":9767,"end":9773},"obj":"Chemical"},{"id":"569","span":{"begin":9778,"end":9786},"obj":"Chemical"},{"id":"575","span":{"begin":10419,"end":10423},"obj":"Gene"},{"id":"576","span":{"begin":10383,"end":10393},"obj":"Species"},{"id":"577","span":{"begin":10624,"end":10632},"obj":"Species"},{"id":"578","span":{"begin":10634,"end":10644},"obj":"Species"},{"id":"579","span":{"begin":10882,"end":10895},"obj":"Species"},{"id":"581","span":{"begin":10925,"end":10935},"obj":"Species"},{"id":"583","span":{"begin":10966,"end":10976},"obj":"Species"},{"id":"585","span":{"begin":10995,"end":11005},"obj":"Species"},{"id":"600","span":{"begin":12269,"end":12272},"obj":"Gene"},{"id":"601","span":{"begin":11886,"end":11889},"obj":"Gene"},{"id":"602","span":{"begin":11365,"end":11368},"obj":"Gene"},{"id":"603","span":{"begin":11200,"end":11210},"obj":"Species"},{"id":"604","span":{"begin":11284,"end":11287},"obj":"Species"},{"id":"605","span":{"begin":11309,"end":11319},"obj":"Species"},{"id":"606","span":{"begin":11372,"end":11375},"obj":"Species"},{"id":"607","span":{"begin":11471,"end":11480},"obj":"Species"},{"id":"608","span":{"begin":11516,"end":11526},"obj":"Species"},{"id":"609","span":{"begin":11862,"end":11872},"obj":"Species"},{"id":"610","span":{"begin":11893,"end":11896},"obj":"Species"},{"id":"611","span":{"begin":12118,"end":12128},"obj":"Species"},{"id":"612","span":{"begin":12198,"end":12215},"obj":"Species"},{"id":"613","span":{"begin":12273,"end":12276},"obj":"Species"},{"id":"637","span":{"begin":13273,"end":13276},"obj":"Gene"},{"id":"638","span":{"begin":12834,"end":12837},"obj":"Gene"},{"id":"639","span":{"begin":12815,"end":12818},"obj":"Gene"},{"id":"640","span":{"begin":12447,"end":12464},"obj":"Species"},{"id":"641","span":{"begin":12562,"end":12572},"obj":"Species"},{"id":"642","span":{"begin":12670,"end":12682},"obj":"Species"},{"id":"643","span":{"begin":12692,"end":12696},"obj":"Species"},{"id":"644","span":{"begin":12803,"end":12813},"obj":"Species"},{"id":"645","span":{"begin":12829,"end":12832},"obj":"Species"},{"id":"646","span":{"begin":12841,"end":12844},"obj":"Species"},{"id":"647","span":{"begin":12851,"end":12859},"obj":"Species"},{"id":"648","span":{"begin":13027,"end":13033},"obj":"Species"},{"id":"649","span":{"begin":13046,"end":13050},"obj":"Species"},{"id":"650","span":{"begin":13052,"end":13058},"obj":"Species"},{"id":"651","span":{"begin":13059,"end":13063},"obj":"Species"},{"id":"652","span":{"begin":13065,"end":13071},"obj":"Species"},{"id":"653","span":{"begin":13072,"end":13076},"obj":"Species"},{"id":"654","span":{"begin":13082,"end":13087},"obj":"Species"},{"id":"655","span":{"begin":13088,"end":13091},"obj":"Species"},{"id":"656","span":{"begin":13250,"end":13259},"obj":"Species"},{"id":"657","span":{"begin":12962,"end":12969},"obj":"Species"},{"id":"658","span":{"begin":13127,"end":13134},"obj":"Species"},{"id":"659","span":{"begin":13100,"end":13104},"obj":"Species"}],"attributes":[{"id":"A327","pred":"tao:has_database_id","subj":"327","obj":"Tax:2697049"},{"id":"A344","pred":"tao:has_database_id","subj":"344","obj":"Tax:11118"},{"id":"A345","pred":"tao:has_database_id","subj":"345","obj":"Tax:11118"},{"id":"A346","pred":"tao:has_database_id","subj":"346","obj":"Tax:9606"},{"id":"A347","pred":"tao:has_database_id","subj":"347","obj":"Tax:2697049"},{"id":"A348","pred":"tao:has_database_id","subj":"348","obj":"Tax:2697049"},{"id":"A349","pred":"tao:has_database_id","subj":"349","obj":"Tax:2697049"},{"id":"A350","pred":"tao:has_database_id","subj":"350","obj":"Tax:694009"},{"id":"A351","pred":"tao:has_database_id","subj":"351","obj":"Tax:694009"},{"id":"A352","pred":"tao:has_database_id","subj":"352","obj":"Tax:2697049"},{"id":"A353","pred":"tao:has_database_id","subj":"353","obj":"Tax:11118"},{"id":"A354","pred":"tao:has_database_id","subj":"354","obj":"Tax:694002"},{"id":"A355","pred":"tao:has_database_id","subj":"355","obj":"Tax:694002"},{"id":"A356","pred":"tao:has_database_id","subj":"356","obj":"Tax:9823"},{"id":"A357","pred":"tao:has_database_id","subj":"357","obj":"Tax:998089"},{"id":"A358","pred":"tao:has_database_id","subj":"358","obj":"MESH:D000067390"},{"id":"A359","pred":"tao:has_database_id","subj":"359","obj":"MESH:D012140"},{"id":"A372","pred":"tao:has_database_id","subj":"372","obj":"Gene:43740575"},{"id":"A373","pred":"tao:has_database_id","subj":"373","obj":"Gene:43740571"},{"id":"A374","pred":"tao:has_database_id","subj":"374","obj":"Tax:11118"},{"id":"A375","pred":"tao:has_database_id","subj":"375","obj":"Tax:2697049"},{"id":"A376","pred":"tao:has_database_id","subj":"376","obj":"Tax:11118"},{"id":"A377","pred":"tao:has_database_id","subj":"377","obj":"Tax:2697049"},{"id":"A378","pred":"tao:has_database_id","subj":"378","obj":"Tax:11118"},{"id":"A379","pred":"tao:has_database_id","subj":"379","obj":"Tax:2697049"},{"id":"A380","pred":"tao:has_database_id","subj":"380","obj":"Tax:694002"},{"id":"A381","pred":"tao:has_database_id","subj":"381","obj":"Tax:11118"},{"id":"A413","pred":"tao:has_database_id","subj":"413","obj":"Gene:43740578"},{"id":"A414","pred":"tao:has_database_id","subj":"414","obj":"Gene:570"},{"id":"A415","pred":"tao:has_database_id","subj":"415","obj":"Tax:2697049"},{"id":"A416","pred":"tao:has_database_id","subj":"416","obj":"Tax:694002"},{"id":"A417","pred":"tao:has_database_id","subj":"417","obj":"Tax:11118"},{"id":"A418","pred":"tao:has_database_id","subj":"418","obj":"Tax:694009"},{"id":"A419","pred":"tao:has_database_id","subj":"419","obj":"Tax:2697049"},{"id":"A420","pred":"tao:has_database_id","subj":"420","obj":"Tax:694002"},{"id":"A421","pred":"tao:has_database_id","subj":"421","obj":"Tax:2697049"},{"id":"A422","pred":"tao:has_database_id","subj":"422","obj":"Tax:694009"},{"id":"A423","pred":"tao:has_database_id","subj":"423","obj":"Tax:694009"},{"id":"A424","pred":"tao:has_database_id","subj":"424","obj":"Tax:2697049"},{"id":"A425","pred":"tao:has_database_id","subj":"425","obj":"Tax:9606"},{"id":"A426","pred":"tao:has_database_id","subj":"426","obj":"Tax:694009"},{"id":"A427","pred":"tao:has_database_id","subj":"427","obj":"Tax:9606"},{"id":"A428","pred":"tao:has_database_id","subj":"428","obj":"Tax:694009"},{"id":"A429","pred":"tao:has_database_id","subj":"429","obj":"Tax:11118"},{"id":"A430","pred":"tao:has_database_id","subj":"430","obj":"Tax:2697049"},{"id":"A431","pred":"tao:has_database_id","subj":"431","obj":"Tax:2697049"},{"id":"A432","pred":"tao:has_database_id","subj":"432","obj":"Tax:694009"},{"id":"A433","pred":"tao:has_database_id","subj":"433","obj":"Tax:2697049"},{"id":"A434","pred":"tao:has_database_id","subj":"434","obj":"Tax:9606"},{"id":"A435","pred":"tao:has_database_id","subj":"435","obj":"Tax:11118"},{"id":"A436","pred":"tao:has_database_id","subj":"436","obj":"Tax:11118"},{"id":"A437","pred":"tao:has_database_id","subj":"437","obj":"Tax:694009"},{"id":"A438","pred":"tao:has_database_id","subj":"438","obj":"Tax:9838"},{"id":"A439","pred":"tao:has_database_id","subj":"439","obj":"Tax:227984"},{"id":"A440","pred":"tao:has_database_id","subj":"440","obj":"MESH:D015047"},{"id":"A441","pred":"tao:has_database_id","subj":"441","obj":"MESH:D015047"},{"id":"A451","pred":"tao:has_database_id","subj":"451","obj":"Gene:43740578"},{"id":"A452","pred":"tao:has_database_id","subj":"452","obj":"Gene:5499"},{"id":"A453","pred":"tao:has_database_id","subj":"453","obj":"Gene:43740578"},{"id":"A454","pred":"tao:has_database_id","subj":"454","obj":"Gene:8673700"},{"id":"A455","pred":"tao:has_database_id","subj":"455","obj":"Gene:43740577"},{"id":"A456","pred":"tao:has_database_id","subj":"456","obj":"Tax:11118"},{"id":"A457","pred":"tao:has_database_id","subj":"457","obj":"Tax:2697049"},{"id":"A458","pred":"tao:has_database_id","subj":"458","obj":"Tax:2697049"},{"id":"A459","pred":"tao:has_database_id","subj":"459","obj":"Tax:11118"},{"id":"A468","pred":"tao:has_database_id","subj":"468","obj":"Gene:43740568"},{"id":"A469","pred":"tao:has_database_id","subj":"469","obj":"Gene:43740571"},{"id":"A470","pred":"tao:has_database_id","subj":"470","obj":"Gene:43740571"},{"id":"A471","pred":"tao:has_database_id","subj":"471","obj":"Gene:43740575"},{"id":"A472","pred":"tao:has_database_id","subj":"472","obj":"Gene:43740575"},{"id":"A473","pred":"tao:has_database_id","subj":"473","obj":"Gene:43740570"},{"id":"A474","pred":"tao:has_database_id","subj":"474","obj":"Gene:43740568"},{"id":"A475","pred":"tao:has_database_id","subj":"475","obj":"Tax:11118"},{"id":"A477","pred":"tao:has_database_id","subj":"477","obj":"Gene:43740568"},{"id":"A483","pred":"tao:has_database_id","subj":"483","obj":"Gene:43740568"},{"id":"A484","pred":"tao:has_database_id","subj":"484","obj":"Tax:11118"},{"id":"A485","pred":"tao:has_database_id","subj":"485","obj":"Tax:11120"},{"id":"A486","pred":"tao:has_database_id","subj":"486","obj":"Tax:12663"},{"id":"A487","pred":"tao:has_database_id","subj":"487","obj":"Tax:11120"},{"id":"A505","pred":"tao:has_database_id","subj":"505","obj":"Gene:59272"},{"id":"A506","pred":"tao:has_database_id","subj":"506","obj":"Gene:59272"},{"id":"A507","pred":"tao:has_database_id","subj":"507","obj":"Gene:43740578"},{"id":"A508","pred":"tao:has_database_id","subj":"508","obj":"Gene:43740577"},{"id":"A509","pred":"tao:has_database_id","subj":"509","obj":"Gene:43740573"},{"id":"A510","pred":"tao:has_database_id","subj":"510","obj":"Gene:43740568"},{"id":"A511","pred":"tao:has_database_id","subj":"511","obj":"Gene:43740568"},{"id":"A512","pred":"tao:has_database_id","subj":"512","obj":"Tax:11118"},{"id":"A513","pred":"tao:has_database_id","subj":"513","obj":"Tax:11118"},{"id":"A514","pred":"tao:has_database_id","subj":"514","obj":"Tax:11118"},{"id":"A515","pred":"tao:has_database_id","subj":"515","obj":"Tax:694009"},{"id":"A516","pred":"tao:has_database_id","subj":"516","obj":"Tax:2697049"},{"id":"A517","pred":"tao:has_database_id","subj":"517","obj":"Tax:694009"},{"id":"A518","pred":"tao:has_database_id","subj":"518","obj":"Tax:9606"},{"id":"A519","pred":"tao:has_database_id","subj":"519","obj":"Tax:9606"},{"id":"A520","pred":"tao:has_database_id","subj":"520","obj":"Tax:100569"},{"id":"A521","pred":"tao:has_database_id","subj":"521","obj":"MESH:C000657245"},{"id":"A523","pred":"tao:has_database_id","subj":"523","obj":"Gene:43740571"},{"id":"A531","pred":"tao:has_database_id","subj":"531","obj":"Gene:43740571"},{"id":"A532","pred":"tao:has_database_id","subj":"532","obj":"Gene:43740571"},{"id":"A533","pred":"tao:has_database_id","subj":"533","obj":"Gene:43740575"},{"id":"A534","pred":"tao:has_database_id","subj":"534","obj":"Tax:11118"},{"id":"A535","pred":"tao:has_database_id","subj":"535","obj":"Tax:11118"},{"id":"A536","pred":"tao:has_database_id","subj":"536","obj":"Tax:2697049"},{"id":"A537","pred":"tao:has_database_id","subj":"537","obj":"Tax:694009"},{"id":"A539","pred":"tao:has_database_id","subj":"539","obj":"Gene:43740570"},{"id":"A545","pred":"tao:has_database_id","subj":"545","obj":"Gene:43740570"},{"id":"A546","pred":"tao:has_database_id","subj":"546","obj":"Gene:43740571"},{"id":"A547","pred":"tao:has_database_id","subj":"547","obj":"Tax:11118"},{"id":"A548","pred":"tao:has_database_id","subj":"548","obj":"Tax:11118"},{"id":"A549","pred":"tao:has_database_id","subj":"549","obj":"Tax:2697049"},{"id":"A551","pred":"tao:has_database_id","subj":"551","obj":"Gene:43740575"},{"id":"A561","pred":"tao:has_database_id","subj":"561","obj":"Gene:43740571"},{"id":"A562","pred":"tao:has_database_id","subj":"562","obj":"Tax:11118"},{"id":"A563","pred":"tao:has_database_id","subj":"563","obj":"Tax:694009"},{"id":"A564","pred":"tao:has_database_id","subj":"564","obj":"Tax:2697049"},{"id":"A565","pred":"tao:has_database_id","subj":"565","obj":"MESH:D012694"},{"id":"A566","pred":"tao:has_database_id","subj":"566","obj":"MESH:D001120"},{"id":"A567","pred":"tao:has_database_id","subj":"567","obj":"MESH:D013324"},{"id":"A568","pred":"tao:has_database_id","subj":"568","obj":"MESH:D012694"},{"id":"A569","pred":"tao:has_database_id","subj":"569","obj":"MESH:D001120"},{"id":"A575","pred":"tao:has_database_id","subj":"575","obj":"Gene:10045"},{"id":"A576","pred":"tao:has_database_id","subj":"576","obj":"Tax:2697049"},{"id":"A577","pred":"tao:has_database_id","subj":"577","obj":"Tax:694009"},{"id":"A578","pred":"tao:has_database_id","subj":"578","obj":"Tax:2697049"},{"id":"A579","pred":"tao:has_database_id","subj":"579","obj":"Tax:11118"},{"id":"A581","pred":"tao:has_database_id","subj":"581","obj":"Tax:2697049"},{"id":"A583","pred":"tao:has_database_id","subj":"583","obj":"Tax:2697049"},{"id":"A585","pred":"tao:has_database_id","subj":"585","obj":"Tax:2697049"},{"id":"A600","pred":"tao:has_database_id","subj":"600","obj":"Gene:570"},{"id":"A601","pred":"tao:has_database_id","subj":"601","obj":"Gene:570"},{"id":"A602","pred":"tao:has_database_id","subj":"602","obj":"Gene:570"},{"id":"A603","pred":"tao:has_database_id","subj":"603","obj":"Tax:2697049"},{"id":"A604","pred":"tao:has_database_id","subj":"604","obj":"Tax:11118"},{"id":"A605","pred":"tao:has_database_id","subj":"605","obj":"Tax:2697049"},{"id":"A606","pred":"tao:has_database_id","subj":"606","obj":"Tax:11118"},{"id":"A607","pred":"tao:has_database_id","subj":"607","obj":"Tax:694009"},{"id":"A608","pred":"tao:has_database_id","subj":"608","obj":"Tax:2697049"},{"id":"A609","pred":"tao:has_database_id","subj":"609","obj":"Tax:2697049"},{"id":"A610","pred":"tao:has_database_id","subj":"610","obj":"Tax:11118"},{"id":"A611","pred":"tao:has_database_id","subj":"611","obj":"Tax:2697049"},{"id":"A612","pred":"tao:has_database_id","subj":"612","obj":"Tax:694002"},{"id":"A613","pred":"tao:has_database_id","subj":"613","obj":"Tax:11118"},{"id":"A637","pred":"tao:has_database_id","subj":"637","obj":"Gene:570"},{"id":"A638","pred":"tao:has_database_id","subj":"638","obj":"Gene:570"},{"id":"A639","pred":"tao:has_database_id","subj":"639","obj":"Gene:570"},{"id":"A640","pred":"tao:has_database_id","subj":"640","obj":"Tax:694002"},{"id":"A641","pred":"tao:has_database_id","subj":"641","obj":"Tax:2697049"},{"id":"A642","pred":"tao:has_database_id","subj":"642","obj":"Tax:100569"},{"id":"A643","pred":"tao:has_database_id","subj":"643","obj":"Tax:11118"},{"id":"A644","pred":"tao:has_database_id","subj":"644","obj":"Tax:2697049"},{"id":"A645","pred":"tao:has_database_id","subj":"645","obj":"Tax:11118"},{"id":"A646","pred":"tao:has_database_id","subj":"646","obj":"Tax:11118"},{"id":"A647","pred":"tao:has_database_id","subj":"647","obj":"Tax:694009"},{"id":"A648","pred":"tao:has_database_id","subj":"648","obj":"Tax:9615"},{"id":"A649","pred":"tao:has_database_id","subj":"649","obj":"Tax:11118"},{"id":"A650","pred":"tao:has_database_id","subj":"650","obj":"Tax:9913"},{"id":"A651","pred":"tao:has_database_id","subj":"651","obj":"Tax:11118"},{"id":"A652","pred":"tao:has_database_id","subj":"652","obj":"Tax:9796"},{"id":"A653","pred":"tao:has_database_id","subj":"653","obj":"Tax:11118"},{"id":"A654","pred":"tao:has_database_id","subj":"654","obj":"Tax:9606"},{"id":"A655","pred":"tao:has_database_id","subj":"655","obj":"Tax:11118"},{"id":"A656","pred":"tao:has_database_id","subj":"656","obj":"Tax:694009"},{"id":"A657","pred":"tao:has_database_id","subj":"657","obj":"Tax:100569"},{"id":"A658","pred":"tao:has_database_id","subj":"658","obj":"Tax:100569"},{"id":"A659","pred":"tao:has_database_id","subj":"659","obj":"Tax:31631"}],"namespaces":[{"prefix":"Tax","uri":"https://www.ncbi.nlm.nih.gov/taxonomy/"},{"prefix":"MESH","uri":"https://id.nlm.nih.gov/mesh/"},{"prefix":"Gene","uri":"https://www.ncbi.nlm.nih.gov/gene/"},{"prefix":"CVCL","uri":"https://web.expasy.org/cellosaurus/CVCL_"}],"text":"THE VIRUS (SARS-CoV-2)\nCoronaviruses are positive-sense RNA viruses having an extensive and promiscuous range of natural hosts and affect multiple systems (23, 24). Coronaviruses can cause clinical diseases in humans that may extend from the common cold to more severe respiratory diseases like SARS and MERS (17, 279). The recently emerging SARS-CoV-2 has wrought havoc in China and caused a pandemic situation in the worldwide population, leading to disease outbreaks that have not been controlled to date, although extensive efforts are being put in place to counter this virus (25). This virus has been proposed to be designated/named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses (ICTV), which determined the virus belongs to the Severe acute respiratory syndrome-related coronavirus category and found this virus is related to SARS-CoVs (26). SARS-CoV-2 is a member of the order Nidovirales, family Coronaviridae, subfamily Orthocoronavirinae, which is subdivided into four genera, viz., Alphacoronavirus, Betacoronavirus, Gammacoronavirus, and Deltacoronavirus (3, 27). The genera Alphacoronavirus and Betacoronavirus originate from bats, while Gammacoronavirus and Deltacoronavirus have evolved from bird and swine gene pools (24, 28, 29, 275).\nCoronaviruses possess an unsegmented, single-stranded, positive-sense RNA genome of around 30 kb, enclosed by a 5′-cap and 3′-poly(A) tail (30). The genome of SARS-CoV-2 is 29,891 bp long, with a G+C content of 38% (31). These viruses are encircled with an envelope containing viral nucleocapsid. The nucleocapsids in CoVs are arranged in helical symmetry, which reflects an atypical attribute in positive-sense RNA viruses (30). The electron micrographs of SARS-CoV-2 revealed a diverging spherical outline with some degree of pleomorphism, virion diameters varying from 60 to 140 nm, and distinct spikes of 9 to 12 nm, giving the virus the appearance of a solar corona (3). The CoV genome is arranged linearly as 5′-leader-UTR-replicase-structural genes (S-E-M-N)-3′ UTR-poly(A) (32). Accessory genes, such as 3a/b, 4a/b, and the hemagglutinin-esterase gene (HE), are also seen intermingled with the structural genes (30). SARS-CoV-2 has also been found to be arranged similarly and encodes several accessory proteins, although it lacks the HE, which is characteristic of some betacoronaviruses (31). The positive-sense genome of CoVs serves as the mRNA and is translated to polyprotein 1a/1ab (pp1a/1ab) (33). A replication-transcription complex (RTC) is formed in double-membrane vesicles (DMVs) by nonstructural proteins (nsps), encoded by the polyprotein gene (34). Subsequently, the RTC synthesizes a nested set of subgenomic RNAs (sgRNAs) via discontinuous transcription (35).\nBased on molecular characterization, SARS-CoV-2 is considered a new Betacoronavirus belonging to the subgenus Sarbecovirus (3). A few other critical zoonotic viruses (MERS-related CoV and SARS-related CoV) belong to the same genus. However, SARS-CoV-2 was identified as a distinct virus based on the percent identity with other Betacoronavirus; conserved open reading frame 1a/b (ORF1a/b) is below 90% identity (3). An overall 80% nucleotide identity was observed between SARS-CoV-2 and the original SARS-CoV, along with 89% identity with ZC45 and ZXC21 SARS-related CoVs of bats (2, 31, 36). In addition, 82% identity has been observed between SARS-CoV-2 and human SARS-CoV Tor2 and human SARS-CoV BJ01 2003 (31). A sequence identity of only 51.8% was observed between MERS-related CoV and the recently emerged SARS-CoV-2 (37). Phylogenetic analysis of the structural genes also revealed that SARS-CoV-2 is closer to bat SARS-related CoV. Therefore, SARS-CoV-2 might have originated from bats, while other amplifier hosts might have played a role in disease transmission to humans (31). Of note, the other two zoonotic CoVs (MERS-related CoV and SARS-related CoV) also originated from bats (38, 39). Nevertheless, for SARS and MERS, civet cat and camels, respectively, act as amplifier hosts (40, 41).\nCoronavirus genomes and subgenomes encode six ORFs (31). The majority of the 5′ end is occupied by ORF1a/b, which produces 16 nsps. The two polyproteins, pp1a and pp1ab, are initially produced from ORF1a/b by a −1 frameshift between ORF1a and ORF1b (32). The virus-encoded proteases cleave polyproteins into individual nsps (main protease [Mpro], chymotrypsin-like protease [3CLpro], and papain-like proteases [PLPs]) (42). SARS-CoV-2 also encodes these nsps, and their functions have been elucidated recently (31). Remarkably, a difference between SARS-CoV-2 and other CoVs is the identification of a novel short putative protein within the ORF3 band, a secreted protein with an alpha helix and beta-sheet with six strands encoded by ORF8 (31).\nCoronaviruses encode four major structural proteins, namely, spike (S), membrane (M), envelope (E), and nucleocapsid (N), which are described in detail below.\n\nS Glycoprotein\nCoronavirus S protein is a large, multifunctional class I viral transmembrane protein. The size of this abundant S protein varies from 1,160 amino acids (IBV, infectious bronchitis virus, in poultry) to 1,400 amino acids (FCoV, feline coronavirus) (43). It lies in a trimer on the virion surface, giving the virion a corona or crown-like appearance. Functionally it is required for the entry of the infectious virion particles into the cell through interaction with various host cellular receptors (44).\nFurthermore, it acts as a critical factor for tissue tropism and the determination of host range (45). Notably, S protein is one of the vital immunodominant proteins of CoVs capable of inducing host immune responses (45). The ectodomains in all CoVs S proteins have similar domain organizations, divided into two subunits, S1 and S2 (43). The first one, S1, helps in host receptor binding, while the second one, S2, accounts for fusion. The former (S1) is further divided into two subdomains, namely, the N-terminal domain (NTD) and C-terminal domain (CTD). Both of these subdomains act as receptor-binding domains, interacting efficiently with various host receptors (45). The S1 CTD contains the receptor-binding motif (RBM). In each coronavirus spike protein, the trimeric S1 locates itself on top of the trimeric S2 stalk (45). Recently, structural analyses of the S proteins of COVID-19 have revealed 27 amino acid substitutions within a 1,273-amino-acid stretch (16). Six substitutions are located in the RBD (amino acids 357 to 528), while four substitutions are in the RBM at the CTD of the S1 domain (16). Of note, no amino acid change is seen in the RBM, which binds directly to the angiotensin-converting enzyme-2 (ACE2) receptor in SARS-CoV (16, 46). At present, the main emphasis is knowing how many differences would be required to change the host tropism. Sequence comparison revealed 17 nonsynonymous changes between the early sequence of SARS-CoV-2 and the later isolates of SARS-CoV. The changes were found scattered over the genome of the virus, with nine substitutions in ORF1ab, ORF8 (4 substitutions), the spike gene (3 substitutions), and ORF7a (single substitution) (4). Notably, the same nonsynonymous changes were found in a familial cluster, indicating that the viral evolution happened during person-to-person transmission (4, 47). Such adaptive evolution events are frequent and constitute a constantly ongoing process once the virus spreads among new hosts (47). Even though no functional changes occur in the virus associated with this adaptive evolution, close monitoring of the viral mutations that occur during subsequent human-to-human transmission is warranted.\n\nM Protein\nThe M protein is the most abundant viral protein present in the virion particle, giving a definite shape to the viral envelope (48). It binds to the nucleocapsid and acts as a central organizer of coronavirus assembly (49). Coronavirus M proteins are highly diverse in amino acid contents but maintain overall structural similarity within different genera (50). The M protein has three transmembrane domains, flanked by a short amino terminus outside the virion and a long carboxy terminus inside the virion (50). Overall, the viral scaffold is maintained by M-M interaction. Of note, the M protein of SARS-CoV-2 does not have an amino acid substitution compared to that of SARS-CoV (16).\n\nE Protein\nThe coronavirus E protein is the most enigmatic and smallest of the major structural proteins (51). It plays a multifunctional role in the pathogenesis, assembly, and release of the virus (52). It is a small integral membrane polypeptide that acts as a viroporin (ion channel) (53). The inactivation or absence of this protein is related to the altered virulence of coronaviruses due to changes in morphology and tropism (54). The E protein consists of three domains, namely, a short hydrophilic amino terminal, a large hydrophobic transmembrane domain, and an efficient C-terminal domain (51). The SARS-CoV-2 E protein reveals a similar amino acid constitution without any substitution (16).\n\nN Protein\nThe N protein of coronavirus is multipurpose. Among several functions, it plays a role in complex formation with the viral genome, facilitates M protein interaction needed during virion assembly, and enhances the transcription efficiency of the virus (55, 56). It contains three highly conserved and distinct domains, namely, an NTD, an RNA-binding domain or a linker region (LKR), and a CTD (57). The NTD binds with the 3′ end of the viral genome, perhaps via electrostatic interactions, and is highly diverged both in length and sequence (58). The charged LKR is serine and arginine rich and is also known as the SR (serine and arginine) domain (59). The LKR is capable of direct interaction with in vitro RNA interaction and is responsible for cell signaling (60, 61). It also modulates the antiviral response of the host by working as an antagonist for interferon (IFN) and RNA interference (62). Compared to that of SARS-CoV, the N protein of SARS-CoV-2 possess five amino acid mutations, where two are in the intrinsically dispersed region (IDR; positions 25 and 26), one each in the NTD (position 103), LKR (position 217), and CTD (position 334) (16).\n\nnsps and Accessory Proteins\nBesides the important structural proteins, the SARS-CoV-2 genome contains 15 nsps, nsp1 to nsp10 and nsp12 to nsp16, and 8 accessory proteins (3a, 3b, p6, 7a, 7b, 8b, 9b, and ORF14) (16). All these proteins play a specific role in viral replication (27). Unlike the accessory proteins of SARS-CoV, SARS-CoV-2 does not contain 8a protein and has a longer 8b and shorter 3b protein (16). The nsp7, nsp13, envelope, matrix, and p6 and 8b accessory proteins have not been detected with any amino acid substitutions compared to the sequences of other coronaviruses (16).\nThe virus structure of SARS-CoV-2 is depicted in Fig. 2.\nFIG 2 SARS-CoV-2 virus structure.\n\nSARS-CoV-2 Spike Glycoprotein Gene Analysis\n\nSequence percent similarity analysis.\nWe assessed the nucleotide percent similarity using the MegAlign software program, where the similarity between the novel SARS-CoV-2 isolates was in the range of 99.4% to 100%. Among the other Serbecovirus CoV sequences, the novel SARS-CoV-2 sequences revealed the highest similarity to bat-SL-CoV, with nucleotide percent identity ranges between 88.12 and 89.65%. Meanwhile, earlier reported SARS-CoVs showed 70.6 to 74.9% similarity to SARS-CoV-2 at the nucleotide level. Further, the nucleotide percent similarity was 55.4%, 45.5% to 47.9%, 46.2% to 46.6%, and 45.0% to 46.3% to the other four subgenera, namely, Hibecovirus, Nobecovirus, Merbecovirus, and Embecovirus, respectively. The percent similarity index of current outbreak isolates indicates a close relationship between SARS-CoV-2 isolates and bat-SL-CoV, indicating a common origin. However, particular pieces of evidence based on further complete genomic analysis of current isolates are necessary to draw any conclusions, although it was ascertained that the current novel SARS-CoV-2 isolates belong to the subgenus Sarbecovirus in the diverse range of betacoronaviruses. Their possible ancestor was hypothesized to be from bat CoV strains, wherein bats might have played a crucial role in harboring this class of viruses.\n\nSplitsTree phylogeny analysis.\nIn the unrooted phylogenetic tree of different betacoronaviruses based on the S protein, virus sequences from different subgenera grouped into separate clusters. SARS-CoV-2 sequences from Wuhan and other countries exhibited a close relationship and appeared in a single cluster (Fig. 1). The CoVs from the subgenus Sarbecovirus appeared jointly in SplitsTree and divided into three subclusters, namely, SARS-CoV-2, bat-SARS-like-CoV (bat-SL-CoV), and SARS-CoV (Fig. 1). In the case of other subgenera, like Merbecovirus, all of the sequences grouped in a single cluster, whereas in Embecovirus, different species, comprised of canine respiratory CoVs, bovine CoVs, equine CoVs, and human CoV strain (OC43), grouped in a common cluster. Isolates in the subgenera Nobecovorus and Hibecovirus were found to be placed separately away from other reported SARS-CoVs but shared a bat origin."}