PMC:7352545 / 10618-19469
Annnotations
LitCovid_Glycan-Motif-Structure
{"project":"LitCovid_Glycan-Motif-Structure","denotations":[{"id":"T34","span":{"begin":8107,"end":8109},"obj":"https://glytoucan.org/Structures/Glycans/G81533KY"},{"id":"T35","span":{"begin":8139,"end":8141},"obj":"https://glytoucan.org/Structures/Glycans/G81533KY"},{"id":"T36","span":{"begin":8338,"end":8340},"obj":"https://glytoucan.org/Structures/Glycans/G81533KY"},{"id":"T37","span":{"begin":8469,"end":8471},"obj":"https://glytoucan.org/Structures/Glycans/G81533KY"},{"id":"T38","span":{"begin":8561,"end":8563},"obj":"https://glytoucan.org/Structures/Glycans/G81533KY"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}
LitCovid-PD-FMA-UBERON
{"project":"LitCovid-PD-FMA-UBERON","denotations":[{"id":"T59","span":{"begin":129,"end":132},"obj":"Body_part"},{"id":"T60","span":{"begin":133,"end":139},"obj":"Body_part"},{"id":"T61","span":{"begin":175,"end":179},"obj":"Body_part"},{"id":"T62","span":{"begin":229,"end":232},"obj":"Body_part"},{"id":"T63","span":{"begin":233,"end":240},"obj":"Body_part"},{"id":"T64","span":{"begin":381,"end":389},"obj":"Body_part"},{"id":"T65","span":{"begin":406,"end":412},"obj":"Body_part"},{"id":"T66","span":{"begin":454,"end":465},"obj":"Body_part"},{"id":"T67","span":{"begin":566,"end":572},"obj":"Body_part"},{"id":"T68","span":{"begin":600,"end":604},"obj":"Body_part"},{"id":"T69","span":{"begin":631,"end":634},"obj":"Body_part"},{"id":"T70","span":{"begin":669,"end":672},"obj":"Body_part"},{"id":"T71","span":{"begin":687,"end":690},"obj":"Body_part"},{"id":"T72","span":{"begin":701,"end":704},"obj":"Body_part"},{"id":"T73","span":{"begin":783,"end":789},"obj":"Body_part"},{"id":"T74","span":{"begin":911,"end":918},"obj":"Body_part"},{"id":"T75","span":{"begin":941,"end":947},"obj":"Body_part"},{"id":"T76","span":{"begin":979,"end":984},"obj":"Body_part"},{"id":"T77","span":{"begin":1027,"end":1034},"obj":"Body_part"},{"id":"T78","span":{"begin":1090,"end":1102},"obj":"Body_part"},{"id":"T79","span":{"begin":1144,"end":1156},"obj":"Body_part"},{"id":"T80","span":{"begin":1168,"end":1171},"obj":"Body_part"},{"id":"T81","span":{"begin":1193,"end":1201},"obj":"Body_part"},{"id":"T82","span":{"begin":1224,"end":1232},"obj":"Body_part"},{"id":"T83","span":{"begin":1259,"end":1267},"obj":"Body_part"},{"id":"T84","span":{"begin":1349,"end":1362},"obj":"Body_part"},{"id":"T85","span":{"begin":1370,"end":1382},"obj":"Body_part"},{"id":"T86","span":{"begin":1395,"end":1402},"obj":"Body_part"},{"id":"T87","span":{"begin":1457,"end":1462},"obj":"Body_part"},{"id":"T88","span":{"begin":1484,"end":1492},"obj":"Body_part"},{"id":"T89","span":{"begin":1511,"end":1532},"obj":"Body_part"},{"id":"T90","span":{"begin":1534,"end":1536},"obj":"Body_part"},{"id":"T91","span":{"begin":1569,"end":1590},"obj":"Body_part"},{"id":"T92","span":{"begin":1610,"end":1621},"obj":"Body_part"},{"id":"T93","span":{"begin":1635,"end":1638},"obj":"Body_part"},{"id":"T94","span":{"begin":1639,"end":1645},"obj":"Body_part"},{"id":"T95","span":{"begin":1833,"end":1836},"obj":"Body_part"},{"id":"T96","span":{"begin":1968,"end":1976},"obj":"Body_part"},{"id":"T97","span":{"begin":1991,"end":1998},"obj":"Body_part"},{"id":"T98","span":{"begin":2170,"end":2173},"obj":"Body_part"},{"id":"T99","span":{"begin":2174,"end":2180},"obj":"Body_part"},{"id":"T100","span":{"begin":2186,"end":2193},"obj":"Body_part"},{"id":"T101","span":{"begin":2317,"end":2325},"obj":"Body_part"},{"id":"T102","span":{"begin":2495,"end":2502},"obj":"Body_part"},{"id":"T103","span":{"begin":2523,"end":2528},"obj":"Body_part"},{"id":"T104","span":{"begin":2587,"end":2594},"obj":"Body_part"},{"id":"T105","span":{"begin":2656,"end":2663},"obj":"Body_part"},{"id":"T106","span":{"begin":2735,"end":2742},"obj":"Body_part"},{"id":"T107","span":{"begin":2797,"end":2805},"obj":"Body_part"},{"id":"T108","span":{"begin":2933,"end":2949},"obj":"Body_part"},{"id":"T109","span":{"begin":3089,"end":3105},"obj":"Body_part"},{"id":"T110","span":{"begin":3125,"end":3133},"obj":"Body_part"},{"id":"T111","span":{"begin":3198,"end":3201},"obj":"Body_part"},{"id":"T112","span":{"begin":3261,"end":3264},"obj":"Body_part"},{"id":"T113","span":{"begin":3273,"end":3280},"obj":"Body_part"},{"id":"T114","span":{"begin":3296,"end":3300},"obj":"Body_part"},{"id":"T115","span":{"begin":3452,"end":3455},"obj":"Body_part"},{"id":"T116","span":{"begin":3536,"end":3540},"obj":"Body_part"},{"id":"T117","span":{"begin":3660,"end":3663},"obj":"Body_part"},{"id":"T118","span":{"begin":3664,"end":3670},"obj":"Body_part"},{"id":"T119","span":{"begin":3782,"end":3785},"obj":"Body_part"},{"id":"T120","span":{"begin":3827,"end":3834},"obj":"Body_part"},{"id":"T121","span":{"begin":3880,"end":3892},"obj":"Body_part"},{"id":"T122","span":{"begin":3896,"end":3899},"obj":"Body_part"},{"id":"T123","span":{"begin":3915,"end":3927},"obj":"Body_part"},{"id":"T124","span":{"begin":3955,"end":3962},"obj":"Body_part"},{"id":"T125","span":{"begin":4066,"end":4068},"obj":"Body_part"},{"id":"T126","span":{"begin":4077,"end":4082},"obj":"Body_part"},{"id":"T127","span":{"begin":4097,"end":4109},"obj":"Body_part"},{"id":"T128","span":{"begin":4180,"end":4192},"obj":"Body_part"},{"id":"T129","span":{"begin":4237,"end":4249},"obj":"Body_part"},{"id":"T130","span":{"begin":4310,"end":4315},"obj":"Body_part"},{"id":"T131","span":{"begin":4369,"end":4381},"obj":"Body_part"},{"id":"T132","span":{"begin":4436,"end":4448},"obj":"Body_part"},{"id":"T133","span":{"begin":4488,"end":4500},"obj":"Body_part"},{"id":"T134","span":{"begin":4570,"end":4582},"obj":"Body_part"},{"id":"T135","span":{"begin":4669,"end":4673},"obj":"Body_part"},{"id":"T136","span":{"begin":4806,"end":4819},"obj":"Body_part"},{"id":"T137","span":{"begin":4806,"end":4810},"obj":"Body_part"},{"id":"T138","span":{"begin":4842,"end":4851},"obj":"Body_part"},{"id":"T139","span":{"begin":4997,"end":5009},"obj":"Body_part"},{"id":"T140","span":{"begin":5084,"end":5091},"obj":"Body_part"},{"id":"T141","span":{"begin":5135,"end":5142},"obj":"Body_part"},{"id":"T142","span":{"begin":5146,"end":5149},"obj":"Body_part"},{"id":"T143","span":{"begin":5165,"end":5172},"obj":"Body_part"},{"id":"T144","span":{"begin":5194,"end":5197},"obj":"Body_part"},{"id":"T145","span":{"begin":5198,"end":5204},"obj":"Body_part"},{"id":"T146","span":{"begin":5212,"end":5219},"obj":"Body_part"},{"id":"T147","span":{"begin":5245,"end":5248},"obj":"Body_part"},{"id":"T148","span":{"begin":5249,"end":5255},"obj":"Body_part"},{"id":"T149","span":{"begin":5283,"end":5290},"obj":"Body_part"},{"id":"T150","span":{"begin":5312,"end":5315},"obj":"Body_part"},{"id":"T151","span":{"begin":5457,"end":5462},"obj":"Body_part"},{"id":"T152","span":{"begin":5508,"end":5511},"obj":"Body_part"},{"id":"T153","span":{"begin":5564,"end":5572},"obj":"Body_part"},{"id":"T154","span":{"begin":5635,"end":5642},"obj":"Body_part"},{"id":"T155","span":{"begin":5659,"end":5662},"obj":"Body_part"},{"id":"T156","span":{"begin":5673,"end":5680},"obj":"Body_part"},{"id":"T157","span":{"begin":5695,"end":5703},"obj":"Body_part"},{"id":"T158","span":{"begin":5759,"end":5766},"obj":"Body_part"},{"id":"T159","span":{"begin":5908,"end":5910},"obj":"Body_part"},{"id":"T160","span":{"begin":5918,"end":5920},"obj":"Body_part"},{"id":"T161","span":{"begin":5946,"end":5950},"obj":"Body_part"},{"id":"T162","span":{"begin":6037,"end":6044},"obj":"Body_part"},{"id":"T163","span":{"begin":6106,"end":6112},"obj":"Body_part"},{"id":"T164","span":{"begin":6151,"end":6158},"obj":"Body_part"},{"id":"T165","span":{"begin":6161,"end":6168},"obj":"Body_part"},{"id":"T166","span":{"begin":6201,"end":6208},"obj":"Body_part"},{"id":"T167","span":{"begin":6251,"end":6258},"obj":"Body_part"},{"id":"T168","span":{"begin":6268,"end":6275},"obj":"Body_part"},{"id":"T169","span":{"begin":6279,"end":6286},"obj":"Body_part"},{"id":"T170","span":{"begin":6314,"end":6321},"obj":"Body_part"},{"id":"T171","span":{"begin":6349,"end":6356},"obj":"Body_part"},{"id":"T172","span":{"begin":6413,"end":6422},"obj":"Body_part"},{"id":"T173","span":{"begin":6437,"end":6444},"obj":"Body_part"},{"id":"T174","span":{"begin":6543,"end":6550},"obj":"Body_part"},{"id":"T175","span":{"begin":6569,"end":6578},"obj":"Body_part"},{"id":"T176","span":{"begin":6598,"end":6603},"obj":"Body_part"},{"id":"T177","span":{"begin":6688,"end":6696},"obj":"Body_part"},{"id":"T178","span":{"begin":6753,"end":6765},"obj":"Body_part"},{"id":"T179","span":{"begin":6776,"end":6783},"obj":"Body_part"},{"id":"T180","span":{"begin":6840,"end":6847},"obj":"Body_part"},{"id":"T181","span":{"begin":6952,"end":6970},"obj":"Body_part"},{"id":"T182","span":{"begin":6978,"end":6985},"obj":"Body_part"},{"id":"T183","span":{"begin":7029,"end":7036},"obj":"Body_part"},{"id":"T184","span":{"begin":7054,"end":7066},"obj":"Body_part"},{"id":"T185","span":{"begin":7073,"end":7080},"obj":"Body_part"},{"id":"T186","span":{"begin":7102,"end":7109},"obj":"Body_part"},{"id":"T187","span":{"begin":7116,"end":7123},"obj":"Body_part"},{"id":"T188","span":{"begin":7148,"end":7155},"obj":"Body_part"},{"id":"T189","span":{"begin":7219,"end":7224},"obj":"Body_part"},{"id":"T190","span":{"begin":7241,"end":7248},"obj":"Body_part"},{"id":"T191","span":{"begin":7419,"end":7426},"obj":"Body_part"},{"id":"T192","span":{"begin":7456,"end":7463},"obj":"Body_part"},{"id":"T193","span":{"begin":7475,"end":7487},"obj":"Body_part"},{"id":"T194","span":{"begin":7494,"end":7506},"obj":"Body_part"},{"id":"T195","span":{"begin":7530,"end":7543},"obj":"Body_part"},{"id":"T196","span":{"begin":7553,"end":7568},"obj":"Body_part"},{"id":"T197","span":{"begin":7581,"end":7588},"obj":"Body_part"},{"id":"T198","span":{"begin":7589,"end":7592},"obj":"Body_part"},{"id":"T199","span":{"begin":7658,"end":7666},"obj":"Body_part"},{"id":"T200","span":{"begin":7787,"end":7794},"obj":"Body_part"},{"id":"T201","span":{"begin":7841,"end":7844},"obj":"Body_part"},{"id":"T202","span":{"begin":7990,"end":7993},"obj":"Body_part"},{"id":"T203","span":{"begin":8414,"end":8426},"obj":"Body_part"},{"id":"T204","span":{"begin":8635,"end":8647},"obj":"Body_part"}],"attributes":[{"id":"A59","pred":"fma_id","subj":"T59","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A60","pred":"fma_id","subj":"T60","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A61","pred":"fma_id","subj":"T61","obj":"http://purl.org/sig/ont/fma/fma67122"},{"id":"A62","pred":"fma_id","subj":"T62","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A63","pred":"fma_id","subj":"T63","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A64","pred":"fma_id","subj":"T64","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A65","pred":"fma_id","subj":"T65","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A66","pred":"fma_id","subj":"T66","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A67","pred":"fma_id","subj":"T67","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A68","pred":"fma_id","subj":"T68","obj":"http://purl.org/sig/ont/fma/fma67122"},{"id":"A69","pred":"fma_id","subj":"T69","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A70","pred":"fma_id","subj":"T70","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A71","pred":"fma_id","subj":"T71","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A72","pred":"fma_id","subj":"T72","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A73","pred":"fma_id","subj":"T73","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A74","pred":"fma_id","subj":"T74","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A75","pred":"fma_id","subj":"T75","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A76","pred":"fma_id","subj":"T76","obj":"http://purl.org/sig/ont/fma/fma67264"},{"id":"A77","pred":"fma_id","subj":"T77","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A78","pred":"fma_id","subj":"T78","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A79","pred":"fma_id","subj":"T79","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A80","pred":"fma_id","subj":"T80","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A81","pred":"fma_id","subj":"T81","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A82","pred":"fma_id","subj":"T82","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A83","pred":"fma_id","subj":"T83","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A84","pred":"fma_id","subj":"T84","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A85","pred":"fma_id","subj":"T85","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A86","pred":"fma_id","subj":"T86","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A87","pred":"fma_id","subj":"T87","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A88","pred":"fma_id","subj":"T88","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A89","pred":"fma_id","subj":"T89","obj":"http://purl.org/sig/ont/fma/fma63842"},{"id":"A90","pred":"fma_id","subj":"T90","obj":"http://purl.org/sig/ont/fma/fma63842"},{"id":"A91","pred":"fma_id","subj":"T91","obj":"http://purl.org/sig/ont/fma/fma63842"},{"id":"A92","pred":"fma_id","subj":"T92","obj":"http://purl.org/sig/ont/fma/fma76577"},{"id":"A93","pred":"fma_id","subj":"T93","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A94","pred":"fma_id","subj":"T94","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A95","pred":"fma_id","subj":"T95","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A96","pred":"fma_id","subj":"T96","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A97","pred":"fma_id","subj":"T97","obj":"http://purl.org/sig/ont/fma/fma66836"},{"id":"A98","pred":"fma_id","subj":"T98","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A99","pred":"fma_id","subj":"T99","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A100","pred":"fma_id","subj":"T100","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A101","pred":"fma_id","subj":"T101","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A102","pred":"fma_id","subj":"T102","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A103","pred":"fma_id","subj":"T103","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A104","pred":"fma_id","subj":"T104","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A105","pred":"fma_id","subj":"T105","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A106","pred":"fma_id","subj":"T106","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A107","pred":"fma_id","subj":"T107","obj":"http://purl.org/sig/ont/fma/fma84050"},{"id":"A108","pred":"fma_id","subj":"T108","obj":"http://purl.org/sig/ont/fma/fma62376"},{"id":"A109","pred":"fma_id","subj":"T109","obj":"http://purl.org/sig/ont/fma/fma62376"},{"id":"A110","pred":"fma_id","subj":"T110","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A111","pred":"fma_id","subj":"T111","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A112","pred":"fma_id","subj":"T112","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A113","pred":"fma_id","subj":"T113","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A114","pred":"fma_id","subj":"T114","obj":"http://purl.org/sig/ont/fma/fma84120"},{"id":"A115","pred":"fma_id","subj":"T115","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A116","pred":"fma_id","subj":"T116","obj":"http://purl.org/sig/ont/fma/fma84120"},{"id":"A117","pred":"fma_id","subj":"T117","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A118","pred":"fma_id","subj":"T118","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A119","pred":"fma_id","subj":"T119","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A120","pred":"fma_id","subj":"T120","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A121","pred":"fma_id","subj":"T121","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A122","pred":"fma_id","subj":"T122","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A123","pred":"fma_id","subj":"T123","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A124","pred":"fma_id","subj":"T124","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A125","pred":"fma_id","subj":"T125","obj":"http://purl.org/sig/ont/fma/fma63842"},{"id":"A126","pred":"fma_id","subj":"T126","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A127","pred":"fma_id","subj":"T127","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A128","pred":"fma_id","subj":"T128","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A129","pred":"fma_id","subj":"T129","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A130","pred":"fma_id","subj":"T130","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A131","pred":"fma_id","subj":"T131","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A132","pred":"fma_id","subj":"T132","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A133","pred":"fma_id","subj":"T133","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A134","pred":"fma_id","subj":"T134","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A135","pred":"fma_id","subj":"T135","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A136","pred":"fma_id","subj":"T136","obj":"http://purl.org/sig/ont/fma/fma63841"},{"id":"A137","pred":"fma_id","subj":"T137","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A138","pred":"fma_id","subj":"T138","obj":"http://purl.org/sig/ont/fma/fma66836"},{"id":"A139","pred":"fma_id","subj":"T139","obj":"http://purl.org/sig/ont/fma/fma82737"},{"id":"A140","pred":"fma_id","subj":"T140","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A141","pred":"fma_id","subj":"T141","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A142","pred":"fma_id","subj":"T142","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A143","pred":"fma_id","subj":"T143","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A144","pred":"fma_id","subj":"T144","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A145","pred":"fma_id","subj":"T145","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A146","pred":"fma_id","subj":"T146","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A147","pred":"fma_id","subj":"T147","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A148","pred":"fma_id","subj":"T148","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A149","pred":"fma_id","subj":"T149","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A150","pred":"fma_id","subj":"T150","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A151","pred":"fma_id","subj":"T151","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A152","pred":"fma_id","subj":"T152","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A153","pred":"fma_id","subj":"T153","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A154","pred":"fma_id","subj":"T154","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A155","pred":"fma_id","subj":"T155","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A156","pred":"fma_id","subj":"T156","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A157","pred":"fma_id","subj":"T157","obj":"http://purl.org/sig/ont/fma/fma82747"},{"id":"A158","pred":"fma_id","subj":"T158","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A159","pred":"fma_id","subj":"T159","obj":"http://purl.org/sig/ont/fma/fma66592"},{"id":"A160","pred":"fma_id","subj":"T160","obj":"http://purl.org/sig/ont/fma/fma66592"},{"id":"A161","pred":"fma_id","subj":"T161","obj":"http://purl.org/sig/ont/fma/fma67122"},{"id":"A162","pred":"fma_id","subj":"T162","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A163","pred":"fma_id","subj":"T163","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A164","pred":"fma_id","subj":"T164","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A165","pred":"fma_id","subj":"T165","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A166","pred":"fma_id","subj":"T166","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A167","pred":"fma_id","subj":"T167","obj":"http://purl.org/sig/ont/fma/fma66836"},{"id":"A168","pred":"fma_id","subj":"T168","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A169","pred":"fma_id","subj":"T169","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A170","pred":"fma_id","subj":"T170","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A171","pred":"fma_id","subj":"T171","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A172","pred":"fma_id","subj":"T172","obj":"http://purl.org/sig/ont/fma/fma66836"},{"id":"A173","pred":"fma_id","subj":"T173","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A174","pred":"fma_id","subj":"T174","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A175","pred":"fma_id","subj":"T175","obj":"http://purl.org/sig/ont/fma/fma66836"},{"id":"A176","pred":"fma_id","subj":"T176","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A177","pred":"fma_id","subj":"T177","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A178","pred":"fma_id","subj":"T178","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A179","pred":"fma_id","subj":"T179","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A180","pred":"fma_id","subj":"T180","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A181","pred":"fma_id","subj":"T181","obj":"http://purl.org/sig/ont/fma/fma72443"},{"id":"A182","pred":"fma_id","subj":"T182","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A183","pred":"fma_id","subj":"T183","obj":"http://purl.org/sig/ont/fma/fma66836"},{"id":"A184","pred":"fma_id","subj":"T184","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A185","pred":"fma_id","subj":"T185","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A186","pred":"fma_id","subj":"T186","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A187","pred":"fma_id","subj":"T187","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A188","pred":"fma_id","subj":"T188","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A189","pred":"fma_id","subj":"T189","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A190","pred":"fma_id","subj":"T190","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A191","pred":"fma_id","subj":"T191","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A192","pred":"fma_id","subj":"T192","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A193","pred":"fma_id","subj":"T193","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A194","pred":"fma_id","subj":"T194","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A195","pred":"fma_id","subj":"T195","obj":"http://purl.org/sig/ont/fma/fma63843"},{"id":"A196","pred":"fma_id","subj":"T196","obj":"http://purl.org/sig/ont/fma/fma67906"},{"id":"A197","pred":"fma_id","subj":"T197","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A198","pred":"fma_id","subj":"T198","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A199","pred":"fma_id","subj":"T199","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A200","pred":"fma_id","subj":"T200","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A201","pred":"fma_id","subj":"T201","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A202","pred":"fma_id","subj":"T202","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A203","pred":"fma_id","subj":"T203","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A204","pred":"fma_id","subj":"T204","obj":"http://purl.org/sig/ont/fma/fma62845"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}
LitCovid-PD-UBERON
{"project":"LitCovid-PD-UBERON","denotations":[{"id":"T4","span":{"begin":541,"end":545},"obj":"Body_part"}],"attributes":[{"id":"A4","pred":"uberon_id","subj":"T4","obj":"http://purl.obolibrary.org/obo/UBERON_0002415"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}
LitCovid-PubTator
{"project":"LitCovid-PubTator","denotations":[{"id":"426","span":{"begin":723,"end":726},"obj":"Gene"},{"id":"427","span":{"begin":48,"end":52},"obj":"Species"},{"id":"428","span":{"begin":97,"end":110},"obj":"Species"},{"id":"429","span":{"begin":216,"end":220},"obj":"Species"},{"id":"430","span":{"begin":395,"end":405},"obj":"Species"},{"id":"431","span":{"begin":504,"end":507},"obj":"Gene"},{"id":"432","span":{"begin":471,"end":476},"obj":"Chemical"},{"id":"433","span":{"begin":205,"end":214},"obj":"Disease"},{"id":"449","span":{"begin":1158,"end":1159},"obj":"Gene"},{"id":"450","span":{"begin":1482,"end":1483},"obj":"Gene"},{"id":"451","span":{"begin":1222,"end":1223},"obj":"Gene"},{"id":"452","span":{"begin":1104,"end":1105},"obj":"Gene"},{"id":"453","span":{"begin":1476,"end":1477},"obj":"Gene"},{"id":"454","span":{"begin":1341,"end":1342},"obj":"Gene"},{"id":"455","span":{"begin":1216,"end":1217},"obj":"Gene"},{"id":"456","span":{"begin":1338,"end":1339},"obj":"Gene"},{"id":"457","span":{"begin":1213,"end":1214},"obj":"Gene"},{"id":"458","span":{"begin":1162,"end":1163},"obj":"Gene"},{"id":"459","span":{"begin":965,"end":973},"obj":"Gene"},{"id":"460","span":{"begin":830,"end":838},"obj":"Gene"},{"id":"461","span":{"begin":1046,"end":1055},"obj":"Species"},{"id":"462","span":{"begin":1057,"end":1067},"obj":"Species"},{"id":"463","span":{"begin":1080,"end":1085},"obj":"Gene"},{"id":"467","span":{"begin":2851,"end":2854},"obj":"Gene"},{"id":"468","span":{"begin":2182,"end":2185},"obj":"Species"},{"id":"469","span":{"begin":3791,"end":3799},"obj":"Disease"},{"id":"471","span":{"begin":3856,"end":3861},"obj":"Gene"},{"id":"486","span":{"begin":4604,"end":4609},"obj":"Gene"},{"id":"487","span":{"begin":4220,"end":4221},"obj":"Gene"},{"id":"488","span":{"begin":4119,"end":4120},"obj":"Gene"},{"id":"489","span":{"begin":4235,"end":4236},"obj":"Gene"},{"id":"490","span":{"begin":4178,"end":4179},"obj":"Gene"},{"id":"491","span":{"begin":4095,"end":4096},"obj":"Gene"},{"id":"492","span":{"begin":5063,"end":5066},"obj":"Gene"},{"id":"493","span":{"begin":4464,"end":4469},"obj":"Gene"},{"id":"494","span":{"begin":4084,"end":4094},"obj":"Species"},{"id":"495","span":{"begin":4344,"end":4348},"obj":"Species"},{"id":"496","span":{"begin":4862,"end":4870},"obj":"Species"},{"id":"497","span":{"begin":4875,"end":4883},"obj":"Species"},{"id":"498","span":{"begin":4997,"end":5009},"obj":"Chemical"},{"id":"499","span":{"begin":4323,"end":4332},"obj":"Disease"},{"id":"510","span":{"begin":5272,"end":5275},"obj":"Gene"},{"id":"511","span":{"begin":5868,"end":5910},"obj":"Gene"},{"id":"512","span":{"begin":5912,"end":5920},"obj":"Gene"},{"id":"513","span":{"begin":6003,"end":6010},"obj":"Gene"},{"id":"514","span":{"begin":5365,"end":5368},"obj":"Gene"},{"id":"515","span":{"begin":5379,"end":5382},"obj":"Species"},{"id":"516","span":{"begin":5468,"end":5471},"obj":"Species"},{"id":"517","span":{"begin":5748,"end":5756},"obj":"Species"},{"id":"518","span":{"begin":5840,"end":5848},"obj":"Species"},{"id":"519","span":{"begin":6026,"end":6034},"obj":"Species"},{"id":"521","span":{"begin":6138,"end":6146},"obj":"Gene"},{"id":"526","span":{"begin":6402,"end":6405},"obj":"Gene"},{"id":"527","span":{"begin":6501,"end":6504},"obj":"Gene"},{"id":"528","span":{"begin":6655,"end":6663},"obj":"Gene"},{"id":"529","span":{"begin":6589,"end":6597},"obj":"Disease"},{"id":"534","span":{"begin":6941,"end":6944},"obj":"Gene"},{"id":"535","span":{"begin":7688,"end":7696},"obj":"Gene"},{"id":"536","span":{"begin":7265,"end":7268},"obj":"Species"},{"id":"537","span":{"begin":7285,"end":7289},"obj":"Species"},{"id":"539","span":{"begin":7751,"end":7773},"obj":"Gene"},{"id":"552","span":{"begin":8561,"end":8563},"obj":"Gene"},{"id":"553","span":{"begin":7854,"end":7858},"obj":"Species"},{"id":"554","span":{"begin":8189,"end":8206},"obj":"Species"},{"id":"555","span":{"begin":8227,"end":8230},"obj":"Species"},{"id":"556","span":{"begin":8334,"end":8337},"obj":"Species"},{"id":"557","span":{"begin":8627,"end":8634},"obj":"Species"},{"id":"558","span":{"begin":8775,"end":8792},"obj":"Species"},{"id":"559","span":{"begin":8839,"end":8850},"obj":"Species"},{"id":"560","span":{"begin":8581,"end":8590},"obj":"Species"},{"id":"561","span":{"begin":8811,"end":8820},"obj":"Species"},{"id":"562","span":{"begin":7795,"end":7813},"obj":"Chemical"},{"id":"563","span":{"begin":8260,"end":8279},"obj":"Disease"}],"attributes":[{"id":"A426","pred":"tao:has_database_id","subj":"426","obj":"Gene:5131"},{"id":"A427","pred":"tao:has_database_id","subj":"427","obj":"Tax:11118"},{"id":"A428","pred":"tao:has_database_id","subj":"428","obj":"Tax:439488"},{"id":"A429","pred":"tao:has_database_id","subj":"429","obj":"Tax:11118"},{"id":"A430","pred":"tao:has_database_id","subj":"430","obj":"Tax:2697049"},{"id":"A431","pred":"tao:has_database_id","subj":"431","obj":"Gene:28884"},{"id":"A433","pred":"tao:has_database_id","subj":"433","obj":"MESH:D007239"},{"id":"A449","pred":"tao:has_database_id","subj":"449","obj":"Gene:43740571"},{"id":"A450","pred":"tao:has_database_id","subj":"450","obj":"Gene:43740570"},{"id":"A451","pred":"tao:has_database_id","subj":"451","obj":"Gene:43740570"},{"id":"A452","pred":"tao:has_database_id","subj":"452","obj":"Gene:43740570"},{"id":"A453","pred":"tao:has_database_id","subj":"453","obj":"Gene:43740571"},{"id":"A454","pred":"tao:has_database_id","subj":"454","obj":"Gene:43740571"},{"id":"A455","pred":"tao:has_database_id","subj":"455","obj":"Gene:43740571"},{"id":"A456","pred":"tao:has_database_id","subj":"456","obj":"Gene:43740575"},{"id":"A457","pred":"tao:has_database_id","subj":"457","obj":"Gene:43740575"},{"id":"A458","pred":"tao:has_database_id","subj":"458","obj":"Gene:43740575"},{"id":"A459","pred":"tao:has_database_id","subj":"459","obj":"Gene:43740570"},{"id":"A460","pred":"tao:has_database_id","subj":"460","obj":"Gene:43740570"},{"id":"A461","pred":"tao:has_database_id","subj":"461","obj":"Tax:2697049"},{"id":"A462","pred":"tao:has_database_id","subj":"462","obj":"Tax:2697049"},{"id":"A463","pred":"tao:has_database_id","subj":"463","obj":"Gene:43740568"},{"id":"A467","pred":"tao:has_database_id","subj":"467","obj":"Gene:5131"},{"id":"A468","pred":"tao:has_database_id","subj":"468","obj":"Tax:11118"},{"id":"A469","pred":"tao:has_database_id","subj":"469","obj":"MESH:D008545"},{"id":"A471","pred":"tao:has_database_id","subj":"471","obj":"Gene:43740568"},{"id":"A486","pred":"tao:has_database_id","subj":"486","obj":"Gene:43740568"},{"id":"A487","pred":"tao:has_database_id","subj":"487","obj":"Gene:43740575"},{"id":"A488","pred":"tao:has_database_id","subj":"488","obj":"Gene:43740575"},{"id":"A489","pred":"tao:has_database_id","subj":"489","obj":"Gene:43740568"},{"id":"A490","pred":"tao:has_database_id","subj":"490","obj":"Gene:43740568"},{"id":"A491","pred":"tao:has_database_id","subj":"491","obj":"Gene:43740568"},{"id":"A492","pred":"tao:has_database_id","subj":"492","obj":"Gene:1283"},{"id":"A493","pred":"tao:has_database_id","subj":"493","obj":"Gene:43740568"},{"id":"A494","pred":"tao:has_database_id","subj":"494","obj":"Tax:2697049"},{"id":"A495","pred":"tao:has_database_id","subj":"495","obj":"Tax:11118"},{"id":"A496","pred":"tao:has_database_id","subj":"496","obj":"Tax:694009"},{"id":"A497","pred":"tao:has_database_id","subj":"497","obj":"Tax:1335626"},{"id":"A498","pred":"tao:has_database_id","subj":"498","obj":"MESH:D002241"},{"id":"A499","pred":"tao:has_database_id","subj":"499","obj":"MESH:D007239"},{"id":"A510","pred":"tao:has_database_id","subj":"510","obj":"Gene:1283"},{"id":"A511","pred":"tao:has_database_id","subj":"511","obj":"Gene:3178"},{"id":"A512","pred":"tao:has_database_id","subj":"512","obj":"Gene:3178"},{"id":"A513","pred":"tao:has_database_id","subj":"513","obj":"Gene:3178"},{"id":"A514","pred":"tao:has_database_id","subj":"514","obj":"Gene:5131"},{"id":"A515","pred":"tao:has_database_id","subj":"515","obj":"Tax:11118"},{"id":"A516","pred":"tao:has_database_id","subj":"516","obj":"Tax:11118"},{"id":"A517","pred":"tao:has_database_id","subj":"517","obj":"Tax:694009"},{"id":"A518","pred":"tao:has_database_id","subj":"518","obj":"Tax:694009"},{"id":"A519","pred":"tao:has_database_id","subj":"519","obj":"Tax:694009"},{"id":"A521","pred":"tao:has_database_id","subj":"521","obj":"Gene:43740570"},{"id":"A526","pred":"tao:has_database_id","subj":"526","obj":"Gene:1283"},{"id":"A527","pred":"tao:has_database_id","subj":"527","obj":"Gene:1283"},{"id":"A528","pred":"tao:has_database_id","subj":"528","obj":"Gene:43740570"},{"id":"A529","pred":"tao:has_database_id","subj":"529","obj":"MESH:D007239"},{"id":"A534","pred":"tao:has_database_id","subj":"534","obj":"Gene:1283"},{"id":"A535","pred":"tao:has_database_id","subj":"535","obj":"Gene:43740570"},{"id":"A536","pred":"tao:has_database_id","subj":"536","obj":"Tax:11118"},{"id":"A537","pred":"tao:has_database_id","subj":"537","obj":"Tax:11118"},{"id":"A539","pred":"tao:has_database_id","subj":"539","obj":"Gene:17603139"},{"id":"A552","pred":"tao:has_database_id","subj":"552","obj":"Gene:6296"},{"id":"A553","pred":"tao:has_database_id","subj":"553","obj":"Tax:11118"},{"id":"A554","pred":"tao:has_database_id","subj":"554","obj":"Tax:11552"},{"id":"A555","pred":"tao:has_database_id","subj":"555","obj":"Tax:11118"},{"id":"A556","pred":"tao:has_database_id","subj":"556","obj":"Tax:11118"},{"id":"A557","pred":"tao:has_database_id","subj":"557","obj":"Tax:9031"},{"id":"A558","pred":"tao:has_database_id","subj":"558","obj":"Tax:11308"},{"id":"A559","pred":"tao:has_database_id","subj":"559","obj":"Tax:11161"},{"id":"A560","pred":"tao:has_database_id","subj":"560","obj":"Tax:11552"},{"id":"A561","pred":"tao:has_database_id","subj":"561","obj":"Tax:11176"},{"id":"A563","pred":"tao:has_database_id","subj":"563","obj":"MESH:D000740"}],"namespaces":[{"prefix":"Tax","uri":"https://www.ncbi.nlm.nih.gov/taxonomy/"},{"prefix":"MESH","uri":"https://id.nlm.nih.gov/mesh/"},{"prefix":"Gene","uri":"https://www.ncbi.nlm.nih.gov/gene/"},{"prefix":"CVCL","uri":"https://web.expasy.org/cellosaurus/CVCL_"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}
LitCovid-PD-MONDO
{"project":"LitCovid-PD-MONDO","denotations":[{"id":"T56","span":{"begin":205,"end":214},"obj":"Disease"},{"id":"T57","span":{"begin":395,"end":403},"obj":"Disease"},{"id":"T58","span":{"begin":1057,"end":1065},"obj":"Disease"},{"id":"T59","span":{"begin":3791,"end":3799},"obj":"Disease"},{"id":"T60","span":{"begin":4012,"end":4015},"obj":"Disease"},{"id":"T62","span":{"begin":4084,"end":4092},"obj":"Disease"},{"id":"T63","span":{"begin":4323,"end":4332},"obj":"Disease"},{"id":"T64","span":{"begin":4862,"end":4870},"obj":"Disease"},{"id":"T65","span":{"begin":4984,"end":4987},"obj":"Disease"},{"id":"T67","span":{"begin":5264,"end":5267},"obj":"Disease"},{"id":"T69","span":{"begin":5748,"end":5756},"obj":"Disease"},{"id":"T70","span":{"begin":5840,"end":5848},"obj":"Disease"},{"id":"T71","span":{"begin":6026,"end":6034},"obj":"Disease"},{"id":"T72","span":{"begin":6364,"end":6367},"obj":"Disease"},{"id":"T74","span":{"begin":6472,"end":6475},"obj":"Disease"},{"id":"T76","span":{"begin":6905,"end":6908},"obj":"Disease"},{"id":"T78","span":{"begin":8189,"end":8198},"obj":"Disease"},{"id":"T79","span":{"begin":8267,"end":8273},"obj":"Disease"},{"id":"T80","span":{"begin":8581,"end":8590},"obj":"Disease"},{"id":"T81","span":{"begin":8775,"end":8784},"obj":"Disease"},{"id":"T82","span":{"begin":8811,"end":8828},"obj":"Disease"},{"id":"T83","span":{"begin":8839,"end":8844},"obj":"Disease"}],"attributes":[{"id":"A56","pred":"mondo_id","subj":"T56","obj":"http://purl.obolibrary.org/obo/MONDO_0005550"},{"id":"A57","pred":"mondo_id","subj":"T57","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A58","pred":"mondo_id","subj":"T58","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A59","pred":"mondo_id","subj":"T59","obj":"http://purl.obolibrary.org/obo/MONDO_0005105"},{"id":"A60","pred":"mondo_id","subj":"T60","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A61","pred":"mondo_id","subj":"T60","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A62","pred":"mondo_id","subj":"T62","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A63","pred":"mondo_id","subj":"T63","obj":"http://purl.obolibrary.org/obo/MONDO_0005550"},{"id":"A64","pred":"mondo_id","subj":"T64","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A65","pred":"mondo_id","subj":"T65","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A66","pred":"mondo_id","subj":"T65","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A67","pred":"mondo_id","subj":"T67","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A68","pred":"mondo_id","subj":"T67","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A69","pred":"mondo_id","subj":"T69","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A70","pred":"mondo_id","subj":"T70","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A71","pred":"mondo_id","subj":"T71","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A72","pred":"mondo_id","subj":"T72","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A73","pred":"mondo_id","subj":"T72","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A74","pred":"mondo_id","subj":"T74","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A75","pred":"mondo_id","subj":"T74","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A76","pred":"mondo_id","subj":"T76","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A77","pred":"mondo_id","subj":"T76","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A78","pred":"mondo_id","subj":"T78","obj":"http://purl.obolibrary.org/obo/MONDO_0005812"},{"id":"A79","pred":"mondo_id","subj":"T79","obj":"http://purl.obolibrary.org/obo/MONDO_0002280"},{"id":"A80","pred":"mondo_id","subj":"T80","obj":"http://purl.obolibrary.org/obo/MONDO_0005812"},{"id":"A81","pred":"mondo_id","subj":"T81","obj":"http://purl.obolibrary.org/obo/MONDO_0005812"},{"id":"A82","pred":"mondo_id","subj":"T82","obj":"http://purl.obolibrary.org/obo/MONDO_0005875"},{"id":"A83","pred":"mondo_id","subj":"T83","obj":"http://purl.obolibrary.org/obo/MONDO_0000989"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}
LitCovid-PD-CLO
{"project":"LitCovid-PD-CLO","denotations":[{"id":"T111","span":{"begin":103,"end":110},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T112","span":{"begin":250,"end":252},"obj":"http://purl.obolibrary.org/obo/CLO_0007074"},{"id":"T113","span":{"begin":250,"end":252},"obj":"http://purl.obolibrary.org/obo/CLO_0051988"},{"id":"T114","span":{"begin":334,"end":339},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T115","span":{"begin":492,"end":498},"obj":"http://purl.obolibrary.org/obo/UBERON_0002415"},{"id":"T116","span":{"begin":504,"end":505},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T117","span":{"begin":539,"end":540},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T118","span":{"begin":541,"end":545},"obj":"http://purl.obolibrary.org/obo/UBERON_0002415"},{"id":"T119","span":{"begin":594,"end":599},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T120","span":{"begin":856,"end":857},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T121","span":{"begin":858,"end":863},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T122","span":{"begin":869,"end":870},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T123","span":{"begin":884,"end":887},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T124","span":{"begin":888,"end":889},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T125","span":{"begin":949,"end":956},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T126","span":{"begin":1078,"end":1079},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T127","span":{"begin":1126,"end":1127},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T128","span":{"begin":1128,"end":1136},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T129","span":{"begin":1386,"end":1387},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T130","span":{"begin":1457,"end":1462},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T131","span":{"begin":1538,"end":1546},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T132","span":{"begin":1700,"end":1705},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T133","span":{"begin":2221,"end":2226},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T134","span":{"begin":2523,"end":2528},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T135","span":{"begin":2622,"end":2623},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T136","span":{"begin":2677,"end":2687},"obj":"http://purl.obolibrary.org/obo/CLO_0001658"},{"id":"T137","span":{"begin":2780,"end":2788},"obj":"http://purl.obolibrary.org/obo/CLO_0001658"},{"id":"T138","span":{"begin":2892,"end":2897},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T139","span":{"begin":2928,"end":2929},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T140","span":{"begin":2961,"end":2969},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T141","span":{"begin":2994,"end":2997},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T142","span":{"begin":2998,"end":2999},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T143","span":{"begin":3040,"end":3045},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T144","span":{"begin":3084,"end":3085},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T145","span":{"begin":3232,"end":3233},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T146","span":{"begin":3286,"end":3295},"obj":"http://purl.obolibrary.org/obo/CLO_0001658"},{"id":"T147","span":{"begin":3342,"end":3350},"obj":"http://purl.obolibrary.org/obo/CLO_0001658"},{"id":"T148","span":{"begin":3391,"end":3392},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T149","span":{"begin":3498,"end":3499},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T150","span":{"begin":3513,"end":3517},"obj":"http://purl.obolibrary.org/obo/CLO_0001000"},{"id":"T151","span":{"begin":3687,"end":3688},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T152","span":{"begin":3731,"end":3734},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T153","span":{"begin":3752,"end":3760},"obj":"http://purl.obolibrary.org/obo/CLO_0001658"},{"id":"T154","span":{"begin":3900,"end":3907},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T155","span":{"begin":4077,"end":4082},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T156","span":{"begin":4116,"end":4118},"obj":"http://purl.obolibrary.org/obo/CLO_0050507"},{"id":"T157","span":{"begin":4294,"end":4299},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T158","span":{"begin":4310,"end":4315},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T159","span":{"begin":4357,"end":4358},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T160","span":{"begin":4449,"end":4452},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T161","span":{"begin":4453,"end":4454},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T162","span":{"begin":4529,"end":4530},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T163","span":{"begin":4617,"end":4625},"obj":"http://purl.obolibrary.org/obo/PR_000018263"},{"id":"T164","span":{"begin":4649,"end":4654},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T165","span":{"begin":4669,"end":4673},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T166","span":{"begin":4675,"end":4677},"obj":"http://purl.obolibrary.org/obo/CLO_0050510"},{"id":"T167","span":{"begin":4714,"end":4716},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T168","span":{"begin":4721,"end":4723},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T169","span":{"begin":4721,"end":4723},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T170","span":{"begin":4733,"end":4734},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T171","span":{"begin":4735,"end":4737},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T172","span":{"begin":4738,"end":4740},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T173","span":{"begin":4738,"end":4740},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T174","span":{"begin":4756,"end":4758},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T175","span":{"begin":4762,"end":4763},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T176","span":{"begin":4806,"end":4810},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T177","span":{"begin":4811,"end":4819},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T178","span":{"begin":4824,"end":4826},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T179","span":{"begin":4824,"end":4826},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T180","span":{"begin":4889,"end":4891},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T181","span":{"begin":4923,"end":4924},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T182","span":{"begin":4938,"end":4943},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T183","span":{"begin":4981,"end":4983},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T184","span":{"begin":5032,"end":5034},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T185","span":{"begin":5070,"end":5072},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T186","span":{"begin":5150,"end":5157},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T187","span":{"begin":5457,"end":5462},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T188","span":{"begin":5828,"end":5836},"obj":"http://purl.obolibrary.org/obo/CLO_0001658"},{"id":"T189","span":{"begin":6053,"end":6054},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T190","span":{"begin":6228,"end":6233},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T191","span":{"begin":6263,"end":6264},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T192","span":{"begin":6327,"end":6328},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T193","span":{"begin":6357,"end":6360},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T194","span":{"begin":6400,"end":6401},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T195","span":{"begin":6509,"end":6512},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T196","span":{"begin":6598,"end":6603},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T197","span":{"begin":6613,"end":6614},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T198","span":{"begin":6721,"end":6726},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T199","span":{"begin":6744,"end":6752},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T200","span":{"begin":6853,"end":6854},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T201","span":{"begin":6884,"end":6885},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T202","span":{"begin":6992,"end":6993},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T203","span":{"begin":7007,"end":7012},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T204","span":{"begin":7082,"end":7084},"obj":"http://purl.obolibrary.org/obo/CLO_0050507"},{"id":"T205","span":{"begin":7164,"end":7169},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T206","span":{"begin":7219,"end":7224},"obj":"http://purl.obolibrary.org/obo/GO_0005623"},{"id":"T207","span":{"begin":7325,"end":7333},"obj":"http://purl.obolibrary.org/obo/UBERON_0000158"},{"id":"T208","span":{"begin":7553,"end":7568},"obj":"http://purl.obolibrary.org/obo/GO_0043234"},{"id":"T209","span":{"begin":7682,"end":7687},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T210","span":{"begin":7723,"end":7728},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T211","span":{"begin":7845,"end":7852},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T212","span":{"begin":7864,"end":7868},"obj":"http://purl.obolibrary.org/obo/CLO_0008774"},{"id":"T213","span":{"begin":7994,"end":8001},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T214","span":{"begin":8028,"end":8032},"obj":"http://purl.obolibrary.org/obo/CLO_0008774"},{"id":"T215","span":{"begin":8201,"end":8206},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T216","span":{"begin":8274,"end":8279},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T217","span":{"begin":8383,"end":8390},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T218","span":{"begin":8518,"end":8521},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T219","span":{"begin":8537,"end":8545},"obj":"http://purl.obolibrary.org/obo/CLO_0001658"},{"id":"T220","span":{"begin":8591,"end":8592},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T221","span":{"begin":8593,"end":8594},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T222","span":{"begin":8595,"end":8602},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T223","span":{"begin":8627,"end":8634},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9031"},{"id":"T224","span":{"begin":8635,"end":8647},"obj":"http://purl.obolibrary.org/obo/CL_0000232"},{"id":"T225","span":{"begin":8657,"end":8659},"obj":"http://purl.obolibrary.org/obo/CLO_0050509"},{"id":"T226","span":{"begin":8711,"end":8716},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T227","span":{"begin":8785,"end":8792},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T228","span":{"begin":8845,"end":8850},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}
LitCovid-PD-CHEBI
{"project":"LitCovid-PD-CHEBI","denotations":[{"id":"T259","span":{"begin":381,"end":389},"obj":"Chemical"},{"id":"T260","span":{"begin":454,"end":465},"obj":"Chemical"},{"id":"T261","span":{"begin":454,"end":459},"obj":"Chemical"},{"id":"T262","span":{"begin":460,"end":465},"obj":"Chemical"},{"id":"T263","span":{"begin":911,"end":918},"obj":"Chemical"},{"id":"T264","span":{"begin":979,"end":984},"obj":"Chemical"},{"id":"T265","span":{"begin":1027,"end":1034},"obj":"Chemical"},{"id":"T266","span":{"begin":1090,"end":1102},"obj":"Chemical"},{"id":"T267","span":{"begin":1144,"end":1156},"obj":"Chemical"},{"id":"T268","span":{"begin":1193,"end":1201},"obj":"Chemical"},{"id":"T269","span":{"begin":1224,"end":1232},"obj":"Chemical"},{"id":"T270","span":{"begin":1259,"end":1267},"obj":"Chemical"},{"id":"T271","span":{"begin":1349,"end":1362},"obj":"Chemical"},{"id":"T272","span":{"begin":1370,"end":1382},"obj":"Chemical"},{"id":"T273","span":{"begin":1395,"end":1402},"obj":"Chemical"},{"id":"T274","span":{"begin":1484,"end":1492},"obj":"Chemical"},{"id":"T275","span":{"begin":1968,"end":1976},"obj":"Chemical"},{"id":"T276","span":{"begin":2317,"end":2325},"obj":"Chemical"},{"id":"T277","span":{"begin":2495,"end":2502},"obj":"Chemical"},{"id":"T278","span":{"begin":2587,"end":2594},"obj":"Chemical"},{"id":"T279","span":{"begin":2652,"end":2654},"obj":"Chemical"},{"id":"T281","span":{"begin":2656,"end":2663},"obj":"Chemical"},{"id":"T282","span":{"begin":2735,"end":2742},"obj":"Chemical"},{"id":"T283","span":{"begin":2747,"end":2750},"obj":"Chemical"},{"id":"T286","span":{"begin":2751,"end":2757},"obj":"Chemical"},{"id":"T288","span":{"begin":2930,"end":2932},"obj":"Chemical"},{"id":"T290","span":{"begin":2942,"end":2949},"obj":"Chemical"},{"id":"T291","span":{"begin":3086,"end":3088},"obj":"Chemical"},{"id":"T293","span":{"begin":3098,"end":3105},"obj":"Chemical"},{"id":"T294","span":{"begin":3125,"end":3133},"obj":"Chemical"},{"id":"T295","span":{"begin":3273,"end":3280},"obj":"Chemical"},{"id":"T296","span":{"begin":3827,"end":3834},"obj":"Chemical"},{"id":"T297","span":{"begin":3880,"end":3892},"obj":"Chemical"},{"id":"T298","span":{"begin":3915,"end":3927},"obj":"Chemical"},{"id":"T299","span":{"begin":3955,"end":3962},"obj":"Chemical"},{"id":"T300","span":{"begin":4097,"end":4109},"obj":"Chemical"},{"id":"T301","span":{"begin":4119,"end":4127},"obj":"Chemical"},{"id":"T302","span":{"begin":4180,"end":4192},"obj":"Chemical"},{"id":"T303","span":{"begin":4220,"end":4229},"obj":"Chemical"},{"id":"T304","span":{"begin":4222,"end":4229},"obj":"Chemical"},{"id":"T305","span":{"begin":4237,"end":4249},"obj":"Chemical"},{"id":"T306","span":{"begin":4369,"end":4381},"obj":"Chemical"},{"id":"T307","span":{"begin":4436,"end":4448},"obj":"Chemical"},{"id":"T308","span":{"begin":4488,"end":4500},"obj":"Chemical"},{"id":"T309","span":{"begin":4570,"end":4582},"obj":"Chemical"},{"id":"T310","span":{"begin":4617,"end":4625},"obj":"Chemical"},{"id":"T311","span":{"begin":4721,"end":4723},"obj":"Chemical"},{"id":"T312","span":{"begin":4738,"end":4740},"obj":"Chemical"},{"id":"T313","span":{"begin":4824,"end":4826},"obj":"Chemical"},{"id":"T314","span":{"begin":4997,"end":5009},"obj":"Chemical"},{"id":"T315","span":{"begin":5084,"end":5091},"obj":"Chemical"},{"id":"T316","span":{"begin":5135,"end":5142},"obj":"Chemical"},{"id":"T317","span":{"begin":5165,"end":5172},"obj":"Chemical"},{"id":"T318","span":{"begin":5212,"end":5219},"obj":"Chemical"},{"id":"T319","span":{"begin":5283,"end":5290},"obj":"Chemical"},{"id":"T320","span":{"begin":5543,"end":5555},"obj":"Chemical"},{"id":"T321","span":{"begin":5551,"end":5555},"obj":"Chemical"},{"id":"T322","span":{"begin":5564,"end":5572},"obj":"Chemical"},{"id":"T323","span":{"begin":5635,"end":5642},"obj":"Chemical"},{"id":"T324","span":{"begin":5673,"end":5680},"obj":"Chemical"},{"id":"T325","span":{"begin":5695,"end":5703},"obj":"Chemical"},{"id":"T326","span":{"begin":5759,"end":5766},"obj":"Chemical"},{"id":"T327","span":{"begin":5967,"end":5974},"obj":"Chemical"},{"id":"T328","span":{"begin":6037,"end":6044},"obj":"Chemical"},{"id":"T329","span":{"begin":6151,"end":6158},"obj":"Chemical"},{"id":"T330","span":{"begin":6161,"end":6168},"obj":"Chemical"},{"id":"T331","span":{"begin":6201,"end":6208},"obj":"Chemical"},{"id":"T332","span":{"begin":6265,"end":6267},"obj":"Chemical"},{"id":"T334","span":{"begin":6268,"end":6275},"obj":"Chemical"},{"id":"T335","span":{"begin":6279,"end":6286},"obj":"Chemical"},{"id":"T336","span":{"begin":6314,"end":6321},"obj":"Chemical"},{"id":"T337","span":{"begin":6329,"end":6331},"obj":"Chemical"},{"id":"T338","span":{"begin":6349,"end":6356},"obj":"Chemical"},{"id":"T339","span":{"begin":6437,"end":6444},"obj":"Chemical"},{"id":"T340","span":{"begin":6521,"end":6524},"obj":"Chemical"},{"id":"T341","span":{"begin":6543,"end":6550},"obj":"Chemical"},{"id":"T342","span":{"begin":6688,"end":6696},"obj":"Chemical"},{"id":"T343","span":{"begin":6753,"end":6765},"obj":"Chemical"},{"id":"T344","span":{"begin":6776,"end":6783},"obj":"Chemical"},{"id":"T345","span":{"begin":6799,"end":6801},"obj":"Chemical"},{"id":"T347","span":{"begin":6840,"end":6847},"obj":"Chemical"},{"id":"T348","span":{"begin":6855,"end":6857},"obj":"Chemical"},{"id":"T349","span":{"begin":6978,"end":6985},"obj":"Chemical"},{"id":"T350","span":{"begin":7054,"end":7066},"obj":"Chemical"},{"id":"T351","span":{"begin":7073,"end":7080},"obj":"Chemical"},{"id":"T352","span":{"begin":7102,"end":7109},"obj":"Chemical"},{"id":"T353","span":{"begin":7116,"end":7123},"obj":"Chemical"},{"id":"T354","span":{"begin":7148,"end":7155},"obj":"Chemical"},{"id":"T355","span":{"begin":7241,"end":7248},"obj":"Chemical"},{"id":"T356","span":{"begin":7419,"end":7426},"obj":"Chemical"},{"id":"T357","span":{"begin":7456,"end":7463},"obj":"Chemical"},{"id":"T358","span":{"begin":7475,"end":7487},"obj":"Chemical"},{"id":"T359","span":{"begin":7494,"end":7506},"obj":"Chemical"},{"id":"T360","span":{"begin":7549,"end":7552},"obj":"Chemical"},{"id":"T361","span":{"begin":7553,"end":7560},"obj":"Chemical"},{"id":"T362","span":{"begin":7581,"end":7588},"obj":"Chemical"},{"id":"T363","span":{"begin":7658,"end":7666},"obj":"Chemical"},{"id":"T364","span":{"begin":7787,"end":7794},"obj":"Chemical"},{"id":"T365","span":{"begin":8099,"end":8101},"obj":"Chemical"},{"id":"T366","span":{"begin":8107,"end":8109},"obj":"Chemical"},{"id":"T371","span":{"begin":8112,"end":8118},"obj":"Chemical"},{"id":"T373","span":{"begin":8139,"end":8141},"obj":"Chemical"},{"id":"T378","span":{"begin":8144,"end":8150},"obj":"Chemical"},{"id":"T380","span":{"begin":8338,"end":8340},"obj":"Chemical"},{"id":"T385","span":{"begin":8414,"end":8426},"obj":"Chemical"},{"id":"T386","span":{"begin":8469,"end":8471},"obj":"Chemical"},{"id":"T391","span":{"begin":8561,"end":8563},"obj":"Chemical"}],"attributes":[{"id":"A259","pred":"chebi_id","subj":"T259","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A260","pred":"chebi_id","subj":"T260","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A261","pred":"chebi_id","subj":"T261","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A262","pred":"chebi_id","subj":"T262","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A263","pred":"chebi_id","subj":"T263","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A264","pred":"chebi_id","subj":"T264","obj":"http://purl.obolibrary.org/obo/CHEBI_18059"},{"id":"A265","pred":"chebi_id","subj":"T265","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A266","pred":"chebi_id","subj":"T266","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A267","pred":"chebi_id","subj":"T267","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A268","pred":"chebi_id","subj":"T268","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A269","pred":"chebi_id","subj":"T269","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A270","pred":"chebi_id","subj":"T270","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A271","pred":"chebi_id","subj":"T271","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A272","pred":"chebi_id","subj":"T272","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A273","pred":"chebi_id","subj":"T273","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A274","pred":"chebi_id","subj":"T274","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A275","pred":"chebi_id","subj":"T275","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A276","pred":"chebi_id","subj":"T276","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A277","pred":"chebi_id","subj":"T277","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A278","pred":"chebi_id","subj":"T278","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A279","pred":"chebi_id","subj":"T279","obj":"http://purl.obolibrary.org/obo/CHEBI_55460"},{"id":"A280","pred":"chebi_id","subj":"T279","obj":"http://purl.obolibrary.org/obo/CHEBI_74861"},{"id":"A281","pred":"chebi_id","subj":"T281","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A282","pred":"chebi_id","subj":"T282","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A283","pred":"chebi_id","subj":"T283","obj":"http://purl.obolibrary.org/obo/CHEBI_16761"},{"id":"A284","pred":"chebi_id","subj":"T283","obj":"http://purl.obolibrary.org/obo/CHEBI_456216"},{"id":"A285","pred":"chebi_id","subj":"T283","obj":"http://purl.obolibrary.org/obo/CHEBI_73342"},{"id":"A286","pred":"chebi_id","subj":"T286","obj":"http://purl.obolibrary.org/obo/CHEBI_33942"},{"id":"A287","pred":"chebi_id","subj":"T286","obj":"http://purl.obolibrary.org/obo/CHEBI_47013"},{"id":"A288","pred":"chebi_id","subj":"T288","obj":"http://purl.obolibrary.org/obo/CHEBI_55460"},{"id":"A289","pred":"chebi_id","subj":"T288","obj":"http://purl.obolibrary.org/obo/CHEBI_74861"},{"id":"A290","pred":"chebi_id","subj":"T290","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A291","pred":"chebi_id","subj":"T291","obj":"http://purl.obolibrary.org/obo/CHEBI_55460"},{"id":"A292","pred":"chebi_id","subj":"T291","obj":"http://purl.obolibrary.org/obo/CHEBI_74861"},{"id":"A293","pred":"chebi_id","subj":"T293","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A294","pred":"chebi_id","subj":"T294","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A295","pred":"chebi_id","subj":"T295","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A296","pred":"chebi_id","subj":"T296","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A297","pred":"chebi_id","subj":"T297","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A298","pred":"chebi_id","subj":"T298","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A299","pred":"chebi_id","subj":"T299","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A300","pred":"chebi_id","subj":"T300","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A301","pred":"chebi_id","subj":"T301","obj":"http://purl.obolibrary.org/obo/CHEBI_59520"},{"id":"A302","pred":"chebi_id","subj":"T302","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A303","pred":"chebi_id","subj":"T303","obj":"http://purl.obolibrary.org/obo/CHEBI_59520"},{"id":"A304","pred":"chebi_id","subj":"T304","obj":"http://purl.obolibrary.org/obo/CHEBI_18154"},{"id":"A305","pred":"chebi_id","subj":"T305","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A306","pred":"chebi_id","subj":"T306","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A307","pred":"chebi_id","subj":"T307","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A308","pred":"chebi_id","subj":"T308","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A309","pred":"chebi_id","subj":"T309","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A310","pred":"chebi_id","subj":"T310","obj":"http://purl.obolibrary.org/obo/CHEBI_16670"},{"id":"A311","pred":"chebi_id","subj":"T311","obj":"http://purl.obolibrary.org/obo/CHEBI_29387"},{"id":"A312","pred":"chebi_id","subj":"T312","obj":"http://purl.obolibrary.org/obo/CHEBI_29387"},{"id":"A313","pred":"chebi_id","subj":"T313","obj":"http://purl.obolibrary.org/obo/CHEBI_29387"},{"id":"A314","pred":"chebi_id","subj":"T314","obj":"http://purl.obolibrary.org/obo/CHEBI_16646"},{"id":"A315","pred":"chebi_id","subj":"T315","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A316","pred":"chebi_id","subj":"T316","obj":"http://purl.obolibrary.org/obo/CHEBI_16541"},{"id":"A317","pred":"chebi_id","subj":"T317","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A318","pred":"chebi_id","subj":"T318","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A319","pred":"chebi_id","subj":"T319","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A320","pred":"chebi_id","subj":"T320","obj":"http://purl.obolibrary.org/obo/CHEBI_33696"},{"id":"A321","pred":"chebi_id","subj":"T321","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A322","pred":"chebi_id","subj":"T322","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A323","pred":"chebi_id","subj":"T323","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A324","pred":"chebi_id","subj":"T324","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A325","pred":"chebi_id","subj":"T325","obj":"http://purl.obolibrary.org/obo/CHEBI_28087"},{"id":"A326","pred":"chebi_id","subj":"T326","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A327","pred":"chebi_id","subj":"T327","obj":"http://purl.obolibrary.org/obo/CHEBI_33252"},{"id":"A328","pred":"chebi_id","subj":"T328","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A329","pred":"chebi_id","subj":"T329","obj":"http://purl.obolibrary.org/obo/CHEBI_16541"},{"id":"A330","pred":"chebi_id","subj":"T330","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A331","pred":"chebi_id","subj":"T331","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A332","pred":"chebi_id","subj":"T332","obj":"http://purl.obolibrary.org/obo/CHEBI_55460"},{"id":"A333","pred":"chebi_id","subj":"T332","obj":"http://purl.obolibrary.org/obo/CHEBI_74861"},{"id":"A334","pred":"chebi_id","subj":"T334","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A335","pred":"chebi_id","subj":"T335","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A336","pred":"chebi_id","subj":"T336","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A337","pred":"chebi_id","subj":"T337","obj":"http://purl.obolibrary.org/obo/CHEBI_74709"},{"id":"A338","pred":"chebi_id","subj":"T338","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A339","pred":"chebi_id","subj":"T339","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A340","pred":"chebi_id","subj":"T340","obj":"http://purl.obolibrary.org/obo/CHEBI_24870"},{"id":"A341","pred":"chebi_id","subj":"T341","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A342","pred":"chebi_id","subj":"T342","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A343","pred":"chebi_id","subj":"T343","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A344","pred":"chebi_id","subj":"T344","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A345","pred":"chebi_id","subj":"T345","obj":"http://purl.obolibrary.org/obo/CHEBI_55460"},{"id":"A346","pred":"chebi_id","subj":"T345","obj":"http://purl.obolibrary.org/obo/CHEBI_74861"},{"id":"A347","pred":"chebi_id","subj":"T347","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A348","pred":"chebi_id","subj":"T348","obj":"http://purl.obolibrary.org/obo/CHEBI_74709"},{"id":"A349","pred":"chebi_id","subj":"T349","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A350","pred":"chebi_id","subj":"T350","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A351","pred":"chebi_id","subj":"T351","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A352","pred":"chebi_id","subj":"T352","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A353","pred":"chebi_id","subj":"T353","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A354","pred":"chebi_id","subj":"T354","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A355","pred":"chebi_id","subj":"T355","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A356","pred":"chebi_id","subj":"T356","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A357","pred":"chebi_id","subj":"T357","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A358","pred":"chebi_id","subj":"T358","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A359","pred":"chebi_id","subj":"T359","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A360","pred":"chebi_id","subj":"T360","obj":"http://purl.obolibrary.org/obo/CHEBI_141442"},{"id":"A361","pred":"chebi_id","subj":"T361","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A362","pred":"chebi_id","subj":"T362","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A363","pred":"chebi_id","subj":"T363","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A364","pred":"chebi_id","subj":"T364","obj":"http://purl.obolibrary.org/obo/CHEBI_16541"},{"id":"A365","pred":"chebi_id","subj":"T365","obj":"http://purl.obolibrary.org/obo/CHEBI_33696"},{"id":"A366","pred":"chebi_id","subj":"T366","obj":"http://purl.obolibrary.org/obo/CHEBI_35962"},{"id":"A367","pred":"chebi_id","subj":"T366","obj":"http://purl.obolibrary.org/obo/CHEBI_38358"},{"id":"A368","pred":"chebi_id","subj":"T366","obj":"http://purl.obolibrary.org/obo/CHEBI_45373"},{"id":"A369","pred":"chebi_id","subj":"T366","obj":"http://purl.obolibrary.org/obo/CHEBI_74801"},{"id":"A370","pred":"chebi_id","subj":"T366","obj":"http://purl.obolibrary.org/obo/CHEBI_26667"},{"id":"A371","pred":"chebi_id","subj":"T371","obj":"http://purl.obolibrary.org/obo/CHEBI_40574"},{"id":"A372","pred":"chebi_id","subj":"T371","obj":"http://purl.obolibrary.org/obo/CHEBI_46887"},{"id":"A373","pred":"chebi_id","subj":"T373","obj":"http://purl.obolibrary.org/obo/CHEBI_35962"},{"id":"A374","pred":"chebi_id","subj":"T373","obj":"http://purl.obolibrary.org/obo/CHEBI_38358"},{"id":"A375","pred":"chebi_id","subj":"T373","obj":"http://purl.obolibrary.org/obo/CHEBI_45373"},{"id":"A376","pred":"chebi_id","subj":"T373","obj":"http://purl.obolibrary.org/obo/CHEBI_74801"},{"id":"A377","pred":"chebi_id","subj":"T373","obj":"http://purl.obolibrary.org/obo/CHEBI_26667"},{"id":"A378","pred":"chebi_id","subj":"T378","obj":"http://purl.obolibrary.org/obo/CHEBI_40574"},{"id":"A379","pred":"chebi_id","subj":"T378","obj":"http://purl.obolibrary.org/obo/CHEBI_46887"},{"id":"A380","pred":"chebi_id","subj":"T380","obj":"http://purl.obolibrary.org/obo/CHEBI_35962"},{"id":"A381","pred":"chebi_id","subj":"T380","obj":"http://purl.obolibrary.org/obo/CHEBI_38358"},{"id":"A382","pred":"chebi_id","subj":"T380","obj":"http://purl.obolibrary.org/obo/CHEBI_45373"},{"id":"A383","pred":"chebi_id","subj":"T380","obj":"http://purl.obolibrary.org/obo/CHEBI_74801"},{"id":"A384","pred":"chebi_id","subj":"T380","obj":"http://purl.obolibrary.org/obo/CHEBI_26667"},{"id":"A385","pred":"chebi_id","subj":"T385","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A386","pred":"chebi_id","subj":"T386","obj":"http://purl.obolibrary.org/obo/CHEBI_35962"},{"id":"A387","pred":"chebi_id","subj":"T386","obj":"http://purl.obolibrary.org/obo/CHEBI_38358"},{"id":"A388","pred":"chebi_id","subj":"T386","obj":"http://purl.obolibrary.org/obo/CHEBI_45373"},{"id":"A389","pred":"chebi_id","subj":"T386","obj":"http://purl.obolibrary.org/obo/CHEBI_74801"},{"id":"A390","pred":"chebi_id","subj":"T386","obj":"http://purl.obolibrary.org/obo/CHEBI_26667"},{"id":"A391","pred":"chebi_id","subj":"T391","obj":"http://purl.obolibrary.org/obo/CHEBI_35962"},{"id":"A392","pred":"chebi_id","subj":"T391","obj":"http://purl.obolibrary.org/obo/CHEBI_38358"},{"id":"A393","pred":"chebi_id","subj":"T391","obj":"http://purl.obolibrary.org/obo/CHEBI_45373"},{"id":"A394","pred":"chebi_id","subj":"T391","obj":"http://purl.obolibrary.org/obo/CHEBI_74801"},{"id":"A395","pred":"chebi_id","subj":"T391","obj":"http://purl.obolibrary.org/obo/CHEBI_26667"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}
LitCovid-PD-GO-BP
{"project":"LitCovid-PD-GO-BP","denotations":[{"id":"T9","span":{"begin":351,"end":364},"obj":"http://purl.obolibrary.org/obo/GO_0003968"},{"id":"T10","span":{"begin":351,"end":364},"obj":"http://purl.obolibrary.org/obo/GO_0003899"},{"id":"T11","span":{"begin":1735,"end":1746},"obj":"http://purl.obolibrary.org/obo/GO_0006810"},{"id":"T12","span":{"begin":1817,"end":1827},"obj":"http://purl.obolibrary.org/obo/GO_0006887"},{"id":"T13","span":{"begin":2045,"end":2058},"obj":"http://purl.obolibrary.org/obo/GO_0003968"},{"id":"T14","span":{"begin":2045,"end":2058},"obj":"http://purl.obolibrary.org/obo/GO_0003899"},{"id":"T15","span":{"begin":2138,"end":2151},"obj":"http://purl.obolibrary.org/obo/GO_0006351"},{"id":"T16","span":{"begin":2495,"end":2514},"obj":"http://purl.obolibrary.org/obo/GO_0006412"},{"id":"T17","span":{"begin":2533,"end":2556},"obj":"http://purl.obolibrary.org/obo/GO_0045087"},{"id":"T18","span":{"begin":2761,"end":2772},"obj":"http://purl.obolibrary.org/obo/GO_0016791"},{"id":"T19","span":{"begin":2862,"end":2876},"obj":"http://purl.obolibrary.org/obo/GO_0004843"},{"id":"T20","span":{"begin":5330,"end":5343},"obj":"http://purl.obolibrary.org/obo/GO_0003968"},{"id":"T21","span":{"begin":5330,"end":5343},"obj":"http://purl.obolibrary.org/obo/GO_0003899"},{"id":"T22","span":{"begin":5394,"end":5417},"obj":"http://purl.obolibrary.org/obo/GO_0045087"},{"id":"T23","span":{"begin":5472,"end":5485},"obj":"http://purl.obolibrary.org/obo/GO_0006351"},{"id":"T24","span":{"begin":5508,"end":5522},"obj":"http://purl.obolibrary.org/obo/GO_0034337"},{"id":"T25","span":{"begin":5659,"end":5672},"obj":"http://purl.obolibrary.org/obo/GO_0034337"},{"id":"T26","span":{"begin":5729,"end":5743},"obj":"http://purl.obolibrary.org/obo/GO_0016310"},{"id":"T27","span":{"begin":5946,"end":5959},"obj":"http://purl.obolibrary.org/obo/GO_0000398"},{"id":"T28","span":{"begin":5946,"end":5959},"obj":"http://purl.obolibrary.org/obo/GO_0000394"},{"id":"T29","span":{"begin":5946,"end":5959},"obj":"http://purl.obolibrary.org/obo/GO_0000374"},{"id":"T30","span":{"begin":5946,"end":5959},"obj":"http://purl.obolibrary.org/obo/GO_0000373"},{"id":"T31","span":{"begin":5946,"end":5959},"obj":"http://purl.obolibrary.org/obo/GO_0000372"},{"id":"T32","span":{"begin":5951,"end":5959},"obj":"http://purl.obolibrary.org/obo/GO_0045292"},{"id":"T33","span":{"begin":5990,"end":6001},"obj":"http://purl.obolibrary.org/obo/GO_0006412"},{"id":"T34","span":{"begin":6071,"end":6084},"obj":"http://purl.obolibrary.org/obo/GO_0006351"},{"id":"T35","span":{"begin":6113,"end":6125},"obj":"http://purl.obolibrary.org/obo/GO_0009058"},{"id":"T36","span":{"begin":6521,"end":6532},"obj":"http://purl.obolibrary.org/obo/GO_0022831"},{"id":"T37","span":{"begin":7007,"end":7021},"obj":"http://purl.obolibrary.org/obo/GO_0019068"},{"id":"T38","span":{"begin":7347,"end":7358},"obj":"http://purl.obolibrary.org/obo/GO_0006810"},{"id":"T39","span":{"begin":7367,"end":7377},"obj":"http://purl.obolibrary.org/obo/GO_0006887"},{"id":"T40","span":{"begin":7507,"end":7516},"obj":"http://purl.obolibrary.org/obo/GO_0051235"},{"id":"T41","span":{"begin":7634,"end":7648},"obj":"http://purl.obolibrary.org/obo/GO_0019068"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}
LitCovid-sentences
{"project":"LitCovid-sentences","denotations":[{"id":"T87","span":{"begin":0,"end":2},"obj":"Sentence"},{"id":"T88","span":{"begin":3,"end":47},"obj":"Sentence"},{"id":"T89","span":{"begin":48,"end":215},"obj":"Sentence"},{"id":"T90","span":{"begin":216,"end":390},"obj":"Sentence"},{"id":"T91","span":{"begin":391,"end":466},"obj":"Sentence"},{"id":"T92","span":{"begin":467,"end":561},"obj":"Sentence"},{"id":"T93","span":{"begin":562,"end":727},"obj":"Sentence"},{"id":"T94","span":{"begin":728,"end":734},"obj":"Sentence"},{"id":"T95","span":{"begin":735,"end":855},"obj":"Sentence"},{"id":"T96","span":{"begin":856,"end":948},"obj":"Sentence"},{"id":"T97","span":{"begin":949,"end":1041},"obj":"Sentence"},{"id":"T98","span":{"begin":1042,"end":1177},"obj":"Sentence"},{"id":"T99","span":{"begin":1178,"end":1327},"obj":"Sentence"},{"id":"T100","span":{"begin":1328,"end":1468},"obj":"Sentence"},{"id":"T101","span":{"begin":1469,"end":1630},"obj":"Sentence"},{"id":"T102","span":{"begin":1631,"end":1716},"obj":"Sentence"},{"id":"T103","span":{"begin":1717,"end":1828},"obj":"Sentence"},{"id":"T104","span":{"begin":1829,"end":1897},"obj":"Sentence"},{"id":"T105","span":{"begin":1898,"end":2073},"obj":"Sentence"},{"id":"T106","span":{"begin":2074,"end":2181},"obj":"Sentence"},{"id":"T107","span":{"begin":2182,"end":2272},"obj":"Sentence"},{"id":"T108","span":{"begin":2273,"end":2370},"obj":"Sentence"},{"id":"T109","span":{"begin":2371,"end":2427},"obj":"Sentence"},{"id":"T110","span":{"begin":2428,"end":2557},"obj":"Sentence"},{"id":"T111","span":{"begin":2558,"end":2613},"obj":"Sentence"},{"id":"T112","span":{"begin":2614,"end":2688},"obj":"Sentence"},{"id":"T113","span":{"begin":2689,"end":2817},"obj":"Sentence"},{"id":"T114","span":{"begin":2818,"end":2919},"obj":"Sentence"},{"id":"T115","span":{"begin":2920,"end":2988},"obj":"Sentence"},{"id":"T116","span":{"begin":2989,"end":3106},"obj":"Sentence"},{"id":"T117","span":{"begin":3107,"end":3249},"obj":"Sentence"},{"id":"T118","span":{"begin":3250,"end":3351},"obj":"Sentence"},{"id":"T119","span":{"begin":3352,"end":3415},"obj":"Sentence"},{"id":"T120","span":{"begin":3416,"end":3487},"obj":"Sentence"},{"id":"T121","span":{"begin":3488,"end":3535},"obj":"Sentence"},{"id":"T122","span":{"begin":3536,"end":3609},"obj":"Sentence"},{"id":"T123","span":{"begin":3610,"end":3717},"obj":"Sentence"},{"id":"T124","span":{"begin":3718,"end":3724},"obj":"Sentence"},{"id":"T125","span":{"begin":3725,"end":3849},"obj":"Sentence"},{"id":"T126","span":{"begin":3851,"end":3855},"obj":"Sentence"},{"id":"T127","span":{"begin":3856,"end":3892},"obj":"Sentence"},{"id":"T128","span":{"begin":3893,"end":3933},"obj":"Sentence"},{"id":"T129","span":{"begin":3934,"end":4083},"obj":"Sentence"},{"id":"T130","span":{"begin":4084,"end":4153},"obj":"Sentence"},{"id":"T131","span":{"begin":4154,"end":4230},"obj":"Sentence"},{"id":"T132","span":{"begin":4231,"end":4339},"obj":"Sentence"},{"id":"T133","span":{"begin":4340,"end":4429},"obj":"Sentence"},{"id":"T134","span":{"begin":4430,"end":4480},"obj":"Sentence"},{"id":"T135","span":{"begin":4481,"end":4679},"obj":"Sentence"},{"id":"T136","span":{"begin":4680,"end":4857},"obj":"Sentence"},{"id":"T137","span":{"begin":4858,"end":4976},"obj":"Sentence"},{"id":"T138","span":{"begin":4977,"end":5058},"obj":"Sentence"},{"id":"T139","span":{"begin":5059,"end":5111},"obj":"Sentence"},{"id":"T140","span":{"begin":5113,"end":5117},"obj":"Sentence"},{"id":"T141","span":{"begin":5118,"end":5142},"obj":"Sentence"},{"id":"T142","span":{"begin":5143,"end":5205},"obj":"Sentence"},{"id":"T143","span":{"begin":5206,"end":5225},"obj":"Sentence"},{"id":"T144","span":{"begin":5226,"end":5276},"obj":"Sentence"},{"id":"T145","span":{"begin":5277,"end":5358},"obj":"Sentence"},{"id":"T146","span":{"begin":5359,"end":5369},"obj":"Sentence"},{"id":"T147","span":{"begin":5370,"end":5427},"obj":"Sentence"},{"id":"T148","span":{"begin":5428,"end":5610},"obj":"Sentence"},{"id":"T149","span":{"begin":5611,"end":5681},"obj":"Sentence"},{"id":"T150","span":{"begin":5682,"end":5854},"obj":"Sentence"},{"id":"T151","span":{"begin":5855,"end":6131},"obj":"Sentence"},{"id":"T152","span":{"begin":6133,"end":6137},"obj":"Sentence"},{"id":"T153","span":{"begin":6138,"end":6158},"obj":"Sentence"},{"id":"T154","span":{"begin":6159,"end":6259},"obj":"Sentence"},{"id":"T155","span":{"begin":6260,"end":6342},"obj":"Sentence"},{"id":"T156","span":{"begin":6343,"end":6430},"obj":"Sentence"},{"id":"T157","span":{"begin":6431,"end":6505},"obj":"Sentence"},{"id":"T158","span":{"begin":6506,"end":6540},"obj":"Sentence"},{"id":"T159","span":{"begin":6541,"end":6680},"obj":"Sentence"},{"id":"T160","span":{"begin":6681,"end":6737},"obj":"Sentence"},{"id":"T161","span":{"begin":6739,"end":6743},"obj":"Sentence"},{"id":"T162","span":{"begin":6744,"end":6769},"obj":"Sentence"},{"id":"T163","span":{"begin":6770,"end":6868},"obj":"Sentence"},{"id":"T164","span":{"begin":6869,"end":6971},"obj":"Sentence"},{"id":"T165","span":{"begin":6972,"end":7086},"obj":"Sentence"},{"id":"T166","span":{"begin":7087,"end":7278},"obj":"Sentence"},{"id":"T167","span":{"begin":7279,"end":7404},"obj":"Sentence"},{"id":"T168","span":{"begin":7405,"end":7544},"obj":"Sentence"},{"id":"T169","span":{"begin":7545,"end":7744},"obj":"Sentence"},{"id":"T170","span":{"begin":7746,"end":7750},"obj":"Sentence"},{"id":"T171","span":{"begin":7751,"end":7794},"obj":"Sentence"},{"id":"T172","span":{"begin":7795,"end":7837},"obj":"Sentence"},{"id":"T173","span":{"begin":7838,"end":7966},"obj":"Sentence"},{"id":"T174","span":{"begin":7967,"end":8033},"obj":"Sentence"},{"id":"T175","span":{"begin":8034,"end":8138},"obj":"Sentence"},{"id":"T176","span":{"begin":8139,"end":8305},"obj":"Sentence"},{"id":"T177","span":{"begin":8306,"end":8391},"obj":"Sentence"},{"id":"T178","span":{"begin":8392,"end":8510},"obj":"Sentence"},{"id":"T179","span":{"begin":8511,"end":8551},"obj":"Sentence"},{"id":"T180","span":{"begin":8552,"end":8661},"obj":"Sentence"},{"id":"T181","span":{"begin":8662,"end":8737},"obj":"Sentence"},{"id":"T182","span":{"begin":8738,"end":8851},"obj":"Sentence"}],"namespaces":[{"prefix":"_base","uri":"http://pubannotation.org/ontology/tao.owl#"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}
2_test
{"project":"2_test","denotations":[{"id":"32604730-32290293-51943960","span":{"begin":1173,"end":1175},"obj":"32290293"},{"id":"32604730-32288441-51943961","span":{"begin":1464,"end":1466},"obj":"32288441"},{"id":"32604730-24121034-51943962","span":{"begin":4675,"end":4677},"obj":"24121034"},{"id":"32604730-15862300-51943963","span":{"begin":6127,"end":6129},"obj":"15862300"},{"id":"32604730-25726972-51943964","span":{"begin":6676,"end":6678},"obj":"25726972"},{"id":"32604730-19120977-51943965","span":{"begin":7082,"end":7084},"obj":"19120977"},{"id":"32604730-31226023-51943966","span":{"begin":7397,"end":7399},"obj":"31226023"},{"id":"32604730-23698585-51943967","span":{"begin":7400,"end":7402},"obj":"23698585"},{"id":"32604730-25926653-51943968","span":{"begin":8547,"end":8549},"obj":"25926653"},{"id":"32604730-19871229-51943969","span":{"begin":8657,"end":8659},"obj":"19871229"},{"id":"T87050","span":{"begin":1173,"end":1175},"obj":"32290293"},{"id":"T12608","span":{"begin":1464,"end":1466},"obj":"32288441"},{"id":"T51478","span":{"begin":4675,"end":4677},"obj":"24121034"},{"id":"T39967","span":{"begin":6127,"end":6129},"obj":"15862300"},{"id":"T99173","span":{"begin":6676,"end":6678},"obj":"25726972"},{"id":"T66447","span":{"begin":7082,"end":7084},"obj":"19120977"},{"id":"T81165","span":{"begin":7397,"end":7399},"obj":"31226023"},{"id":"T62246","span":{"begin":7400,"end":7402},"obj":"23698585"},{"id":"T97535","span":{"begin":8547,"end":8549},"obj":"25926653"},{"id":"T59612","span":{"begin":8657,"end":8659},"obj":"19871229"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}
LitCovid-PD-HP
{"project":"LitCovid-PD-HP","denotations":[{"id":"T5","span":{"begin":3791,"end":3799},"obj":"Phenotype"},{"id":"T6","span":{"begin":8267,"end":8273},"obj":"Phenotype"}],"attributes":[{"id":"A5","pred":"hp_id","subj":"T5","obj":"http://purl.obolibrary.org/obo/HP_0002861"},{"id":"A6","pred":"hp_id","subj":"T6","obj":"http://purl.obolibrary.org/obo/HP_0001903"}],"text":"3. Structure, Components and Life Cycle of CoVs\nCoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: 6M71).\nGenerally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis.\nThe RNA generates the replicase as two polyproteins, pp1a and pp1ab. The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. Nsp1 to Nsp16 are known to have multiple enzyme regions. For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. Nsp2 recognizes the specific protein called prohibitin. Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. Nsp4 is a TM scaffold protein for double-membrane vesicle structure. Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: 6VWW). Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition.\n\n3.1. Spike (S) Transmembrane Glycoprotein\nIn RNA viruses, the S glycoprotein (PDB: 6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). The S glycoprotein has a distinct spike structure. When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. The CTD of S1 recognizes protein receptors via RBDs.\n\n3.2. Nucleocapsid (N) Protein\nIn RNA viruses, the N protein recognizes the viral RNA genome. The N protein (PDB: 6M3M) binds to the RNA genome via the NTD and CTD. The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). NSP3 (PDB: 6VXS) of CoV blocks the innate immune responses of hosts. After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. Representatively, the N protein is known as the RNA chaperone protein. For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20].\n\n3.3. Envelope (E) Protein\nE protein is the most abundant structural protein needed to assemble virus particles in the cytosol. As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. The E protein bears an ectodomain in the NTD and an endodomain in the CTD. It has also an ion channel domain. E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. Most E proteins assemble and bud in new virus particles.\n\n3.4. Membrane Glycoprotein (M)\nThe M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles.\n\n3.5. Hemagglutinin-Esterase (HE) Dimeric Protein\nHE hemagglutinates and destroys receptors. As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. For example, enveloped RNA viruses evade the hosts via their RDEs. Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. The HE has acetylesterase activity [26]. In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. They caused hemagglutination through clumping by virus-borne hemagglutinin. These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus."}