Id |
Subject |
Object |
Predicate |
Lexical cue |
T87 |
0-2 |
Sentence |
denotes |
3. |
T88 |
3-47 |
Sentence |
denotes |
Structure, Components and Life Cycle of CoVs |
T89 |
48-215 |
Sentence |
denotes |
CoVs are 60–140 nm in size and are enveloped (+) ssRNA viruses, which feature an RNA genome, directly available to function as mRNA and thus result in rapid infection. |
T90 |
216-390 |
Sentence |
denotes |
CoVs exhibit RNA genomes of 28–32 kb, comprised of two large overlapping open reading frames (ORFs), which encode the virus replicase (transcriptase) and structural proteins. |
T91 |
391-466 |
Sentence |
denotes |
The SARS-CoV-2 genome is 29,891 bp in size, which encodes 9860 amino acids. |
T92 |
467-561 |
Sentence |
denotes |
The ssRNA are capped and tailed with a 5′-capping structure and 3′-poly A tail at the termini. |
T93 |
562-727 |
Sentence |
denotes |
The genome is the same sense as virus mRNA indicating that the viral RNA is translated through its own (+) RNA to synthesize RNA dependent RNA polymerase (RdRp; PDB: |
T94 |
728-734 |
Sentence |
denotes |
6M71). |
T95 |
735-855 |
Sentence |
denotes |
Generally, viral families are determined by the genome structure and virion morphologies of an envelope or naked capsid. |
T96 |
856-948 |
Sentence |
denotes |
A virus with a naked capsid has a coat of nucleocapsid protein (N) coating the viral genome. |
T97 |
949-1041 |
Sentence |
denotes |
Viruses with an envelope have lipid envelopes further surrounding the outmost protein layer. |
T98 |
1042-1177 |
Sentence |
denotes |
The 2019-nCoV (SARS-CoV-2) contains a spike (S) glycoprotein, E, dimeric HE enzyme, a membrane matrix glycoprotein (M), N and RNA [16]. |
T99 |
1178-1327 |
Sentence |
denotes |
The structural proteins are the S, N, M and E proteins, while the non-structural proteins are proteases such as Nsp3 and Nsp5 and RdRp such as Nsp12. |
T100 |
1328-1468 |
Sentence |
denotes |
Among the N, M and S glycoproteins, the S glycoprotein is a fusion protein that recognizes the host receptor and enters the host cells [17]. |
T101 |
1469-1630 |
Sentence |
denotes |
The S, M and E proteins anchored into the endoplasmic reticulum (ER) membrane are trafficked to the endoplasmic reticulum–Golgi intermediate compartment (ERGIC). |
T102 |
1631-1716 |
Sentence |
denotes |
The RNA genome linked with nucleoprotein buds into the ERGIC to form virus particles. |
T103 |
1717-1828 |
Sentence |
denotes |
Assembled virions transported to the vesicular surface are released to the extracellular milieu via exocytosis. |
T104 |
1829-1897 |
Sentence |
denotes |
The RNA generates the replicase as two polyproteins, pp1a and pp1ab. |
T105 |
1898-2073 |
Sentence |
denotes |
The replicase-encoded viral proteases generate up to 16 nonstructural proteins (Nsps) in the cytosol to produce replicase enzyme and the replicase–transcriptase complex (RTC). |
T106 |
2074-2181 |
Sentence |
denotes |
These enzymes including RTC synthesize RNAs for replication and transcription to generate viral RNA genome. |
T107 |
2182-2272 |
Sentence |
denotes |
CoV genomes bear two or three protease genes and the coding enzymes cleave the replicases. |
T108 |
2273-2370 |
Sentence |
denotes |
Together with the replicases, nonstructural proteins, termed Nsps, assemble into the RTC complex. |
T109 |
2371-2427 |
Sentence |
denotes |
Nsp1 to Nsp16 are known to have multiple enzyme regions. |
T110 |
2428-2557 |
Sentence |
denotes |
For example, Nsp1 degrades cellular mRNAs and, consequently blocks protein translation in host cells and innate immune responses. |
T111 |
2558-2613 |
Sentence |
denotes |
Nsp2 recognizes the specific protein called prohibitin. |
T112 |
2614-2688 |
Sentence |
denotes |
Nsp3 is a multi-domain transmembrane (TM) protein with diverse activities. |
T113 |
2689-2817 |
Sentence |
denotes |
Ubiquitin-like 1 and acidic domains bind to N protein and ADP-ribose-1′-phosphatase (ADRP) activity induces cytokine expression. |
T114 |
2818-2919 |
Sentence |
denotes |
The papain-like protease (PLpro)(PDB:6WX4)/ deubiquitinase domain cleaves virus-produced polyprotein. |
T115 |
2920-2988 |
Sentence |
denotes |
Nsp4 is a TM scaffold protein for double-membrane vesicle structure. |
T116 |
2989-3106 |
Sentence |
denotes |
Nsp5 has a main protease domain which also cleaves virus-produced polyprotein and Nsp6 acts as a TM scaffold protein. |
T117 |
3107-3249 |
Sentence |
denotes |
The Nsp7 and Nsp8 proteins form the Nsp7-Nsp8 hexadecameric complex, which functions as an RNA polymerase-specific clamp and a primase enzyme. |
T118 |
3250-3351 |
Sentence |
denotes |
Nsp9 is an RNA-binding protein that activates ExoN and 2-O-methyltrnasferase (MTase) enzyme activity. |
T119 |
3352-3415 |
Sentence |
denotes |
Nsp10 binds to Nsp16 and Nsp14 to form a heterodimeric complex. |
T120 |
3416-3487 |
Sentence |
denotes |
Nsp12 is the RdRp and Nsps13 is the RNA helicase and 5′ triphosphatase. |
T121 |
3488-3535 |
Sentence |
denotes |
Nsps14 is a N7 MTase and 3′-5′ exoribonuclease. |
T122 |
3536-3609 |
Sentence |
denotes |
ExoN of Nsap14 acts as an N7 MTase and attaches the 5′ cap to viral RNAs. |
T123 |
3610-3717 |
Sentence |
denotes |
Viral exoribonuclease enzyme proofreads the viral RNA genome, where Nsp15 is a viral endoribonuclease (PDB: |
T124 |
3718-3724 |
Sentence |
denotes |
6VWW). |
T125 |
3725-3849 |
Sentence |
denotes |
Nsp16 has 2-O-MTase enzyme activity, which shields viral RNA from melanoma differentiation associated protein-5 recognition. |
T126 |
3851-3855 |
Sentence |
denotes |
3.1. |
T127 |
3856-3892 |
Sentence |
denotes |
Spike (S) Transmembrane Glycoprotein |
T128 |
3893-3933 |
Sentence |
denotes |
In RNA viruses, the S glycoprotein (PDB: |
T129 |
3934-4083 |
Sentence |
denotes |
6VSB) is the biggest protein, heavily glycosylated and its N-terminal domain (NTD) sequence binds to the host receptor to enter the ER of host cells. |
T130 |
4084-4153 |
Sentence |
denotes |
SARS-CoV-2 S-glycoprotein bears 22 N-glycan sequons in each protomer. |
T131 |
4154-4230 |
Sentence |
denotes |
Therefore, the trimeric S glycoprotein surface is dominated by 66 N-glycans. |
T132 |
4231-4339 |
Sentence |
denotes |
The S glycoprotein mediates direct and indirect interaction of virus with host cells in the infection cycle. |
T133 |
4340-4429 |
Sentence |
denotes |
All CoVs exhibit a surface S glycoprotein, which bears the receptor-binding domain (RBD). |
T134 |
4430-4480 |
Sentence |
denotes |
The S glycoprotein has a distinct spike structure. |
T135 |
4481-4679 |
Sentence |
denotes |
When S glycoprotein binds to its host receptor, a host furin-like protease cleaves the S glycoprotein, which liberates the spike fusion peptides, allowing entry of the virus into the host cell [18]. |
T136 |
4680-4857 |
Sentence |
denotes |
The furin-like protease-generated S1 and S2 exist as a S1/S2 complex, where S1 in a homotrimeric form interacts with the host cell membrane and S2 penetrates the cytosolic area. |
T137 |
4858-4976 |
Sentence |
denotes |
For SARS-CoV and MERS-CoV, the S1 C-terminal domains (CTDs) have a dual role in virus entry via attachment and fusion. |
T138 |
4977-5058 |
Sentence |
denotes |
The S1 NTD binds to carbohydrate receptors because the S1 domains act as the RBD. |
T139 |
5059-5111 |
Sentence |
denotes |
The CTD of S1 recognizes protein receptors via RBDs. |
T140 |
5113-5117 |
Sentence |
denotes |
3.2. |
T141 |
5118-5142 |
Sentence |
denotes |
Nucleocapsid (N) Protein |
T142 |
5143-5205 |
Sentence |
denotes |
In RNA viruses, the N protein recognizes the viral RNA genome. |
T143 |
5206-5225 |
Sentence |
denotes |
The N protein (PDB: |
T144 |
5226-5276 |
Sentence |
denotes |
6M3M) binds to the RNA genome via the NTD and CTD. |
T145 |
5277-5358 |
Sentence |
denotes |
The N protein tethers to the viral RNA and replicase–transcriptase complex (RTC). |
T146 |
5359-5369 |
Sentence |
denotes |
NSP3 (PDB: |
T147 |
5370-5427 |
Sentence |
denotes |
6VXS) of CoV blocks the innate immune responses of hosts. |
T148 |
5428-5610 |
Sentence |
denotes |
After entrance into the host cells, for CoV transcription and particle release, RNA chaperones such as nonspecific nucleic acid binding proteins potentiate ssRNA conformation shifts. |
T149 |
5611-5681 |
Sentence |
denotes |
Representatively, the N protein is known as the RNA chaperone protein. |
T150 |
5682-5854 |
Sentence |
denotes |
For example, glycogen synthase kinase 3 (GSK3) phosphorylates the SARS-CoV N-protein and thus, GSK3 inhibition contributes to reduced replication activity of SARS-CoV [19]. |
T151 |
5855-6131 |
Sentence |
denotes |
In addition, heterogeneous nuclear ribonucleoprotein A1 (hnRNP A1) regulates the preformed mRNA splicing in the nucleus and continuous translation. hnRNPA1 interacts with SARS-CoV N protein to form a replication and transcription complex during ssRNA genome biosynthesis [20]. |
T152 |
6133-6137 |
Sentence |
denotes |
3.3. |
T153 |
6138-6158 |
Sentence |
denotes |
Envelope (E) Protein |
T154 |
6159-6259 |
Sentence |
denotes |
E protein is the most abundant structural protein needed to assemble virus particles in the cytosol. |
T155 |
6260-6342 |
Sentence |
denotes |
As a TM protein, E protein is the smallest structural protein with a MW of 12 kDa. |
T156 |
6343-6430 |
Sentence |
denotes |
The E protein has an NTD in the extracellular region and a CTD in the cytosolic region. |
T157 |
6431-6505 |
Sentence |
denotes |
The E protein bears an ectodomain in the NTD and an endodomain in the CTD. |
T158 |
6506-6540 |
Sentence |
denotes |
It has also an ion channel domain. |
T159 |
6541-6680 |
Sentence |
denotes |
E protein is present in the cytosolic region of infected cells and only a limited amount is incorporated into the envelope of virions [21]. |
T160 |
6681-6737 |
Sentence |
denotes |
Most E proteins assemble and bud in new virus particles. |
T161 |
6739-6743 |
Sentence |
denotes |
3.4. |
T162 |
6744-6769 |
Sentence |
denotes |
Membrane Glycoprotein (M) |
T163 |
6770-6868 |
Sentence |
denotes |
The M protein contains three TM domains and is an abundant structural protein with a MW of 30 kDa. |
T164 |
6869-6971 |
Sentence |
denotes |
It consists of a small glycosylated NTD in the extracellular region and CTD in the cytoplasmic region. |
T165 |
6972-7086 |
Sentence |
denotes |
The M protein forms a scaffold for virus assembly in the cytosol via binding to S glycoprotein and N protein [22]. |
T166 |
7087-7278 |
Sentence |
denotes |
For example, E protein and N protein are co-expressed with M protein to form virus-like particles (VLPs) that are released from the cells, as the M and E protein are involved in CoV assembly. |
T167 |
7279-7404 |
Sentence |
denotes |
Then, CoVs bud into the ERGIC, trafficking by membrane vesicles and transported via the exocytosis-secretory pathway [23,24]. |
T168 |
7405-7544 |
Sentence |
denotes |
The dimeric M protein binds to the nucleocapsid.-M protein binds to S glycoprotein for S glycoprotein retention in the ERGIC/Golgi complex. |
T169 |
7545-7744 |
Sentence |
denotes |
The M-N protein complex keeps the N protein–RNA complex stable, for nucleocapsid and the viral assembly.-M and E proteins constitute the virus envelope for successful release of virus-like particles. |
T170 |
7746-7750 |
Sentence |
denotes |
3.5. |
T171 |
7751-7794 |
Sentence |
denotes |
Hemagglutinin-Esterase (HE) Dimeric Protein |
T172 |
7795-7837 |
Sentence |
denotes |
HE hemagglutinates and destroys receptors. |
T173 |
7838-7966 |
Sentence |
denotes |
As RNA viruses, CoVs bear RDEs, which are used in effective attachment to hosts and also reversely in detachment from the hosts. |
T174 |
7967-8033 |
Sentence |
denotes |
For example, enveloped RNA viruses evade the hosts via their RDEs. |
T175 |
8034-8138 |
Sentence |
denotes |
Currently, RDE-related functional enzymes such as neuraminidase (NA) and SA-O-acetyl-esterase are known. |
T176 |
8139-8305 |
Sentence |
denotes |
SA-O-acetyl-esterase was originally identified in influenza C virus and in nidoviruses (CoV and torovirus) as well as in salmon anemia virus (teleost orthomyxovirus). |
T177 |
8306-8391 |
Sentence |
denotes |
The origin and evolution of CoV SA-O-acetylesterases are correlated to other viruses. |
T178 |
8392-8510 |
Sentence |
denotes |
The fusion event of S glycoprotein and HE is specific for HCoV attachment to SA-associated receptors in the host [25]. |
T179 |
8511-8551 |
Sentence |
denotes |
The HE has acetylesterase activity [26]. |
T180 |
8552-8661 |
Sentence |
denotes |
In early SA-related biology, influenza A/B viruses were found to recognize chicken erythrocytes in 1942 [27]. |
T181 |
8662-8737 |
Sentence |
denotes |
They caused hemagglutination through clumping by virus-borne hemagglutinin. |
T182 |
8738-8851 |
Sentence |
denotes |
These phenomena were widely found in influenza viruses, paramyxoviruses, Newcastle disease (NDV) and mumps virus. |