PMC:7258756 / 6237-13345 JSONTXT

Annnotations TAB JSON ListView MergeView

    LitCovid-PubTator

    {"project":"LitCovid-PubTator","denotations":[{"id":"127","span":{"begin":241,"end":251},"obj":"Species"},{"id":"131","span":{"begin":266,"end":276},"obj":"Species"},{"id":"132","span":{"begin":858,"end":868},"obj":"Species"},{"id":"133","span":{"begin":418,"end":438},"obj":"Disease"},{"id":"137","span":{"begin":1211,"end":1215},"obj":"Gene"},{"id":"138","span":{"begin":1221,"end":1225},"obj":"Gene"},{"id":"139","span":{"begin":1120,"end":1130},"obj":"Species"},{"id":"145","span":{"begin":2425,"end":2428},"obj":"Gene"},{"id":"146","span":{"begin":2388,"end":2391},"obj":"Gene"},{"id":"147","span":{"begin":1514,"end":1524},"obj":"Species"},{"id":"148","span":{"begin":1687,"end":1697},"obj":"Species"},{"id":"149","span":{"begin":2518,"end":2528},"obj":"Species"},{"id":"155","span":{"begin":3397,"end":3401},"obj":"Gene"},{"id":"156","span":{"begin":2562,"end":2572},"obj":"Species"},{"id":"157","span":{"begin":3278,"end":3288},"obj":"Species"},{"id":"158","span":{"begin":3365,"end":3375},"obj":"Species"},{"id":"159","span":{"begin":3045,"end":3047},"obj":"Chemical"},{"id":"164","span":{"begin":3535,"end":3540},"obj":"Gene"},{"id":"165","span":{"begin":3440,"end":3445},"obj":"Gene"},{"id":"166","span":{"begin":3524,"end":3534},"obj":"Species"},{"id":"167","span":{"begin":3985,"end":3995},"obj":"Species"},{"id":"174","span":{"begin":6566,"end":6569},"obj":"Gene"},{"id":"175","span":{"begin":5993,"end":5998},"obj":"Gene"},{"id":"176","span":{"begin":4504,"end":4509},"obj":"Gene"},{"id":"177","span":{"begin":4493,"end":4503},"obj":"Species"},{"id":"178","span":{"begin":6486,"end":6491},"obj":"Species"},{"id":"179","span":{"begin":6427,"end":6432},"obj":"Species"},{"id":"182","span":{"begin":6988,"end":7004},"obj":"Gene"},{"id":"183","span":{"begin":7006,"end":7011},"obj":"Gene"}],"attributes":[{"id":"A127","pred":"tao:has_database_id","subj":"127","obj":"Tax:2697049"},{"id":"A131","pred":"tao:has_database_id","subj":"131","obj":"Tax:2697049"},{"id":"A132","pred":"tao:has_database_id","subj":"132","obj":"Tax:2697049"},{"id":"A133","pred":"tao:has_database_id","subj":"133","obj":"MESH:C000657245"},{"id":"A137","pred":"tao:has_database_id","subj":"137","obj":"Gene:43740578"},{"id":"A138","pred":"tao:has_database_id","subj":"138","obj":"Gene:43740578"},{"id":"A139","pred":"tao:has_database_id","subj":"139","obj":"Tax:2697049"},{"id":"A145","pred":"tao:has_database_id","subj":"145","obj":"Gene:1178"},{"id":"A146","pred":"tao:has_database_id","subj":"146","obj":"Gene:1178"},{"id":"A147","pred":"tao:has_database_id","subj":"147","obj":"Tax:2697049"},{"id":"A148","pred":"tao:has_database_id","subj":"148","obj":"Tax:2697049"},{"id":"A149","pred":"tao:has_database_id","subj":"149","obj":"Tax:2697049"},{"id":"A155","pred":"tao:has_database_id","subj":"155","obj":"Gene:6195"},{"id":"A156","pred":"tao:has_database_id","subj":"156","obj":"Tax:2697049"},{"id":"A157","pred":"tao:has_database_id","subj":"157","obj":"Tax:2697049"},{"id":"A158","pred":"tao:has_database_id","subj":"158","obj":"Tax:2697049"},{"id":"A159","pred":"tao:has_database_id","subj":"159","obj":"MESH:D009711"},{"id":"A164","pred":"tao:has_database_id","subj":"164","obj":"Gene:43740568"},{"id":"A165","pred":"tao:has_database_id","subj":"165","obj":"Gene:43740568"},{"id":"A166","pred":"tao:has_database_id","subj":"166","obj":"Tax:2697049"},{"id":"A167","pred":"tao:has_database_id","subj":"167","obj":"Tax:2697049"},{"id":"A174","pred":"tao:has_database_id","subj":"174","obj":"Gene:8615"},{"id":"A175","pred":"tao:has_database_id","subj":"175","obj":"Gene:43740568"},{"id":"A176","pred":"tao:has_database_id","subj":"176","obj":"Gene:43740568"},{"id":"A177","pred":"tao:has_database_id","subj":"177","obj":"Tax:2697049"},{"id":"A178","pred":"tao:has_database_id","subj":"178","obj":"Tax:9606"},{"id":"A179","pred":"tao:has_database_id","subj":"179","obj":"Tax:9606"},{"id":"A182","pred":"tao:has_database_id","subj":"182","obj":"Gene:3458"},{"id":"A183","pred":"tao:has_database_id","subj":"183","obj":"Gene:3458"}],"namespaces":[{"prefix":"Tax","uri":"https://www.ncbi.nlm.nih.gov/taxonomy/"},{"prefix":"MESH","uri":"https://id.nlm.nih.gov/mesh/"},{"prefix":"Gene","uri":"https://www.ncbi.nlm.nih.gov/gene/"},{"prefix":"CVCL","uri":"https://web.expasy.org/cellosaurus/CVCL_"}],"text":"Material \u0026 Methods\nThe clinical samples were referred by the hospital authorities through the Kerala State Health Services for diagnostic purposes. Further samples were received from different parts of India for establishing the presence of SARS-CoV-2.\nDetection of SARS-CoV-2 in suspected samples: Blood and throat swab (TS) specimens were collected from the suspected cases that complied with the case definition of SARS-CoV-2 infection as per the guidelines of the Ministry of Health and Family Welfare14. The TS was collected in viral transport medium. These samples were referred to the ICMR-NIV, Pune, India (which is the national reference laboratory for India, also referred as the government's apex laboratory). As of February 29, 2020, 881 samples of suspected cases referred from different States, with a travel history to Wuhan, China, and other SARS-CoV-2-affected countries, were screened.\nThe viral RNA was extracted from the TS sample using the Magmax RNA extraction kit (Applied Biosystems, USA) as per the manufacturer's instructions. The extracted RNA was immediately used for testing the presence of SARS-CoV-2 using the real-time RT-PCR protocol published by the WHO12 for the detection of RdRp (1), RdRp (2), E gene and N gene. RNase P gene was used as the internal control for the analysis. Confirmatory laboratory tests were performed as per the WHO-recommended test protocols13. These samples were also sequenced using the NGS approach to retrieve the complete genome of the virus.\nNGS of SARS-CoV-2 from India - Phylogenetic analysis and molecular characterization: The total RNA of three positive TS specimens from Kerala, was extracted from 250-300 μl of the SARS-CoV-2 real-time RT-PCR positive samples. QIAamp Viral RNA extraction kit (QIAGEN, Hilden, Germany) was used according to the manufacturer's instructions. The extracted RNA was further quantified using a Qubit RNA High-Sensitivity kit (Invitrogen, USA). RNA libraries were prepared as per the earlier-defined protocol and quantified using KAPA Library Quantification Kit (Kapa Biosystems, Roche Diagnostics Corporation, USA) as per the manufacturer's protocol. Further, individual libraries were neutralized and loaded on the Miniseq platform (Illumina, USA). The detailed protocols for the steps undertaken have been published earlier1516. The data generated from the machine were analyzed using CLC genomics workbench version 11.0 (CLC, QIAGEN, Germany). Reference-based mapping was performed to retrieve the sequence of the SARS-CoV-2.\nFull-length genome sequences of SARS-CoV-2 were downloaded from the GISAID database17 (Supplementary Table I (available from http://www.ijmr.org.in/articles/2020/151/2/images/IndianJMedRes_2020_151_2_200_281471_sm5.pdf)). Multiple sequence alignment was performed using the MEGA software version 7.018 with retrieved sequences from two of the three positive cases and the available GISAID sequences. A phylogenetic tree was generated using the neighbour joining method and the Kimura-2-parameter as the nucleotide (nt) substitution model with 1000 bootstrap replications as implemented in MEGA software18. Per cent nucleotide divergence and amino acid (aa) divergence were calculated using the p-distance method18. Mutations specific to the Indian SARS-CoV-2 viruses were identified by comparing the coding regions with respect to the SARS-CoV-2, Wuhan, China (Wuhan hu-1).\nThree-dimensional (3D) model of the spike protein and epitope prediction: The pre-fusion structure of the Indian case 1 SARS-CoV-2 spike (S) glycoprotein was modelled using the Swiss-Model server (https://swissmodel.expasy.org/interactive) and the corresponding S protein of Wuhan-Hu-1 (6VSB.PDB) as the template (99.97% identity). Sequential (linear) B-cell epitopes were predicted using BepiPred-2.0 server (http://www.cbs.dtu.dk/services/BepiPred/). The ABCpred prediction tool (http://crdd.osdd.net/raghava/abcpred/) was also used to identify the B-cell epitopes in the Indian SARS-CoV-2 sequence. The epitope prediction probability of \u003e0.8 was set to increase the specificity of the peptide stretch. The overlapping epitopes predicted by BepiPred-2.0 online server and the ABCpred prediction tool were identified. The antigenicity of the shortlisted peptide sequences was further predicted using the Vaxijen online server (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) with a default threshold of 0.4.\nDiscontinuous epitopes on the modelled structure of the Indian case 1 SARS-CoV-2 spike protein were predicted using the online servers, Ellipro (http://tools.iedb.org/ellipro/) and DiscoTope 2.0 (http://tools.iedb.org/discotope/), integrated in the Immune Epitope Database. Ellipro predicts epitopes based on the protusion index (PI), wherein the protein shape is approximated as an ellipsoid (Ref for Ellipro and DiscoTope). An ellipsoid with the PI value of 0.8 indicates that 80 per cent of the residues are within the ellipsoid and 20 per cent are outside. All residues that are outside the 80 per cent ellipsoid will have a score of 0.8. Residues with larger scores are associated with greater solvent accessibility. The PI value was set to a score of 0.8. DiscoTope predicts epitopes using 3D structure and half-sphere exposure as a surface measure in a novel spatial neighbourhood definition method. Default values were set for sensitivity (0.47) and specificity (0.75) for selecting the amino acids forming discontinuous epitopes. A sensitivity of 0.47 means that 47 per cent of the epitope residues are predicted as part of the epitopes, while a specificity of 0.75 means that 25 per cent of the non-epitope residues are predicted as part of the epitopes. Outputs from both the methods were combined, and the final regions were mapped on the modelled 3D-structure as the most probable conformational epitopes. In addition, we also predicted N-linked glycosylation sites in the S protein using NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/). The spike proteins were also screened for the presence of potential epitopes presented by major histocompatibility complex (MHC) class I molecules to cytotoxic T lymphocytes (CTLs). The online NetCTL1.2 server (http://www.cbs.dtu.dk/services/NetCTL/) based on machine learning techniques such as artificial neural network (ANN) and support vector machine (SVM) was used to predict the T-cell epitopes. The prediction was made for all the human leucocyte antigen (HLA) supertypes and the available human alleles. The C terminal cleavage, weight of transport-associated protein (TAP) efficiency and threshold for identification were kept as default. VaxiJen v2.0 tool was used to predict the antigenicity of the predicted epitopes (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html). The sequences were further screened to be potential epitopes using the CTLPred online server (http://crdd.osdd.net/raghava/ctlpred/).\nThe ability of the predicted linear B-cell and the T-cell epitopes to mount interferon-gamma (IFN-γ) response was assessed using the IFNepitope (http://crdd.osdd.net/raghava/ifnepitope/index.php)."}

    LitCovid-PD-FMA-UBERON

    {"project":"LitCovid-PD-FMA-UBERON","denotations":[{"id":"T24","span":{"begin":299,"end":304},"obj":"Body_part"},{"id":"T25","span":{"begin":309,"end":315},"obj":"Body_part"},{"id":"T26","span":{"begin":914,"end":917},"obj":"Body_part"},{"id":"T27","span":{"begin":968,"end":971},"obj":"Body_part"},{"id":"T28","span":{"begin":1067,"end":1070},"obj":"Body_part"},{"id":"T29","span":{"begin":1233,"end":1237},"obj":"Body_part"},{"id":"T30","span":{"begin":1244,"end":1248},"obj":"Body_part"},{"id":"T31","span":{"begin":1258,"end":1262},"obj":"Body_part"},{"id":"T32","span":{"begin":1486,"end":1492},"obj":"Body_part"},{"id":"T33","span":{"begin":1602,"end":1605},"obj":"Body_part"},{"id":"T34","span":{"begin":1746,"end":1749},"obj":"Body_part"},{"id":"T35","span":{"begin":1860,"end":1863},"obj":"Body_part"},{"id":"T36","span":{"begin":1901,"end":1904},"obj":"Body_part"},{"id":"T37","span":{"begin":1945,"end":1948},"obj":"Body_part"},{"id":"T38","span":{"begin":2542,"end":2548},"obj":"Body_part"},{"id":"T39","span":{"begin":3033,"end":3043},"obj":"Body_part"},{"id":"T40","span":{"begin":3045,"end":3047},"obj":"Body_part"},{"id":"T41","span":{"begin":3145,"end":3155},"obj":"Body_part"},{"id":"T42","span":{"begin":3171,"end":3181},"obj":"Body_part"},{"id":"T43","span":{"begin":3183,"end":3185},"obj":"Body_part"},{"id":"T44","span":{"begin":3446,"end":3453},"obj":"Body_part"},{"id":"T45","span":{"begin":3545,"end":3557},"obj":"Body_part"},{"id":"T46","span":{"begin":3668,"end":3675},"obj":"Body_part"},{"id":"T47","span":{"begin":3758,"end":3762},"obj":"Body_part"},{"id":"T48","span":{"begin":3957,"end":3961},"obj":"Body_part"},{"id":"T49","span":{"begin":4510,"end":4517},"obj":"Body_part"},{"id":"T50","span":{"begin":4770,"end":4777},"obj":"Body_part"},{"id":"T51","span":{"begin":5418,"end":5429},"obj":"Body_part"},{"id":"T52","span":{"begin":5911,"end":5918},"obj":"Body_part"},{"id":"T53","span":{"begin":5999,"end":6007},"obj":"Body_part"},{"id":"T54","span":{"begin":6079,"end":6111},"obj":"Body_part"},{"id":"T55","span":{"begin":6113,"end":6116},"obj":"Body_part"},{"id":"T56","span":{"begin":6139,"end":6162},"obj":"Body_part"},{"id":"T57","span":{"begin":6296,"end":6310},"obj":"Body_part"},{"id":"T58","span":{"begin":6376,"end":6380},"obj":"Body_part"},{"id":"T59","span":{"begin":6433,"end":6442},"obj":"Body_part"},{"id":"T60","span":{"begin":6452,"end":6455},"obj":"Body_part"},{"id":"T61","span":{"begin":6557,"end":6564},"obj":"Body_part"},{"id":"T62","span":{"begin":6950,"end":6954},"obj":"Body_part"},{"id":"T63","span":{"begin":6965,"end":6969},"obj":"Body_part"}],"attributes":[{"id":"A24","pred":"fma_id","subj":"T24","obj":"http://purl.org/sig/ont/fma/fma9670"},{"id":"A25","pred":"fma_id","subj":"T25","obj":"http://purl.org/sig/ont/fma/fma228738"},{"id":"A26","pred":"fma_id","subj":"T26","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A27","pred":"fma_id","subj":"T27","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A28","pred":"fma_id","subj":"T28","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A29","pred":"fma_id","subj":"T29","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A30","pred":"fma_id","subj":"T30","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A31","pred":"fma_id","subj":"T31","obj":"http://purl.org/sig/ont/fma/fma74402"},{"id":"A32","pred":"fma_id","subj":"T32","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A33","pred":"fma_id","subj":"T33","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A34","pred":"fma_id","subj":"T34","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A35","pred":"fma_id","subj":"T35","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A36","pred":"fma_id","subj":"T36","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A37","pred":"fma_id","subj":"T37","obj":"http://purl.org/sig/ont/fma/fma67095"},{"id":"A38","pred":"fma_id","subj":"T38","obj":"http://purl.org/sig/ont/fma/fma84116"},{"id":"A39","pred":"fma_id","subj":"T39","obj":"http://purl.org/sig/ont/fma/fma82740"},{"id":"A40","pred":"fma_id","subj":"T40","obj":"http://purl.org/sig/ont/fma/fma82740"},{"id":"A41","pred":"fma_id","subj":"T41","obj":"http://purl.org/sig/ont/fma/fma82740"},{"id":"A42","pred":"fma_id","subj":"T42","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A43","pred":"fma_id","subj":"T43","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A44","pred":"fma_id","subj":"T44","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A45","pred":"fma_id","subj":"T45","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A46","pred":"fma_id","subj":"T46","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A47","pred":"fma_id","subj":"T47","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A48","pred":"fma_id","subj":"T48","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A49","pred":"fma_id","subj":"T49","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A50","pred":"fma_id","subj":"T50","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A51","pred":"fma_id","subj":"T51","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A52","pred":"fma_id","subj":"T52","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A53","pred":"fma_id","subj":"T53","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A54","pred":"fma_id","subj":"T54","obj":"http://purl.org/sig/ont/fma/fma84079"},{"id":"A55","pred":"fma_id","subj":"T55","obj":"http://purl.org/sig/ont/fma/fma84079"},{"id":"A56","pred":"fma_id","subj":"T56","obj":"http://purl.org/sig/ont/fma/fma70573"},{"id":"A57","pred":"fma_id","subj":"T57","obj":"http://purl.org/sig/ont/fma/fma74616"},{"id":"A58","pred":"fma_id","subj":"T58","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A59","pred":"fma_id","subj":"T59","obj":"http://purl.org/sig/ont/fma/fma62852"},{"id":"A60","pred":"fma_id","subj":"T60","obj":"http://purl.org/sig/ont/fma/fma84795"},{"id":"A61","pred":"fma_id","subj":"T61","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A62","pred":"fma_id","subj":"T62","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A63","pred":"fma_id","subj":"T63","obj":"http://purl.org/sig/ont/fma/fma68646"}],"text":"Material \u0026 Methods\nThe clinical samples were referred by the hospital authorities through the Kerala State Health Services for diagnostic purposes. Further samples were received from different parts of India for establishing the presence of SARS-CoV-2.\nDetection of SARS-CoV-2 in suspected samples: Blood and throat swab (TS) specimens were collected from the suspected cases that complied with the case definition of SARS-CoV-2 infection as per the guidelines of the Ministry of Health and Family Welfare14. The TS was collected in viral transport medium. These samples were referred to the ICMR-NIV, Pune, India (which is the national reference laboratory for India, also referred as the government's apex laboratory). As of February 29, 2020, 881 samples of suspected cases referred from different States, with a travel history to Wuhan, China, and other SARS-CoV-2-affected countries, were screened.\nThe viral RNA was extracted from the TS sample using the Magmax RNA extraction kit (Applied Biosystems, USA) as per the manufacturer's instructions. The extracted RNA was immediately used for testing the presence of SARS-CoV-2 using the real-time RT-PCR protocol published by the WHO12 for the detection of RdRp (1), RdRp (2), E gene and N gene. RNase P gene was used as the internal control for the analysis. Confirmatory laboratory tests were performed as per the WHO-recommended test protocols13. These samples were also sequenced using the NGS approach to retrieve the complete genome of the virus.\nNGS of SARS-CoV-2 from India - Phylogenetic analysis and molecular characterization: The total RNA of three positive TS specimens from Kerala, was extracted from 250-300 μl of the SARS-CoV-2 real-time RT-PCR positive samples. QIAamp Viral RNA extraction kit (QIAGEN, Hilden, Germany) was used according to the manufacturer's instructions. The extracted RNA was further quantified using a Qubit RNA High-Sensitivity kit (Invitrogen, USA). RNA libraries were prepared as per the earlier-defined protocol and quantified using KAPA Library Quantification Kit (Kapa Biosystems, Roche Diagnostics Corporation, USA) as per the manufacturer's protocol. Further, individual libraries were neutralized and loaded on the Miniseq platform (Illumina, USA). The detailed protocols for the steps undertaken have been published earlier1516. The data generated from the machine were analyzed using CLC genomics workbench version 11.0 (CLC, QIAGEN, Germany). Reference-based mapping was performed to retrieve the sequence of the SARS-CoV-2.\nFull-length genome sequences of SARS-CoV-2 were downloaded from the GISAID database17 (Supplementary Table I (available from http://www.ijmr.org.in/articles/2020/151/2/images/IndianJMedRes_2020_151_2_200_281471_sm5.pdf)). Multiple sequence alignment was performed using the MEGA software version 7.018 with retrieved sequences from two of the three positive cases and the available GISAID sequences. A phylogenetic tree was generated using the neighbour joining method and the Kimura-2-parameter as the nucleotide (nt) substitution model with 1000 bootstrap replications as implemented in MEGA software18. Per cent nucleotide divergence and amino acid (aa) divergence were calculated using the p-distance method18. Mutations specific to the Indian SARS-CoV-2 viruses were identified by comparing the coding regions with respect to the SARS-CoV-2, Wuhan, China (Wuhan hu-1).\nThree-dimensional (3D) model of the spike protein and epitope prediction: The pre-fusion structure of the Indian case 1 SARS-CoV-2 spike (S) glycoprotein was modelled using the Swiss-Model server (https://swissmodel.expasy.org/interactive) and the corresponding S protein of Wuhan-Hu-1 (6VSB.PDB) as the template (99.97% identity). Sequential (linear) B-cell epitopes were predicted using BepiPred-2.0 server (http://www.cbs.dtu.dk/services/BepiPred/). The ABCpred prediction tool (http://crdd.osdd.net/raghava/abcpred/) was also used to identify the B-cell epitopes in the Indian SARS-CoV-2 sequence. The epitope prediction probability of \u003e0.8 was set to increase the specificity of the peptide stretch. The overlapping epitopes predicted by BepiPred-2.0 online server and the ABCpred prediction tool were identified. The antigenicity of the shortlisted peptide sequences was further predicted using the Vaxijen online server (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) with a default threshold of 0.4.\nDiscontinuous epitopes on the modelled structure of the Indian case 1 SARS-CoV-2 spike protein were predicted using the online servers, Ellipro (http://tools.iedb.org/ellipro/) and DiscoTope 2.0 (http://tools.iedb.org/discotope/), integrated in the Immune Epitope Database. Ellipro predicts epitopes based on the protusion index (PI), wherein the protein shape is approximated as an ellipsoid (Ref for Ellipro and DiscoTope). An ellipsoid with the PI value of 0.8 indicates that 80 per cent of the residues are within the ellipsoid and 20 per cent are outside. All residues that are outside the 80 per cent ellipsoid will have a score of 0.8. Residues with larger scores are associated with greater solvent accessibility. The PI value was set to a score of 0.8. DiscoTope predicts epitopes using 3D structure and half-sphere exposure as a surface measure in a novel spatial neighbourhood definition method. Default values were set for sensitivity (0.47) and specificity (0.75) for selecting the amino acids forming discontinuous epitopes. A sensitivity of 0.47 means that 47 per cent of the epitope residues are predicted as part of the epitopes, while a specificity of 0.75 means that 25 per cent of the non-epitope residues are predicted as part of the epitopes. Outputs from both the methods were combined, and the final regions were mapped on the modelled 3D-structure as the most probable conformational epitopes. In addition, we also predicted N-linked glycosylation sites in the S protein using NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/). The spike proteins were also screened for the presence of potential epitopes presented by major histocompatibility complex (MHC) class I molecules to cytotoxic T lymphocytes (CTLs). The online NetCTL1.2 server (http://www.cbs.dtu.dk/services/NetCTL/) based on machine learning techniques such as artificial neural network (ANN) and support vector machine (SVM) was used to predict the T-cell epitopes. The prediction was made for all the human leucocyte antigen (HLA) supertypes and the available human alleles. The C terminal cleavage, weight of transport-associated protein (TAP) efficiency and threshold for identification were kept as default. VaxiJen v2.0 tool was used to predict the antigenicity of the predicted epitopes (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html). The sequences were further screened to be potential epitopes using the CTLPred online server (http://crdd.osdd.net/raghava/ctlpred/).\nThe ability of the predicted linear B-cell and the T-cell epitopes to mount interferon-gamma (IFN-γ) response was assessed using the IFNepitope (http://crdd.osdd.net/raghava/ifnepitope/index.php)."}

    LitCovid-PD-UBERON

    {"project":"LitCovid-PD-UBERON","denotations":[{"id":"T2","span":{"begin":299,"end":304},"obj":"Body_part"},{"id":"T3","span":{"begin":309,"end":315},"obj":"Body_part"}],"attributes":[{"id":"A2","pred":"uberon_id","subj":"T2","obj":"http://purl.obolibrary.org/obo/UBERON_0000178"},{"id":"A3","pred":"uberon_id","subj":"T3","obj":"http://purl.obolibrary.org/obo/UBERON_0000341"}],"text":"Material \u0026 Methods\nThe clinical samples were referred by the hospital authorities through the Kerala State Health Services for diagnostic purposes. Further samples were received from different parts of India for establishing the presence of SARS-CoV-2.\nDetection of SARS-CoV-2 in suspected samples: Blood and throat swab (TS) specimens were collected from the suspected cases that complied with the case definition of SARS-CoV-2 infection as per the guidelines of the Ministry of Health and Family Welfare14. The TS was collected in viral transport medium. These samples were referred to the ICMR-NIV, Pune, India (which is the national reference laboratory for India, also referred as the government's apex laboratory). As of February 29, 2020, 881 samples of suspected cases referred from different States, with a travel history to Wuhan, China, and other SARS-CoV-2-affected countries, were screened.\nThe viral RNA was extracted from the TS sample using the Magmax RNA extraction kit (Applied Biosystems, USA) as per the manufacturer's instructions. The extracted RNA was immediately used for testing the presence of SARS-CoV-2 using the real-time RT-PCR protocol published by the WHO12 for the detection of RdRp (1), RdRp (2), E gene and N gene. RNase P gene was used as the internal control for the analysis. Confirmatory laboratory tests were performed as per the WHO-recommended test protocols13. These samples were also sequenced using the NGS approach to retrieve the complete genome of the virus.\nNGS of SARS-CoV-2 from India - Phylogenetic analysis and molecular characterization: The total RNA of three positive TS specimens from Kerala, was extracted from 250-300 μl of the SARS-CoV-2 real-time RT-PCR positive samples. QIAamp Viral RNA extraction kit (QIAGEN, Hilden, Germany) was used according to the manufacturer's instructions. The extracted RNA was further quantified using a Qubit RNA High-Sensitivity kit (Invitrogen, USA). RNA libraries were prepared as per the earlier-defined protocol and quantified using KAPA Library Quantification Kit (Kapa Biosystems, Roche Diagnostics Corporation, USA) as per the manufacturer's protocol. Further, individual libraries were neutralized and loaded on the Miniseq platform (Illumina, USA). The detailed protocols for the steps undertaken have been published earlier1516. The data generated from the machine were analyzed using CLC genomics workbench version 11.0 (CLC, QIAGEN, Germany). Reference-based mapping was performed to retrieve the sequence of the SARS-CoV-2.\nFull-length genome sequences of SARS-CoV-2 were downloaded from the GISAID database17 (Supplementary Table I (available from http://www.ijmr.org.in/articles/2020/151/2/images/IndianJMedRes_2020_151_2_200_281471_sm5.pdf)). Multiple sequence alignment was performed using the MEGA software version 7.018 with retrieved sequences from two of the three positive cases and the available GISAID sequences. A phylogenetic tree was generated using the neighbour joining method and the Kimura-2-parameter as the nucleotide (nt) substitution model with 1000 bootstrap replications as implemented in MEGA software18. Per cent nucleotide divergence and amino acid (aa) divergence were calculated using the p-distance method18. Mutations specific to the Indian SARS-CoV-2 viruses were identified by comparing the coding regions with respect to the SARS-CoV-2, Wuhan, China (Wuhan hu-1).\nThree-dimensional (3D) model of the spike protein and epitope prediction: The pre-fusion structure of the Indian case 1 SARS-CoV-2 spike (S) glycoprotein was modelled using the Swiss-Model server (https://swissmodel.expasy.org/interactive) and the corresponding S protein of Wuhan-Hu-1 (6VSB.PDB) as the template (99.97% identity). Sequential (linear) B-cell epitopes were predicted using BepiPred-2.0 server (http://www.cbs.dtu.dk/services/BepiPred/). The ABCpred prediction tool (http://crdd.osdd.net/raghava/abcpred/) was also used to identify the B-cell epitopes in the Indian SARS-CoV-2 sequence. The epitope prediction probability of \u003e0.8 was set to increase the specificity of the peptide stretch. The overlapping epitopes predicted by BepiPred-2.0 online server and the ABCpred prediction tool were identified. The antigenicity of the shortlisted peptide sequences was further predicted using the Vaxijen online server (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) with a default threshold of 0.4.\nDiscontinuous epitopes on the modelled structure of the Indian case 1 SARS-CoV-2 spike protein were predicted using the online servers, Ellipro (http://tools.iedb.org/ellipro/) and DiscoTope 2.0 (http://tools.iedb.org/discotope/), integrated in the Immune Epitope Database. Ellipro predicts epitopes based on the protusion index (PI), wherein the protein shape is approximated as an ellipsoid (Ref for Ellipro and DiscoTope). An ellipsoid with the PI value of 0.8 indicates that 80 per cent of the residues are within the ellipsoid and 20 per cent are outside. All residues that are outside the 80 per cent ellipsoid will have a score of 0.8. Residues with larger scores are associated with greater solvent accessibility. The PI value was set to a score of 0.8. DiscoTope predicts epitopes using 3D structure and half-sphere exposure as a surface measure in a novel spatial neighbourhood definition method. Default values were set for sensitivity (0.47) and specificity (0.75) for selecting the amino acids forming discontinuous epitopes. A sensitivity of 0.47 means that 47 per cent of the epitope residues are predicted as part of the epitopes, while a specificity of 0.75 means that 25 per cent of the non-epitope residues are predicted as part of the epitopes. Outputs from both the methods were combined, and the final regions were mapped on the modelled 3D-structure as the most probable conformational epitopes. In addition, we also predicted N-linked glycosylation sites in the S protein using NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/). The spike proteins were also screened for the presence of potential epitopes presented by major histocompatibility complex (MHC) class I molecules to cytotoxic T lymphocytes (CTLs). The online NetCTL1.2 server (http://www.cbs.dtu.dk/services/NetCTL/) based on machine learning techniques such as artificial neural network (ANN) and support vector machine (SVM) was used to predict the T-cell epitopes. The prediction was made for all the human leucocyte antigen (HLA) supertypes and the available human alleles. The C terminal cleavage, weight of transport-associated protein (TAP) efficiency and threshold for identification were kept as default. VaxiJen v2.0 tool was used to predict the antigenicity of the predicted epitopes (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html). The sequences were further screened to be potential epitopes using the CTLPred online server (http://crdd.osdd.net/raghava/ctlpred/).\nThe ability of the predicted linear B-cell and the T-cell epitopes to mount interferon-gamma (IFN-γ) response was assessed using the IFNepitope (http://crdd.osdd.net/raghava/ifnepitope/index.php)."}

    LitCovid-PD-MONDO

    {"project":"LitCovid-PD-MONDO","denotations":[{"id":"T56","span":{"begin":241,"end":249},"obj":"Disease"},{"id":"T57","span":{"begin":241,"end":245},"obj":"Disease"},{"id":"T58","span":{"begin":266,"end":274},"obj":"Disease"},{"id":"T59","span":{"begin":266,"end":270},"obj":"Disease"},{"id":"T60","span":{"begin":322,"end":324},"obj":"Disease"},{"id":"T62","span":{"begin":418,"end":426},"obj":"Disease"},{"id":"T63","span":{"begin":418,"end":422},"obj":"Disease"},{"id":"T64","span":{"begin":429,"end":438},"obj":"Disease"},{"id":"T65","span":{"begin":513,"end":515},"obj":"Disease"},{"id":"T67","span":{"begin":858,"end":866},"obj":"Disease"},{"id":"T68","span":{"begin":858,"end":862},"obj":"Disease"},{"id":"T69","span":{"begin":941,"end":943},"obj":"Disease"},{"id":"T71","span":{"begin":1120,"end":1128},"obj":"Disease"},{"id":"T72","span":{"begin":1120,"end":1124},"obj":"Disease"},{"id":"T73","span":{"begin":1514,"end":1522},"obj":"Disease"},{"id":"T74","span":{"begin":1514,"end":1518},"obj":"Disease"},{"id":"T75","span":{"begin":1624,"end":1626},"obj":"Disease"},{"id":"T77","span":{"begin":1687,"end":1695},"obj":"Disease"},{"id":"T78","span":{"begin":1687,"end":1691},"obj":"Disease"},{"id":"T79","span":{"begin":2388,"end":2391},"obj":"Disease"},{"id":"T80","span":{"begin":2425,"end":2428},"obj":"Disease"},{"id":"T81","span":{"begin":2518,"end":2526},"obj":"Disease"},{"id":"T82","span":{"begin":2518,"end":2522},"obj":"Disease"},{"id":"T83","span":{"begin":2562,"end":2570},"obj":"Disease"},{"id":"T84","span":{"begin":2562,"end":2566},"obj":"Disease"},{"id":"T85","span":{"begin":3278,"end":3286},"obj":"Disease"},{"id":"T86","span":{"begin":3278,"end":3282},"obj":"Disease"},{"id":"T87","span":{"begin":3365,"end":3373},"obj":"Disease"},{"id":"T88","span":{"begin":3365,"end":3369},"obj":"Disease"},{"id":"T89","span":{"begin":3524,"end":3532},"obj":"Disease"},{"id":"T90","span":{"begin":3524,"end":3528},"obj":"Disease"},{"id":"T91","span":{"begin":3985,"end":3993},"obj":"Disease"},{"id":"T92","span":{"begin":3985,"end":3989},"obj":"Disease"},{"id":"T93","span":{"begin":4493,"end":4501},"obj":"Disease"},{"id":"T94","span":{"begin":4493,"end":4497},"obj":"Disease"}],"attributes":[{"id":"A56","pred":"mondo_id","subj":"T56","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A57","pred":"mondo_id","subj":"T57","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A58","pred":"mondo_id","subj":"T58","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A59","pred":"mondo_id","subj":"T59","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A60","pred":"mondo_id","subj":"T60","obj":"http://purl.obolibrary.org/obo/MONDO_0010979"},{"id":"A61","pred":"mondo_id","subj":"T60","obj":"http://purl.obolibrary.org/obo/MONDO_0016455"},{"id":"A62","pred":"mondo_id","subj":"T62","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A63","pred":"mondo_id","subj":"T63","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A64","pred":"mondo_id","subj":"T64","obj":"http://purl.obolibrary.org/obo/MONDO_0005550"},{"id":"A65","pred":"mondo_id","subj":"T65","obj":"http://purl.obolibrary.org/obo/MONDO_0010979"},{"id":"A66","pred":"mondo_id","subj":"T65","obj":"http://purl.obolibrary.org/obo/MONDO_0016455"},{"id":"A67","pred":"mondo_id","subj":"T67","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A68","pred":"mondo_id","subj":"T68","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A69","pred":"mondo_id","subj":"T69","obj":"http://purl.obolibrary.org/obo/MONDO_0010979"},{"id":"A70","pred":"mondo_id","subj":"T69","obj":"http://purl.obolibrary.org/obo/MONDO_0016455"},{"id":"A71","pred":"mondo_id","subj":"T71","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A72","pred":"mondo_id","subj":"T72","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A73","pred":"mondo_id","subj":"T73","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A74","pred":"mondo_id","subj":"T74","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A75","pred":"mondo_id","subj":"T75","obj":"http://purl.obolibrary.org/obo/MONDO_0010979"},{"id":"A76","pred":"mondo_id","subj":"T75","obj":"http://purl.obolibrary.org/obo/MONDO_0016455"},{"id":"A77","pred":"mondo_id","subj":"T77","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A78","pred":"mondo_id","subj":"T78","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A79","pred":"mondo_id","subj":"T79","obj":"http://purl.obolibrary.org/obo/MONDO_0004315"},{"id":"A80","pred":"mondo_id","subj":"T80","obj":"http://purl.obolibrary.org/obo/MONDO_0004315"},{"id":"A81","pred":"mondo_id","subj":"T81","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A82","pred":"mondo_id","subj":"T82","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A83","pred":"mondo_id","subj":"T83","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A84","pred":"mondo_id","subj":"T84","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A85","pred":"mondo_id","subj":"T85","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A86","pred":"mondo_id","subj":"T86","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A87","pred":"mondo_id","subj":"T87","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A88","pred":"mondo_id","subj":"T88","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A89","pred":"mondo_id","subj":"T89","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A90","pred":"mondo_id","subj":"T90","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A91","pred":"mondo_id","subj":"T91","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A92","pred":"mondo_id","subj":"T92","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A93","pred":"mondo_id","subj":"T93","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A94","pred":"mondo_id","subj":"T94","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"}],"text":"Material \u0026 Methods\nThe clinical samples were referred by the hospital authorities through the Kerala State Health Services for diagnostic purposes. Further samples were received from different parts of India for establishing the presence of SARS-CoV-2.\nDetection of SARS-CoV-2 in suspected samples: Blood and throat swab (TS) specimens were collected from the suspected cases that complied with the case definition of SARS-CoV-2 infection as per the guidelines of the Ministry of Health and Family Welfare14. The TS was collected in viral transport medium. These samples were referred to the ICMR-NIV, Pune, India (which is the national reference laboratory for India, also referred as the government's apex laboratory). As of February 29, 2020, 881 samples of suspected cases referred from different States, with a travel history to Wuhan, China, and other SARS-CoV-2-affected countries, were screened.\nThe viral RNA was extracted from the TS sample using the Magmax RNA extraction kit (Applied Biosystems, USA) as per the manufacturer's instructions. The extracted RNA was immediately used for testing the presence of SARS-CoV-2 using the real-time RT-PCR protocol published by the WHO12 for the detection of RdRp (1), RdRp (2), E gene and N gene. RNase P gene was used as the internal control for the analysis. Confirmatory laboratory tests were performed as per the WHO-recommended test protocols13. These samples were also sequenced using the NGS approach to retrieve the complete genome of the virus.\nNGS of SARS-CoV-2 from India - Phylogenetic analysis and molecular characterization: The total RNA of three positive TS specimens from Kerala, was extracted from 250-300 μl of the SARS-CoV-2 real-time RT-PCR positive samples. QIAamp Viral RNA extraction kit (QIAGEN, Hilden, Germany) was used according to the manufacturer's instructions. The extracted RNA was further quantified using a Qubit RNA High-Sensitivity kit (Invitrogen, USA). RNA libraries were prepared as per the earlier-defined protocol and quantified using KAPA Library Quantification Kit (Kapa Biosystems, Roche Diagnostics Corporation, USA) as per the manufacturer's protocol. Further, individual libraries were neutralized and loaded on the Miniseq platform (Illumina, USA). The detailed protocols for the steps undertaken have been published earlier1516. The data generated from the machine were analyzed using CLC genomics workbench version 11.0 (CLC, QIAGEN, Germany). Reference-based mapping was performed to retrieve the sequence of the SARS-CoV-2.\nFull-length genome sequences of SARS-CoV-2 were downloaded from the GISAID database17 (Supplementary Table I (available from http://www.ijmr.org.in/articles/2020/151/2/images/IndianJMedRes_2020_151_2_200_281471_sm5.pdf)). Multiple sequence alignment was performed using the MEGA software version 7.018 with retrieved sequences from two of the three positive cases and the available GISAID sequences. A phylogenetic tree was generated using the neighbour joining method and the Kimura-2-parameter as the nucleotide (nt) substitution model with 1000 bootstrap replications as implemented in MEGA software18. Per cent nucleotide divergence and amino acid (aa) divergence were calculated using the p-distance method18. Mutations specific to the Indian SARS-CoV-2 viruses were identified by comparing the coding regions with respect to the SARS-CoV-2, Wuhan, China (Wuhan hu-1).\nThree-dimensional (3D) model of the spike protein and epitope prediction: The pre-fusion structure of the Indian case 1 SARS-CoV-2 spike (S) glycoprotein was modelled using the Swiss-Model server (https://swissmodel.expasy.org/interactive) and the corresponding S protein of Wuhan-Hu-1 (6VSB.PDB) as the template (99.97% identity). Sequential (linear) B-cell epitopes were predicted using BepiPred-2.0 server (http://www.cbs.dtu.dk/services/BepiPred/). The ABCpred prediction tool (http://crdd.osdd.net/raghava/abcpred/) was also used to identify the B-cell epitopes in the Indian SARS-CoV-2 sequence. The epitope prediction probability of \u003e0.8 was set to increase the specificity of the peptide stretch. The overlapping epitopes predicted by BepiPred-2.0 online server and the ABCpred prediction tool were identified. The antigenicity of the shortlisted peptide sequences was further predicted using the Vaxijen online server (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) with a default threshold of 0.4.\nDiscontinuous epitopes on the modelled structure of the Indian case 1 SARS-CoV-2 spike protein were predicted using the online servers, Ellipro (http://tools.iedb.org/ellipro/) and DiscoTope 2.0 (http://tools.iedb.org/discotope/), integrated in the Immune Epitope Database. Ellipro predicts epitopes based on the protusion index (PI), wherein the protein shape is approximated as an ellipsoid (Ref for Ellipro and DiscoTope). An ellipsoid with the PI value of 0.8 indicates that 80 per cent of the residues are within the ellipsoid and 20 per cent are outside. All residues that are outside the 80 per cent ellipsoid will have a score of 0.8. Residues with larger scores are associated with greater solvent accessibility. The PI value was set to a score of 0.8. DiscoTope predicts epitopes using 3D structure and half-sphere exposure as a surface measure in a novel spatial neighbourhood definition method. Default values were set for sensitivity (0.47) and specificity (0.75) for selecting the amino acids forming discontinuous epitopes. A sensitivity of 0.47 means that 47 per cent of the epitope residues are predicted as part of the epitopes, while a specificity of 0.75 means that 25 per cent of the non-epitope residues are predicted as part of the epitopes. Outputs from both the methods were combined, and the final regions were mapped on the modelled 3D-structure as the most probable conformational epitopes. In addition, we also predicted N-linked glycosylation sites in the S protein using NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/). The spike proteins were also screened for the presence of potential epitopes presented by major histocompatibility complex (MHC) class I molecules to cytotoxic T lymphocytes (CTLs). The online NetCTL1.2 server (http://www.cbs.dtu.dk/services/NetCTL/) based on machine learning techniques such as artificial neural network (ANN) and support vector machine (SVM) was used to predict the T-cell epitopes. The prediction was made for all the human leucocyte antigen (HLA) supertypes and the available human alleles. The C terminal cleavage, weight of transport-associated protein (TAP) efficiency and threshold for identification were kept as default. VaxiJen v2.0 tool was used to predict the antigenicity of the predicted epitopes (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html). The sequences were further screened to be potential epitopes using the CTLPred online server (http://crdd.osdd.net/raghava/ctlpred/).\nThe ability of the predicted linear B-cell and the T-cell epitopes to mount interferon-gamma (IFN-γ) response was assessed using the IFNepitope (http://crdd.osdd.net/raghava/ifnepitope/index.php)."}

    LitCovid-PD-CLO

    {"project":"LitCovid-PD-CLO","denotations":[{"id":"T58","span":{"begin":299,"end":304},"obj":"http://purl.obolibrary.org/obo/UBERON_0000178"},{"id":"T59","span":{"begin":299,"end":304},"obj":"http://www.ebi.ac.uk/efo/EFO_0000296"},{"id":"T60","span":{"begin":814,"end":815},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T61","span":{"begin":1096,"end":1103},"obj":"http://purl.obolibrary.org/obo/UBERON_0000473"},{"id":"T62","span":{"begin":1233,"end":1237},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T63","span":{"begin":1244,"end":1248},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T64","span":{"begin":1258,"end":1262},"obj":"http://purl.obolibrary.org/obo/OGG_0000000002"},{"id":"T65","span":{"begin":1338,"end":1343},"obj":"http://purl.obolibrary.org/obo/UBERON_0000473"},{"id":"T66","span":{"begin":1386,"end":1390},"obj":"http://purl.obolibrary.org/obo/UBERON_0000473"},{"id":"T67","span":{"begin":1500,"end":1505},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T68","span":{"begin":1893,"end":1894},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T69","span":{"begin":2388,"end":2391},"obj":"http://purl.obolibrary.org/obo/CLO_0002494"},{"id":"T70","span":{"begin":2425,"end":2428},"obj":"http://purl.obolibrary.org/obo/CLO_0002494"},{"id":"T71","span":{"begin":2930,"end":2931},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T72","span":{"begin":3136,"end":3139},"obj":"http://purl.obolibrary.org/obo/CLO_0008190"},{"id":"T73","span":{"begin":3183,"end":3185},"obj":"http://purl.obolibrary.org/obo/CLO_0001627"},{"id":"T74","span":{"begin":3289,"end":3296},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_10239"},{"id":"T75","span":{"begin":3756,"end":3762},"obj":"http://purl.obolibrary.org/obo/CL_0000236"},{"id":"T76","span":{"begin":3955,"end":3961},"obj":"http://purl.obolibrary.org/obo/CL_0000236"},{"id":"T77","span":{"begin":4092,"end":4099},"obj":"http://purl.obolibrary.org/obo/PR_000018263"},{"id":"T78","span":{"begin":4259,"end":4266},"obj":"http://purl.obolibrary.org/obo/PR_000018263"},{"id":"T79","span":{"begin":4395,"end":4396},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T80","span":{"begin":5050,"end":5051},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T81","span":{"begin":5169,"end":5170},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T82","span":{"begin":5260,"end":5261},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T83","span":{"begin":5281,"end":5282},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T84","span":{"begin":5462,"end":5463},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T85","span":{"begin":5576,"end":5577},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T86","span":{"begin":6374,"end":6380},"obj":"http://purl.obolibrary.org/obo/CL_0000084"},{"id":"T87","span":{"begin":6427,"end":6432},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T88","span":{"begin":6433,"end":6442},"obj":"http://purl.obolibrary.org/obo/CL_0000738"},{"id":"T89","span":{"begin":6486,"end":6491},"obj":"http://purl.obolibrary.org/obo/NCBITaxon_9606"},{"id":"T90","span":{"begin":6566,"end":6569},"obj":"http://purl.obolibrary.org/obo/PR_000002979"},{"id":"T91","span":{"begin":6948,"end":6954},"obj":"http://purl.obolibrary.org/obo/CL_0000236"},{"id":"T92","span":{"begin":6963,"end":6969},"obj":"http://purl.obolibrary.org/obo/CL_0000084"},{"id":"T93","span":{"begin":6988,"end":7004},"obj":"http://purl.obolibrary.org/obo/PR_000000017"}],"text":"Material \u0026 Methods\nThe clinical samples were referred by the hospital authorities through the Kerala State Health Services for diagnostic purposes. Further samples were received from different parts of India for establishing the presence of SARS-CoV-2.\nDetection of SARS-CoV-2 in suspected samples: Blood and throat swab (TS) specimens were collected from the suspected cases that complied with the case definition of SARS-CoV-2 infection as per the guidelines of the Ministry of Health and Family Welfare14. The TS was collected in viral transport medium. These samples were referred to the ICMR-NIV, Pune, India (which is the national reference laboratory for India, also referred as the government's apex laboratory). As of February 29, 2020, 881 samples of suspected cases referred from different States, with a travel history to Wuhan, China, and other SARS-CoV-2-affected countries, were screened.\nThe viral RNA was extracted from the TS sample using the Magmax RNA extraction kit (Applied Biosystems, USA) as per the manufacturer's instructions. The extracted RNA was immediately used for testing the presence of SARS-CoV-2 using the real-time RT-PCR protocol published by the WHO12 for the detection of RdRp (1), RdRp (2), E gene and N gene. RNase P gene was used as the internal control for the analysis. Confirmatory laboratory tests were performed as per the WHO-recommended test protocols13. These samples were also sequenced using the NGS approach to retrieve the complete genome of the virus.\nNGS of SARS-CoV-2 from India - Phylogenetic analysis and molecular characterization: The total RNA of three positive TS specimens from Kerala, was extracted from 250-300 μl of the SARS-CoV-2 real-time RT-PCR positive samples. QIAamp Viral RNA extraction kit (QIAGEN, Hilden, Germany) was used according to the manufacturer's instructions. The extracted RNA was further quantified using a Qubit RNA High-Sensitivity kit (Invitrogen, USA). RNA libraries were prepared as per the earlier-defined protocol and quantified using KAPA Library Quantification Kit (Kapa Biosystems, Roche Diagnostics Corporation, USA) as per the manufacturer's protocol. Further, individual libraries were neutralized and loaded on the Miniseq platform (Illumina, USA). The detailed protocols for the steps undertaken have been published earlier1516. The data generated from the machine were analyzed using CLC genomics workbench version 11.0 (CLC, QIAGEN, Germany). Reference-based mapping was performed to retrieve the sequence of the SARS-CoV-2.\nFull-length genome sequences of SARS-CoV-2 were downloaded from the GISAID database17 (Supplementary Table I (available from http://www.ijmr.org.in/articles/2020/151/2/images/IndianJMedRes_2020_151_2_200_281471_sm5.pdf)). Multiple sequence alignment was performed using the MEGA software version 7.018 with retrieved sequences from two of the three positive cases and the available GISAID sequences. A phylogenetic tree was generated using the neighbour joining method and the Kimura-2-parameter as the nucleotide (nt) substitution model with 1000 bootstrap replications as implemented in MEGA software18. Per cent nucleotide divergence and amino acid (aa) divergence were calculated using the p-distance method18. Mutations specific to the Indian SARS-CoV-2 viruses were identified by comparing the coding regions with respect to the SARS-CoV-2, Wuhan, China (Wuhan hu-1).\nThree-dimensional (3D) model of the spike protein and epitope prediction: The pre-fusion structure of the Indian case 1 SARS-CoV-2 spike (S) glycoprotein was modelled using the Swiss-Model server (https://swissmodel.expasy.org/interactive) and the corresponding S protein of Wuhan-Hu-1 (6VSB.PDB) as the template (99.97% identity). Sequential (linear) B-cell epitopes were predicted using BepiPred-2.0 server (http://www.cbs.dtu.dk/services/BepiPred/). The ABCpred prediction tool (http://crdd.osdd.net/raghava/abcpred/) was also used to identify the B-cell epitopes in the Indian SARS-CoV-2 sequence. The epitope prediction probability of \u003e0.8 was set to increase the specificity of the peptide stretch. The overlapping epitopes predicted by BepiPred-2.0 online server and the ABCpred prediction tool were identified. The antigenicity of the shortlisted peptide sequences was further predicted using the Vaxijen online server (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) with a default threshold of 0.4.\nDiscontinuous epitopes on the modelled structure of the Indian case 1 SARS-CoV-2 spike protein were predicted using the online servers, Ellipro (http://tools.iedb.org/ellipro/) and DiscoTope 2.0 (http://tools.iedb.org/discotope/), integrated in the Immune Epitope Database. Ellipro predicts epitopes based on the protusion index (PI), wherein the protein shape is approximated as an ellipsoid (Ref for Ellipro and DiscoTope). An ellipsoid with the PI value of 0.8 indicates that 80 per cent of the residues are within the ellipsoid and 20 per cent are outside. All residues that are outside the 80 per cent ellipsoid will have a score of 0.8. Residues with larger scores are associated with greater solvent accessibility. The PI value was set to a score of 0.8. DiscoTope predicts epitopes using 3D structure and half-sphere exposure as a surface measure in a novel spatial neighbourhood definition method. Default values were set for sensitivity (0.47) and specificity (0.75) for selecting the amino acids forming discontinuous epitopes. A sensitivity of 0.47 means that 47 per cent of the epitope residues are predicted as part of the epitopes, while a specificity of 0.75 means that 25 per cent of the non-epitope residues are predicted as part of the epitopes. Outputs from both the methods were combined, and the final regions were mapped on the modelled 3D-structure as the most probable conformational epitopes. In addition, we also predicted N-linked glycosylation sites in the S protein using NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/). The spike proteins were also screened for the presence of potential epitopes presented by major histocompatibility complex (MHC) class I molecules to cytotoxic T lymphocytes (CTLs). The online NetCTL1.2 server (http://www.cbs.dtu.dk/services/NetCTL/) based on machine learning techniques such as artificial neural network (ANN) and support vector machine (SVM) was used to predict the T-cell epitopes. The prediction was made for all the human leucocyte antigen (HLA) supertypes and the available human alleles. The C terminal cleavage, weight of transport-associated protein (TAP) efficiency and threshold for identification were kept as default. VaxiJen v2.0 tool was used to predict the antigenicity of the predicted epitopes (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html). The sequences were further screened to be potential epitopes using the CTLPred online server (http://crdd.osdd.net/raghava/ctlpred/).\nThe ability of the predicted linear B-cell and the T-cell epitopes to mount interferon-gamma (IFN-γ) response was assessed using the IFNepitope (http://crdd.osdd.net/raghava/ifnepitope/index.php)."}

    LitCovid-PD-CHEBI

    {"project":"LitCovid-PD-CHEBI","denotations":[{"id":"T14","span":{"begin":322,"end":324},"obj":"Chemical"},{"id":"T15","span":{"begin":513,"end":515},"obj":"Chemical"},{"id":"T16","span":{"begin":941,"end":943},"obj":"Chemical"},{"id":"T17","span":{"begin":1624,"end":1626},"obj":"Chemical"},{"id":"T18","span":{"begin":2804,"end":2808},"obj":"Chemical"},{"id":"T19","span":{"begin":3033,"end":3043},"obj":"Chemical"},{"id":"T20","span":{"begin":3045,"end":3047},"obj":"Chemical"},{"id":"T21","span":{"begin":3119,"end":3123},"obj":"Chemical"},{"id":"T22","span":{"begin":3145,"end":3155},"obj":"Chemical"},{"id":"T23","span":{"begin":3171,"end":3181},"obj":"Chemical"},{"id":"T24","span":{"begin":3171,"end":3176},"obj":"Chemical"},{"id":"T25","span":{"begin":3177,"end":3181},"obj":"Chemical"},{"id":"T26","span":{"begin":3183,"end":3185},"obj":"Chemical"},{"id":"T27","span":{"begin":3446,"end":3453},"obj":"Chemical"},{"id":"T28","span":{"begin":3458,"end":3465},"obj":"Chemical"},{"id":"T29","span":{"begin":3545,"end":3557},"obj":"Chemical"},{"id":"T30","span":{"begin":3668,"end":3675},"obj":"Chemical"},{"id":"T31","span":{"begin":4010,"end":4017},"obj":"Chemical"},{"id":"T32","span":{"begin":4092,"end":4099},"obj":"Chemical"},{"id":"T33","span":{"begin":4259,"end":4266},"obj":"Chemical"},{"id":"T34","span":{"begin":4510,"end":4517},"obj":"Chemical"},{"id":"T35","span":{"begin":4753,"end":4755},"obj":"Chemical"},{"id":"T39","span":{"begin":4770,"end":4777},"obj":"Chemical"},{"id":"T40","span":{"begin":4871,"end":4873},"obj":"Chemical"},{"id":"T44","span":{"begin":5122,"end":5129},"obj":"Chemical"},{"id":"T45","span":{"begin":5149,"end":5151},"obj":"Chemical"},{"id":"T49","span":{"begin":5418,"end":5429},"obj":"Chemical"},{"id":"T50","span":{"begin":5418,"end":5423},"obj":"Chemical"},{"id":"T51","span":{"begin":5424,"end":5429},"obj":"Chemical"},{"id":"T52","span":{"begin":5514,"end":5521},"obj":"Chemical"},{"id":"T53","span":{"begin":5632,"end":5639},"obj":"Chemical"},{"id":"T54","span":{"begin":5911,"end":5918},"obj":"Chemical"},{"id":"T55","span":{"begin":5999,"end":6007},"obj":"Chemical"},{"id":"T56","span":{"begin":6126,"end":6135},"obj":"Chemical"},{"id":"T57","span":{"begin":6443,"end":6450},"obj":"Chemical"},{"id":"T58","span":{"begin":6557,"end":6564},"obj":"Chemical"},{"id":"T59","span":{"begin":6988,"end":6998},"obj":"Chemical"},{"id":"T60","span":{"begin":6999,"end":7004},"obj":"Chemical"}],"attributes":[{"id":"A14","pred":"chebi_id","subj":"T14","obj":"http://purl.obolibrary.org/obo/CHEBI_73664"},{"id":"A15","pred":"chebi_id","subj":"T15","obj":"http://purl.obolibrary.org/obo/CHEBI_73664"},{"id":"A16","pred":"chebi_id","subj":"T16","obj":"http://purl.obolibrary.org/obo/CHEBI_73664"},{"id":"A17","pred":"chebi_id","subj":"T17","obj":"http://purl.obolibrary.org/obo/CHEBI_73664"},{"id":"A18","pred":"chebi_id","subj":"T18","obj":"http://purl.obolibrary.org/obo/CHEBI_6617"},{"id":"A19","pred":"chebi_id","subj":"T19","obj":"http://purl.obolibrary.org/obo/CHEBI_36976"},{"id":"A20","pred":"chebi_id","subj":"T20","obj":"http://purl.obolibrary.org/obo/CHEBI_36976"},{"id":"A21","pred":"chebi_id","subj":"T21","obj":"http://purl.obolibrary.org/obo/CHEBI_6617"},{"id":"A22","pred":"chebi_id","subj":"T22","obj":"http://purl.obolibrary.org/obo/CHEBI_36976"},{"id":"A23","pred":"chebi_id","subj":"T23","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A24","pred":"chebi_id","subj":"T24","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A25","pred":"chebi_id","subj":"T25","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A26","pred":"chebi_id","subj":"T26","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A27","pred":"chebi_id","subj":"T27","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A28","pred":"chebi_id","subj":"T28","obj":"http://purl.obolibrary.org/obo/CHEBI_53000"},{"id":"A29","pred":"chebi_id","subj":"T29","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A30","pred":"chebi_id","subj":"T30","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A31","pred":"chebi_id","subj":"T31","obj":"http://purl.obolibrary.org/obo/CHEBI_53000"},{"id":"A32","pred":"chebi_id","subj":"T32","obj":"http://purl.obolibrary.org/obo/CHEBI_16670"},{"id":"A33","pred":"chebi_id","subj":"T33","obj":"http://purl.obolibrary.org/obo/CHEBI_16670"},{"id":"A34","pred":"chebi_id","subj":"T34","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A35","pred":"chebi_id","subj":"T35","obj":"http://purl.obolibrary.org/obo/CHEBI_28874"},{"id":"A36","pred":"chebi_id","subj":"T35","obj":"http://purl.obolibrary.org/obo/CHEBI_53806"},{"id":"A37","pred":"chebi_id","subj":"T35","obj":"http://purl.obolibrary.org/obo/CHEBI_61484"},{"id":"A38","pred":"chebi_id","subj":"T35","obj":"http://purl.obolibrary.org/obo/CHEBI_74790"},{"id":"A39","pred":"chebi_id","subj":"T39","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A40","pred":"chebi_id","subj":"T40","obj":"http://purl.obolibrary.org/obo/CHEBI_28874"},{"id":"A41","pred":"chebi_id","subj":"T40","obj":"http://purl.obolibrary.org/obo/CHEBI_53806"},{"id":"A42","pred":"chebi_id","subj":"T40","obj":"http://purl.obolibrary.org/obo/CHEBI_61484"},{"id":"A43","pred":"chebi_id","subj":"T40","obj":"http://purl.obolibrary.org/obo/CHEBI_74790"},{"id":"A44","pred":"chebi_id","subj":"T44","obj":"http://purl.obolibrary.org/obo/CHEBI_46787"},{"id":"A45","pred":"chebi_id","subj":"T45","obj":"http://purl.obolibrary.org/obo/CHEBI_28874"},{"id":"A46","pred":"chebi_id","subj":"T45","obj":"http://purl.obolibrary.org/obo/CHEBI_53806"},{"id":"A47","pred":"chebi_id","subj":"T45","obj":"http://purl.obolibrary.org/obo/CHEBI_61484"},{"id":"A48","pred":"chebi_id","subj":"T45","obj":"http://purl.obolibrary.org/obo/CHEBI_74790"},{"id":"A49","pred":"chebi_id","subj":"T49","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A50","pred":"chebi_id","subj":"T50","obj":"http://purl.obolibrary.org/obo/CHEBI_46882"},{"id":"A51","pred":"chebi_id","subj":"T51","obj":"http://purl.obolibrary.org/obo/CHEBI_37527"},{"id":"A52","pred":"chebi_id","subj":"T52","obj":"http://purl.obolibrary.org/obo/CHEBI_53000"},{"id":"A53","pred":"chebi_id","subj":"T53","obj":"http://purl.obolibrary.org/obo/CHEBI_53000"},{"id":"A54","pred":"chebi_id","subj":"T54","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A55","pred":"chebi_id","subj":"T55","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A56","pred":"chebi_id","subj":"T56","obj":"http://purl.obolibrary.org/obo/CHEBI_25367"},{"id":"A57","pred":"chebi_id","subj":"T57","obj":"http://purl.obolibrary.org/obo/CHEBI_59132"},{"id":"A58","pred":"chebi_id","subj":"T58","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A59","pred":"chebi_id","subj":"T59","obj":"http://purl.obolibrary.org/obo/CHEBI_52999"},{"id":"A60","pred":"chebi_id","subj":"T60","obj":"http://purl.obolibrary.org/obo/CHEBI_30212"}],"text":"Material \u0026 Methods\nThe clinical samples were referred by the hospital authorities through the Kerala State Health Services for diagnostic purposes. Further samples were received from different parts of India for establishing the presence of SARS-CoV-2.\nDetection of SARS-CoV-2 in suspected samples: Blood and throat swab (TS) specimens were collected from the suspected cases that complied with the case definition of SARS-CoV-2 infection as per the guidelines of the Ministry of Health and Family Welfare14. The TS was collected in viral transport medium. These samples were referred to the ICMR-NIV, Pune, India (which is the national reference laboratory for India, also referred as the government's apex laboratory). As of February 29, 2020, 881 samples of suspected cases referred from different States, with a travel history to Wuhan, China, and other SARS-CoV-2-affected countries, were screened.\nThe viral RNA was extracted from the TS sample using the Magmax RNA extraction kit (Applied Biosystems, USA) as per the manufacturer's instructions. The extracted RNA was immediately used for testing the presence of SARS-CoV-2 using the real-time RT-PCR protocol published by the WHO12 for the detection of RdRp (1), RdRp (2), E gene and N gene. RNase P gene was used as the internal control for the analysis. Confirmatory laboratory tests were performed as per the WHO-recommended test protocols13. These samples were also sequenced using the NGS approach to retrieve the complete genome of the virus.\nNGS of SARS-CoV-2 from India - Phylogenetic analysis and molecular characterization: The total RNA of three positive TS specimens from Kerala, was extracted from 250-300 μl of the SARS-CoV-2 real-time RT-PCR positive samples. QIAamp Viral RNA extraction kit (QIAGEN, Hilden, Germany) was used according to the manufacturer's instructions. The extracted RNA was further quantified using a Qubit RNA High-Sensitivity kit (Invitrogen, USA). RNA libraries were prepared as per the earlier-defined protocol and quantified using KAPA Library Quantification Kit (Kapa Biosystems, Roche Diagnostics Corporation, USA) as per the manufacturer's protocol. Further, individual libraries were neutralized and loaded on the Miniseq platform (Illumina, USA). The detailed protocols for the steps undertaken have been published earlier1516. The data generated from the machine were analyzed using CLC genomics workbench version 11.0 (CLC, QIAGEN, Germany). Reference-based mapping was performed to retrieve the sequence of the SARS-CoV-2.\nFull-length genome sequences of SARS-CoV-2 were downloaded from the GISAID database17 (Supplementary Table I (available from http://www.ijmr.org.in/articles/2020/151/2/images/IndianJMedRes_2020_151_2_200_281471_sm5.pdf)). Multiple sequence alignment was performed using the MEGA software version 7.018 with retrieved sequences from two of the three positive cases and the available GISAID sequences. A phylogenetic tree was generated using the neighbour joining method and the Kimura-2-parameter as the nucleotide (nt) substitution model with 1000 bootstrap replications as implemented in MEGA software18. Per cent nucleotide divergence and amino acid (aa) divergence were calculated using the p-distance method18. Mutations specific to the Indian SARS-CoV-2 viruses were identified by comparing the coding regions with respect to the SARS-CoV-2, Wuhan, China (Wuhan hu-1).\nThree-dimensional (3D) model of the spike protein and epitope prediction: The pre-fusion structure of the Indian case 1 SARS-CoV-2 spike (S) glycoprotein was modelled using the Swiss-Model server (https://swissmodel.expasy.org/interactive) and the corresponding S protein of Wuhan-Hu-1 (6VSB.PDB) as the template (99.97% identity). Sequential (linear) B-cell epitopes were predicted using BepiPred-2.0 server (http://www.cbs.dtu.dk/services/BepiPred/). The ABCpred prediction tool (http://crdd.osdd.net/raghava/abcpred/) was also used to identify the B-cell epitopes in the Indian SARS-CoV-2 sequence. The epitope prediction probability of \u003e0.8 was set to increase the specificity of the peptide stretch. The overlapping epitopes predicted by BepiPred-2.0 online server and the ABCpred prediction tool were identified. The antigenicity of the shortlisted peptide sequences was further predicted using the Vaxijen online server (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) with a default threshold of 0.4.\nDiscontinuous epitopes on the modelled structure of the Indian case 1 SARS-CoV-2 spike protein were predicted using the online servers, Ellipro (http://tools.iedb.org/ellipro/) and DiscoTope 2.0 (http://tools.iedb.org/discotope/), integrated in the Immune Epitope Database. Ellipro predicts epitopes based on the protusion index (PI), wherein the protein shape is approximated as an ellipsoid (Ref for Ellipro and DiscoTope). An ellipsoid with the PI value of 0.8 indicates that 80 per cent of the residues are within the ellipsoid and 20 per cent are outside. All residues that are outside the 80 per cent ellipsoid will have a score of 0.8. Residues with larger scores are associated with greater solvent accessibility. The PI value was set to a score of 0.8. DiscoTope predicts epitopes using 3D structure and half-sphere exposure as a surface measure in a novel spatial neighbourhood definition method. Default values were set for sensitivity (0.47) and specificity (0.75) for selecting the amino acids forming discontinuous epitopes. A sensitivity of 0.47 means that 47 per cent of the epitope residues are predicted as part of the epitopes, while a specificity of 0.75 means that 25 per cent of the non-epitope residues are predicted as part of the epitopes. Outputs from both the methods were combined, and the final regions were mapped on the modelled 3D-structure as the most probable conformational epitopes. In addition, we also predicted N-linked glycosylation sites in the S protein using NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/). The spike proteins were also screened for the presence of potential epitopes presented by major histocompatibility complex (MHC) class I molecules to cytotoxic T lymphocytes (CTLs). The online NetCTL1.2 server (http://www.cbs.dtu.dk/services/NetCTL/) based on machine learning techniques such as artificial neural network (ANN) and support vector machine (SVM) was used to predict the T-cell epitopes. The prediction was made for all the human leucocyte antigen (HLA) supertypes and the available human alleles. The C terminal cleavage, weight of transport-associated protein (TAP) efficiency and threshold for identification were kept as default. VaxiJen v2.0 tool was used to predict the antigenicity of the predicted epitopes (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html). The sequences were further screened to be potential epitopes using the CTLPred online server (http://crdd.osdd.net/raghava/ctlpred/).\nThe ability of the predicted linear B-cell and the T-cell epitopes to mount interferon-gamma (IFN-γ) response was assessed using the IFNepitope (http://crdd.osdd.net/raghava/ifnepitope/index.php)."}

    LitCovid-PD-GO-BP

    {"project":"LitCovid-PD-GO-BP","denotations":[{"id":"T7","span":{"begin":533,"end":548},"obj":"http://purl.obolibrary.org/obo/GO_0046794"},{"id":"T8","span":{"begin":539,"end":548},"obj":"http://purl.obolibrary.org/obo/GO_0006810"},{"id":"T9","span":{"begin":1250,"end":1257},"obj":"http://purl.obolibrary.org/obo/GO_0004526"},{"id":"T10","span":{"begin":5873,"end":5895},"obj":"http://purl.obolibrary.org/obo/GO_0006487"},{"id":"T11","span":{"begin":5882,"end":5895},"obj":"http://purl.obolibrary.org/obo/GO_0070085"},{"id":"T12","span":{"begin":6079,"end":6111},"obj":"http://purl.obolibrary.org/obo/GO_0046776"},{"id":"T13","span":{"begin":6113,"end":6116},"obj":"http://purl.obolibrary.org/obo/GO_0046776"},{"id":"T14","span":{"begin":6257,"end":6265},"obj":"http://purl.obolibrary.org/obo/GO_0007612"},{"id":"T15","span":{"begin":6536,"end":6545},"obj":"http://purl.obolibrary.org/obo/GO_0006810"}],"text":"Material \u0026 Methods\nThe clinical samples were referred by the hospital authorities through the Kerala State Health Services for diagnostic purposes. Further samples were received from different parts of India for establishing the presence of SARS-CoV-2.\nDetection of SARS-CoV-2 in suspected samples: Blood and throat swab (TS) specimens were collected from the suspected cases that complied with the case definition of SARS-CoV-2 infection as per the guidelines of the Ministry of Health and Family Welfare14. The TS was collected in viral transport medium. These samples were referred to the ICMR-NIV, Pune, India (which is the national reference laboratory for India, also referred as the government's apex laboratory). As of February 29, 2020, 881 samples of suspected cases referred from different States, with a travel history to Wuhan, China, and other SARS-CoV-2-affected countries, were screened.\nThe viral RNA was extracted from the TS sample using the Magmax RNA extraction kit (Applied Biosystems, USA) as per the manufacturer's instructions. The extracted RNA was immediately used for testing the presence of SARS-CoV-2 using the real-time RT-PCR protocol published by the WHO12 for the detection of RdRp (1), RdRp (2), E gene and N gene. RNase P gene was used as the internal control for the analysis. Confirmatory laboratory tests were performed as per the WHO-recommended test protocols13. These samples were also sequenced using the NGS approach to retrieve the complete genome of the virus.\nNGS of SARS-CoV-2 from India - Phylogenetic analysis and molecular characterization: The total RNA of three positive TS specimens from Kerala, was extracted from 250-300 μl of the SARS-CoV-2 real-time RT-PCR positive samples. QIAamp Viral RNA extraction kit (QIAGEN, Hilden, Germany) was used according to the manufacturer's instructions. The extracted RNA was further quantified using a Qubit RNA High-Sensitivity kit (Invitrogen, USA). RNA libraries were prepared as per the earlier-defined protocol and quantified using KAPA Library Quantification Kit (Kapa Biosystems, Roche Diagnostics Corporation, USA) as per the manufacturer's protocol. Further, individual libraries were neutralized and loaded on the Miniseq platform (Illumina, USA). The detailed protocols for the steps undertaken have been published earlier1516. The data generated from the machine were analyzed using CLC genomics workbench version 11.0 (CLC, QIAGEN, Germany). Reference-based mapping was performed to retrieve the sequence of the SARS-CoV-2.\nFull-length genome sequences of SARS-CoV-2 were downloaded from the GISAID database17 (Supplementary Table I (available from http://www.ijmr.org.in/articles/2020/151/2/images/IndianJMedRes_2020_151_2_200_281471_sm5.pdf)). Multiple sequence alignment was performed using the MEGA software version 7.018 with retrieved sequences from two of the three positive cases and the available GISAID sequences. A phylogenetic tree was generated using the neighbour joining method and the Kimura-2-parameter as the nucleotide (nt) substitution model with 1000 bootstrap replications as implemented in MEGA software18. Per cent nucleotide divergence and amino acid (aa) divergence were calculated using the p-distance method18. Mutations specific to the Indian SARS-CoV-2 viruses were identified by comparing the coding regions with respect to the SARS-CoV-2, Wuhan, China (Wuhan hu-1).\nThree-dimensional (3D) model of the spike protein and epitope prediction: The pre-fusion structure of the Indian case 1 SARS-CoV-2 spike (S) glycoprotein was modelled using the Swiss-Model server (https://swissmodel.expasy.org/interactive) and the corresponding S protein of Wuhan-Hu-1 (6VSB.PDB) as the template (99.97% identity). Sequential (linear) B-cell epitopes were predicted using BepiPred-2.0 server (http://www.cbs.dtu.dk/services/BepiPred/). The ABCpred prediction tool (http://crdd.osdd.net/raghava/abcpred/) was also used to identify the B-cell epitopes in the Indian SARS-CoV-2 sequence. The epitope prediction probability of \u003e0.8 was set to increase the specificity of the peptide stretch. The overlapping epitopes predicted by BepiPred-2.0 online server and the ABCpred prediction tool were identified. The antigenicity of the shortlisted peptide sequences was further predicted using the Vaxijen online server (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) with a default threshold of 0.4.\nDiscontinuous epitopes on the modelled structure of the Indian case 1 SARS-CoV-2 spike protein were predicted using the online servers, Ellipro (http://tools.iedb.org/ellipro/) and DiscoTope 2.0 (http://tools.iedb.org/discotope/), integrated in the Immune Epitope Database. Ellipro predicts epitopes based on the protusion index (PI), wherein the protein shape is approximated as an ellipsoid (Ref for Ellipro and DiscoTope). An ellipsoid with the PI value of 0.8 indicates that 80 per cent of the residues are within the ellipsoid and 20 per cent are outside. All residues that are outside the 80 per cent ellipsoid will have a score of 0.8. Residues with larger scores are associated with greater solvent accessibility. The PI value was set to a score of 0.8. DiscoTope predicts epitopes using 3D structure and half-sphere exposure as a surface measure in a novel spatial neighbourhood definition method. Default values were set for sensitivity (0.47) and specificity (0.75) for selecting the amino acids forming discontinuous epitopes. A sensitivity of 0.47 means that 47 per cent of the epitope residues are predicted as part of the epitopes, while a specificity of 0.75 means that 25 per cent of the non-epitope residues are predicted as part of the epitopes. Outputs from both the methods were combined, and the final regions were mapped on the modelled 3D-structure as the most probable conformational epitopes. In addition, we also predicted N-linked glycosylation sites in the S protein using NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/). The spike proteins were also screened for the presence of potential epitopes presented by major histocompatibility complex (MHC) class I molecules to cytotoxic T lymphocytes (CTLs). The online NetCTL1.2 server (http://www.cbs.dtu.dk/services/NetCTL/) based on machine learning techniques such as artificial neural network (ANN) and support vector machine (SVM) was used to predict the T-cell epitopes. The prediction was made for all the human leucocyte antigen (HLA) supertypes and the available human alleles. The C terminal cleavage, weight of transport-associated protein (TAP) efficiency and threshold for identification were kept as default. VaxiJen v2.0 tool was used to predict the antigenicity of the predicted epitopes (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html). The sequences were further screened to be potential epitopes using the CTLPred online server (http://crdd.osdd.net/raghava/ctlpred/).\nThe ability of the predicted linear B-cell and the T-cell epitopes to mount interferon-gamma (IFN-γ) response was assessed using the IFNepitope (http://crdd.osdd.net/raghava/ifnepitope/index.php)."}

    LitCovid-sentences

    {"project":"LitCovid-sentences","denotations":[{"id":"T48","span":{"begin":0,"end":18},"obj":"Sentence"},{"id":"T49","span":{"begin":19,"end":147},"obj":"Sentence"},{"id":"T50","span":{"begin":148,"end":252},"obj":"Sentence"},{"id":"T51","span":{"begin":253,"end":298},"obj":"Sentence"},{"id":"T52","span":{"begin":299,"end":508},"obj":"Sentence"},{"id":"T53","span":{"begin":509,"end":556},"obj":"Sentence"},{"id":"T54","span":{"begin":557,"end":720},"obj":"Sentence"},{"id":"T55","span":{"begin":721,"end":903},"obj":"Sentence"},{"id":"T56","span":{"begin":904,"end":1052},"obj":"Sentence"},{"id":"T57","span":{"begin":1053,"end":1249},"obj":"Sentence"},{"id":"T58","span":{"begin":1250,"end":1313},"obj":"Sentence"},{"id":"T59","span":{"begin":1314,"end":1403},"obj":"Sentence"},{"id":"T60","span":{"begin":1404,"end":1506},"obj":"Sentence"},{"id":"T61","span":{"begin":1507,"end":1591},"obj":"Sentence"},{"id":"T62","span":{"begin":1592,"end":1732},"obj":"Sentence"},{"id":"T63","span":{"begin":1733,"end":1845},"obj":"Sentence"},{"id":"T64","span":{"begin":1846,"end":1944},"obj":"Sentence"},{"id":"T65","span":{"begin":1945,"end":2151},"obj":"Sentence"},{"id":"T66","span":{"begin":2152,"end":2250},"obj":"Sentence"},{"id":"T67","span":{"begin":2251,"end":2331},"obj":"Sentence"},{"id":"T68","span":{"begin":2332,"end":2447},"obj":"Sentence"},{"id":"T69","span":{"begin":2448,"end":2529},"obj":"Sentence"},{"id":"T70","span":{"begin":2530,"end":2751},"obj":"Sentence"},{"id":"T71","span":{"begin":2752,"end":2929},"obj":"Sentence"},{"id":"T72","span":{"begin":2930,"end":3135},"obj":"Sentence"},{"id":"T73","span":{"begin":3136,"end":3244},"obj":"Sentence"},{"id":"T74","span":{"begin":3245,"end":3403},"obj":"Sentence"},{"id":"T75","span":{"begin":3404,"end":3477},"obj":"Sentence"},{"id":"T76","span":{"begin":3478,"end":3735},"obj":"Sentence"},{"id":"T77","span":{"begin":3736,"end":3856},"obj":"Sentence"},{"id":"T78","span":{"begin":3857,"end":4005},"obj":"Sentence"},{"id":"T79","span":{"begin":4006,"end":4108},"obj":"Sentence"},{"id":"T80","span":{"begin":4109,"end":4222},"obj":"Sentence"},{"id":"T81","span":{"begin":4223,"end":4422},"obj":"Sentence"},{"id":"T82","span":{"begin":4423,"end":4696},"obj":"Sentence"},{"id":"T83","span":{"begin":4697,"end":4848},"obj":"Sentence"},{"id":"T84","span":{"begin":4849,"end":4983},"obj":"Sentence"},{"id":"T85","span":{"begin":4984,"end":5065},"obj":"Sentence"},{"id":"T86","span":{"begin":5066,"end":5144},"obj":"Sentence"},{"id":"T87","span":{"begin":5145,"end":5184},"obj":"Sentence"},{"id":"T88","span":{"begin":5185,"end":5329},"obj":"Sentence"},{"id":"T89","span":{"begin":5330,"end":5461},"obj":"Sentence"},{"id":"T90","span":{"begin":5462,"end":5687},"obj":"Sentence"},{"id":"T91","span":{"begin":5688,"end":5841},"obj":"Sentence"},{"id":"T92","span":{"begin":5842,"end":5988},"obj":"Sentence"},{"id":"T93","span":{"begin":5989,"end":6170},"obj":"Sentence"},{"id":"T94","span":{"begin":6171,"end":6390},"obj":"Sentence"},{"id":"T95","span":{"begin":6391,"end":6500},"obj":"Sentence"},{"id":"T96","span":{"begin":6501,"end":6636},"obj":"Sentence"},{"id":"T97","span":{"begin":6637,"end":6777},"obj":"Sentence"},{"id":"T98","span":{"begin":6778,"end":6911},"obj":"Sentence"},{"id":"T99","span":{"begin":6912,"end":7108},"obj":"Sentence"}],"namespaces":[{"prefix":"_base","uri":"http://pubannotation.org/ontology/tao.owl#"}],"text":"Material \u0026 Methods\nThe clinical samples were referred by the hospital authorities through the Kerala State Health Services for diagnostic purposes. Further samples were received from different parts of India for establishing the presence of SARS-CoV-2.\nDetection of SARS-CoV-2 in suspected samples: Blood and throat swab (TS) specimens were collected from the suspected cases that complied with the case definition of SARS-CoV-2 infection as per the guidelines of the Ministry of Health and Family Welfare14. The TS was collected in viral transport medium. These samples were referred to the ICMR-NIV, Pune, India (which is the national reference laboratory for India, also referred as the government's apex laboratory). As of February 29, 2020, 881 samples of suspected cases referred from different States, with a travel history to Wuhan, China, and other SARS-CoV-2-affected countries, were screened.\nThe viral RNA was extracted from the TS sample using the Magmax RNA extraction kit (Applied Biosystems, USA) as per the manufacturer's instructions. The extracted RNA was immediately used for testing the presence of SARS-CoV-2 using the real-time RT-PCR protocol published by the WHO12 for the detection of RdRp (1), RdRp (2), E gene and N gene. RNase P gene was used as the internal control for the analysis. Confirmatory laboratory tests were performed as per the WHO-recommended test protocols13. These samples were also sequenced using the NGS approach to retrieve the complete genome of the virus.\nNGS of SARS-CoV-2 from India - Phylogenetic analysis and molecular characterization: The total RNA of three positive TS specimens from Kerala, was extracted from 250-300 μl of the SARS-CoV-2 real-time RT-PCR positive samples. QIAamp Viral RNA extraction kit (QIAGEN, Hilden, Germany) was used according to the manufacturer's instructions. The extracted RNA was further quantified using a Qubit RNA High-Sensitivity kit (Invitrogen, USA). RNA libraries were prepared as per the earlier-defined protocol and quantified using KAPA Library Quantification Kit (Kapa Biosystems, Roche Diagnostics Corporation, USA) as per the manufacturer's protocol. Further, individual libraries were neutralized and loaded on the Miniseq platform (Illumina, USA). The detailed protocols for the steps undertaken have been published earlier1516. The data generated from the machine were analyzed using CLC genomics workbench version 11.0 (CLC, QIAGEN, Germany). Reference-based mapping was performed to retrieve the sequence of the SARS-CoV-2.\nFull-length genome sequences of SARS-CoV-2 were downloaded from the GISAID database17 (Supplementary Table I (available from http://www.ijmr.org.in/articles/2020/151/2/images/IndianJMedRes_2020_151_2_200_281471_sm5.pdf)). Multiple sequence alignment was performed using the MEGA software version 7.018 with retrieved sequences from two of the three positive cases and the available GISAID sequences. A phylogenetic tree was generated using the neighbour joining method and the Kimura-2-parameter as the nucleotide (nt) substitution model with 1000 bootstrap replications as implemented in MEGA software18. Per cent nucleotide divergence and amino acid (aa) divergence were calculated using the p-distance method18. Mutations specific to the Indian SARS-CoV-2 viruses were identified by comparing the coding regions with respect to the SARS-CoV-2, Wuhan, China (Wuhan hu-1).\nThree-dimensional (3D) model of the spike protein and epitope prediction: The pre-fusion structure of the Indian case 1 SARS-CoV-2 spike (S) glycoprotein was modelled using the Swiss-Model server (https://swissmodel.expasy.org/interactive) and the corresponding S protein of Wuhan-Hu-1 (6VSB.PDB) as the template (99.97% identity). Sequential (linear) B-cell epitopes were predicted using BepiPred-2.0 server (http://www.cbs.dtu.dk/services/BepiPred/). The ABCpred prediction tool (http://crdd.osdd.net/raghava/abcpred/) was also used to identify the B-cell epitopes in the Indian SARS-CoV-2 sequence. The epitope prediction probability of \u003e0.8 was set to increase the specificity of the peptide stretch. The overlapping epitopes predicted by BepiPred-2.0 online server and the ABCpred prediction tool were identified. The antigenicity of the shortlisted peptide sequences was further predicted using the Vaxijen online server (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) with a default threshold of 0.4.\nDiscontinuous epitopes on the modelled structure of the Indian case 1 SARS-CoV-2 spike protein were predicted using the online servers, Ellipro (http://tools.iedb.org/ellipro/) and DiscoTope 2.0 (http://tools.iedb.org/discotope/), integrated in the Immune Epitope Database. Ellipro predicts epitopes based on the protusion index (PI), wherein the protein shape is approximated as an ellipsoid (Ref for Ellipro and DiscoTope). An ellipsoid with the PI value of 0.8 indicates that 80 per cent of the residues are within the ellipsoid and 20 per cent are outside. All residues that are outside the 80 per cent ellipsoid will have a score of 0.8. Residues with larger scores are associated with greater solvent accessibility. The PI value was set to a score of 0.8. DiscoTope predicts epitopes using 3D structure and half-sphere exposure as a surface measure in a novel spatial neighbourhood definition method. Default values were set for sensitivity (0.47) and specificity (0.75) for selecting the amino acids forming discontinuous epitopes. A sensitivity of 0.47 means that 47 per cent of the epitope residues are predicted as part of the epitopes, while a specificity of 0.75 means that 25 per cent of the non-epitope residues are predicted as part of the epitopes. Outputs from both the methods were combined, and the final regions were mapped on the modelled 3D-structure as the most probable conformational epitopes. In addition, we also predicted N-linked glycosylation sites in the S protein using NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/). The spike proteins were also screened for the presence of potential epitopes presented by major histocompatibility complex (MHC) class I molecules to cytotoxic T lymphocytes (CTLs). The online NetCTL1.2 server (http://www.cbs.dtu.dk/services/NetCTL/) based on machine learning techniques such as artificial neural network (ANN) and support vector machine (SVM) was used to predict the T-cell epitopes. The prediction was made for all the human leucocyte antigen (HLA) supertypes and the available human alleles. The C terminal cleavage, weight of transport-associated protein (TAP) efficiency and threshold for identification were kept as default. VaxiJen v2.0 tool was used to predict the antigenicity of the predicted epitopes (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html). The sequences were further screened to be potential epitopes using the CTLPred online server (http://crdd.osdd.net/raghava/ctlpred/).\nThe ability of the predicted linear B-cell and the T-cell epitopes to mount interferon-gamma (IFN-γ) response was assessed using the IFNepitope (http://crdd.osdd.net/raghava/ifnepitope/index.php)."}

    2_test

    {"project":"2_test","denotations":[{"id":"32242873-29664366-47036564","span":{"begin":2326,"end":2328},"obj":"29664366"},{"id":"32242873-31565258-47036565","span":{"begin":2613,"end":2615},"obj":"31565258"},{"id":"32242873-27004904-47036566","span":{"begin":2828,"end":2831},"obj":"27004904"},{"id":"32242873-27004904-47036567","span":{"begin":3132,"end":3134},"obj":"27004904"},{"id":"32242873-27004904-47036568","span":{"begin":3241,"end":3243},"obj":"27004904"},{"id":"T53539","span":{"begin":2326,"end":2328},"obj":"29664366"},{"id":"T62421","span":{"begin":2613,"end":2615},"obj":"31565258"},{"id":"T79581","span":{"begin":2828,"end":2831},"obj":"27004904"},{"id":"T13164","span":{"begin":3132,"end":3134},"obj":"27004904"},{"id":"T93390","span":{"begin":3241,"end":3243},"obj":"27004904"}],"text":"Material \u0026 Methods\nThe clinical samples were referred by the hospital authorities through the Kerala State Health Services for diagnostic purposes. Further samples were received from different parts of India for establishing the presence of SARS-CoV-2.\nDetection of SARS-CoV-2 in suspected samples: Blood and throat swab (TS) specimens were collected from the suspected cases that complied with the case definition of SARS-CoV-2 infection as per the guidelines of the Ministry of Health and Family Welfare14. The TS was collected in viral transport medium. These samples were referred to the ICMR-NIV, Pune, India (which is the national reference laboratory for India, also referred as the government's apex laboratory). As of February 29, 2020, 881 samples of suspected cases referred from different States, with a travel history to Wuhan, China, and other SARS-CoV-2-affected countries, were screened.\nThe viral RNA was extracted from the TS sample using the Magmax RNA extraction kit (Applied Biosystems, USA) as per the manufacturer's instructions. The extracted RNA was immediately used for testing the presence of SARS-CoV-2 using the real-time RT-PCR protocol published by the WHO12 for the detection of RdRp (1), RdRp (2), E gene and N gene. RNase P gene was used as the internal control for the analysis. Confirmatory laboratory tests were performed as per the WHO-recommended test protocols13. These samples were also sequenced using the NGS approach to retrieve the complete genome of the virus.\nNGS of SARS-CoV-2 from India - Phylogenetic analysis and molecular characterization: The total RNA of three positive TS specimens from Kerala, was extracted from 250-300 μl of the SARS-CoV-2 real-time RT-PCR positive samples. QIAamp Viral RNA extraction kit (QIAGEN, Hilden, Germany) was used according to the manufacturer's instructions. The extracted RNA was further quantified using a Qubit RNA High-Sensitivity kit (Invitrogen, USA). RNA libraries were prepared as per the earlier-defined protocol and quantified using KAPA Library Quantification Kit (Kapa Biosystems, Roche Diagnostics Corporation, USA) as per the manufacturer's protocol. Further, individual libraries were neutralized and loaded on the Miniseq platform (Illumina, USA). The detailed protocols for the steps undertaken have been published earlier1516. The data generated from the machine were analyzed using CLC genomics workbench version 11.0 (CLC, QIAGEN, Germany). Reference-based mapping was performed to retrieve the sequence of the SARS-CoV-2.\nFull-length genome sequences of SARS-CoV-2 were downloaded from the GISAID database17 (Supplementary Table I (available from http://www.ijmr.org.in/articles/2020/151/2/images/IndianJMedRes_2020_151_2_200_281471_sm5.pdf)). Multiple sequence alignment was performed using the MEGA software version 7.018 with retrieved sequences from two of the three positive cases and the available GISAID sequences. A phylogenetic tree was generated using the neighbour joining method and the Kimura-2-parameter as the nucleotide (nt) substitution model with 1000 bootstrap replications as implemented in MEGA software18. Per cent nucleotide divergence and amino acid (aa) divergence were calculated using the p-distance method18. Mutations specific to the Indian SARS-CoV-2 viruses were identified by comparing the coding regions with respect to the SARS-CoV-2, Wuhan, China (Wuhan hu-1).\nThree-dimensional (3D) model of the spike protein and epitope prediction: The pre-fusion structure of the Indian case 1 SARS-CoV-2 spike (S) glycoprotein was modelled using the Swiss-Model server (https://swissmodel.expasy.org/interactive) and the corresponding S protein of Wuhan-Hu-1 (6VSB.PDB) as the template (99.97% identity). Sequential (linear) B-cell epitopes were predicted using BepiPred-2.0 server (http://www.cbs.dtu.dk/services/BepiPred/). The ABCpred prediction tool (http://crdd.osdd.net/raghava/abcpred/) was also used to identify the B-cell epitopes in the Indian SARS-CoV-2 sequence. The epitope prediction probability of \u003e0.8 was set to increase the specificity of the peptide stretch. The overlapping epitopes predicted by BepiPred-2.0 online server and the ABCpred prediction tool were identified. The antigenicity of the shortlisted peptide sequences was further predicted using the Vaxijen online server (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) with a default threshold of 0.4.\nDiscontinuous epitopes on the modelled structure of the Indian case 1 SARS-CoV-2 spike protein were predicted using the online servers, Ellipro (http://tools.iedb.org/ellipro/) and DiscoTope 2.0 (http://tools.iedb.org/discotope/), integrated in the Immune Epitope Database. Ellipro predicts epitopes based on the protusion index (PI), wherein the protein shape is approximated as an ellipsoid (Ref for Ellipro and DiscoTope). An ellipsoid with the PI value of 0.8 indicates that 80 per cent of the residues are within the ellipsoid and 20 per cent are outside. All residues that are outside the 80 per cent ellipsoid will have a score of 0.8. Residues with larger scores are associated with greater solvent accessibility. The PI value was set to a score of 0.8. DiscoTope predicts epitopes using 3D structure and half-sphere exposure as a surface measure in a novel spatial neighbourhood definition method. Default values were set for sensitivity (0.47) and specificity (0.75) for selecting the amino acids forming discontinuous epitopes. A sensitivity of 0.47 means that 47 per cent of the epitope residues are predicted as part of the epitopes, while a specificity of 0.75 means that 25 per cent of the non-epitope residues are predicted as part of the epitopes. Outputs from both the methods were combined, and the final regions were mapped on the modelled 3D-structure as the most probable conformational epitopes. In addition, we also predicted N-linked glycosylation sites in the S protein using NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/). The spike proteins were also screened for the presence of potential epitopes presented by major histocompatibility complex (MHC) class I molecules to cytotoxic T lymphocytes (CTLs). The online NetCTL1.2 server (http://www.cbs.dtu.dk/services/NetCTL/) based on machine learning techniques such as artificial neural network (ANN) and support vector machine (SVM) was used to predict the T-cell epitopes. The prediction was made for all the human leucocyte antigen (HLA) supertypes and the available human alleles. The C terminal cleavage, weight of transport-associated protein (TAP) efficiency and threshold for identification were kept as default. VaxiJen v2.0 tool was used to predict the antigenicity of the predicted epitopes (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html). The sequences were further screened to be potential epitopes using the CTLPred online server (http://crdd.osdd.net/raghava/ctlpred/).\nThe ability of the predicted linear B-cell and the T-cell epitopes to mount interferon-gamma (IFN-γ) response was assessed using the IFNepitope (http://crdd.osdd.net/raghava/ifnepitope/index.php)."}