PMC:7443692 / 9613-13789 JSONTXT

Annnotations TAB JSON ListView MergeView

    LitCovid-sample-MedDRA

    {"project":"LitCovid-sample-MedDRA","denotations":[{"id":"T10","span":{"begin":2953,"end":2963},"obj":"http://purl.bioontology.org/ontology/MEDDRA/10022891"},{"id":"T11","span":{"begin":3144,"end":3150},"obj":"http://purl.bioontology.org/ontology/MEDDRA/10022891"}],"attributes":[{"id":"A10","pred":"meddra_id","subj":"T10","obj":"http://purl.bioontology.org/ontology/MEDDRA/10069374"},{"id":"A11","pred":"meddra_id","subj":"T11","obj":"http://purl.bioontology.org/ontology/MEDDRA/10047890"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    LitCovid-sample-CHEBI

    {"project":"LitCovid-sample-CHEBI","denotations":[{"id":"T38","span":{"begin":67,"end":79},"obj":"Chemical"},{"id":"T39","span":{"begin":557,"end":560},"obj":"Chemical"},{"id":"T40","span":{"begin":703,"end":711},"obj":"Chemical"},{"id":"T41","span":{"begin":1012,"end":1019},"obj":"Chemical"},{"id":"T42","span":{"begin":1054,"end":1063},"obj":"Chemical"},{"id":"T43","span":{"begin":1104,"end":1117},"obj":"Chemical"},{"id":"T44","span":{"begin":1190,"end":1197},"obj":"Chemical"},{"id":"T45","span":{"begin":1239,"end":1242},"obj":"Chemical"},{"id":"T48","span":{"begin":1346,"end":1349},"obj":"Chemical"},{"id":"T51","span":{"begin":1368,"end":1377},"obj":"Chemical"},{"id":"T52","span":{"begin":1391,"end":1394},"obj":"Chemical"},{"id":"T55","span":{"begin":1518,"end":1521},"obj":"Chemical"},{"id":"T58","span":{"begin":1531,"end":1534},"obj":"Chemical"},{"id":"T60","span":{"begin":1573,"end":1576},"obj":"Chemical"},{"id":"T62","span":{"begin":1586,"end":1589},"obj":"Chemical"},{"id":"T63","span":{"begin":1650,"end":1660},"obj":"Chemical"},{"id":"T65","span":{"begin":1683,"end":1693},"obj":"Chemical"},{"id":"T67","span":{"begin":1831,"end":1834},"obj":"Chemical"},{"id":"T70","span":{"begin":1856,"end":1867},"obj":"Chemical"},{"id":"T71","span":{"begin":1920,"end":1923},"obj":"Chemical"},{"id":"T73","span":{"begin":1928,"end":1931},"obj":"Chemical"},{"id":"T74","span":{"begin":2098,"end":2109},"obj":"Chemical"},{"id":"T75","span":{"begin":2608,"end":2620},"obj":"Chemical"},{"id":"T76","span":{"begin":2771,"end":2778},"obj":"Chemical"},{"id":"T77","span":{"begin":3028,"end":3031},"obj":"Chemical"},{"id":"T78","span":{"begin":3083,"end":3090},"obj":"Chemical"},{"id":"T79","span":{"begin":3381,"end":3388},"obj":"Chemical"},{"id":"T80","span":{"begin":3407,"end":3416},"obj":"Chemical"},{"id":"T81","span":{"begin":3440,"end":3448},"obj":"Chemical"},{"id":"T82","span":{"begin":3650,"end":3658},"obj":"Chemical"},{"id":"T83","span":{"begin":3721,"end":3728},"obj":"Chemical"},{"id":"T84","span":{"begin":3776,"end":3783},"obj":"Chemical"},{"id":"T85","span":{"begin":3839,"end":3843},"obj":"Chemical"},{"id":"T86","span":{"begin":3897,"end":3904},"obj":"Chemical"},{"id":"T87","span":{"begin":3905,"end":3909},"obj":"Chemical"},{"id":"T88","span":{"begin":3933,"end":3940},"obj":"Chemical"},{"id":"T89","span":{"begin":3950,"end":3954},"obj":"Chemical"},{"id":"T90","span":{"begin":4037,"end":4041},"obj":"Chemical"},{"id":"T91","span":{"begin":4113,"end":4119},"obj":"Chemical"},{"id":"T92","span":{"begin":4163,"end":4175},"obj":"Chemical"}],"attributes":[{"id":"A60","pred":"chebi_id","subj":"T60","obj":"http://purl.obolibrary.org/obo/CHEBI_17115"},{"id":"A61","pred":"chebi_id","subj":"T60","obj":"http://purl.obolibrary.org/obo/CHEBI_29999"},{"id":"A55","pred":"chebi_id","subj":"T55","obj":"http://purl.obolibrary.org/obo/CHEBI_29950"},{"id":"A56","pred":"chebi_id","subj":"T55","obj":"http://purl.obolibrary.org/obo/CHEBI_15356"},{"id":"A57","pred":"chebi_id","subj":"T55","obj":"http://purl.obolibrary.org/obo/CHEBI_17561"},{"id":"A44","pred":"chebi_id","subj":"T44","obj":"http://purl.obolibrary.org/obo/CHEBI_16670"},{"id":"A71","pred":"chebi_id","subj":"T71","obj":"http://purl.obolibrary.org/obo/CHEBI_17115"},{"id":"A72","pred":"chebi_id","subj":"T71","obj":"http://purl.obolibrary.org/obo/CHEBI_29999"},{"id":"A40","pred":"chebi_id","subj":"T40","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A51","pred":"chebi_id","subj":"T51","obj":"http://purl.obolibrary.org/obo/CHEBI_48343"},{"id":"A73","pred":"chebi_id","subj":"T73","obj":"http://purl.obolibrary.org/obo/CHEBI_30011"},{"id":"A42","pred":"chebi_id","subj":"T42","obj":"http://purl.obolibrary.org/obo/CHEBI_28300"},{"id":"A82","pred":"chebi_id","subj":"T82","obj":"http://purl.obolibrary.org/obo/CHEBI_15356"},{"id":"A58","pred":"chebi_id","subj":"T58","obj":"http://purl.obolibrary.org/obo/CHEBI_16414"},{"id":"A59","pred":"chebi_id","subj":"T58","obj":"http://purl.obolibrary.org/obo/CHEBI_30015"},{"id":"A87","pred":"chebi_id","subj":"T87","obj":"http://purl.obolibrary.org/obo/CHEBI_24870"},{"id":"A84","pred":"chebi_id","subj":"T84","obj":"http://purl.obolibrary.org/obo/CHEBI_16670"},{"id":"A89","pred":"chebi_id","subj":"T89","obj":"http://purl.obolibrary.org/obo/CHEBI_24870"},{"id":"A48","pred":"chebi_id","subj":"T48","obj":"http://purl.obolibrary.org/obo/CHEBI_29950"},{"id":"A49","pred":"chebi_id","subj":"T48","obj":"http://purl.obolibrary.org/obo/CHEBI_15356"},{"id":"A50","pred":"chebi_id","subj":"T48","obj":"http://purl.obolibrary.org/obo/CHEBI_17561"},{"id":"A39","pred":"chebi_id","subj":"T39","obj":"http://purl.obolibrary.org/obo/CHEBI_8984"},{"id":"A65","pred":"chebi_id","subj":"T65","obj":"http://purl.obolibrary.org/obo/CHEBI_16811"},{"id":"A66","pred":"chebi_id","subj":"T65","obj":"http://purl.obolibrary.org/obo/CHEBI_64558"},{"id":"A83","pred":"chebi_id","subj":"T83","obj":"http://purl.obolibrary.org/obo/CHEBI_16670"},{"id":"A80","pred":"chebi_id","subj":"T80","obj":"http://purl.obolibrary.org/obo/CHEBI_28300"},{"id":"A74","pred":"chebi_id","subj":"T74","obj":"http://purl.obolibrary.org/obo/CHEBI_36976"},{"id":"A52","pred":"chebi_id","subj":"T52","obj":"http://purl.obolibrary.org/obo/CHEBI_29950"},{"id":"A53","pred":"chebi_id","subj":"T52","obj":"http://purl.obolibrary.org/obo/CHEBI_15356"},{"id":"A54","pred":"chebi_id","subj":"T52","obj":"http://purl.obolibrary.org/obo/CHEBI_17561"},{"id":"A41","pred":"chebi_id","subj":"T41","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A62","pred":"chebi_id","subj":"T62","obj":"http://purl.obolibrary.org/obo/CHEBI_30011"},{"id":"A88","pred":"chebi_id","subj":"T88","obj":"http://purl.obolibrary.org/obo/CHEBI_16670"},{"id":"A86","pred":"chebi_id","subj":"T86","obj":"http://purl.obolibrary.org/obo/CHEBI_29412"},{"id":"A92","pred":"chebi_id","subj":"T92","obj":"http://purl.obolibrary.org/obo/CHEBI_24396"},{"id":"A45","pred":"chebi_id","subj":"T45","obj":"http://purl.obolibrary.org/obo/CHEBI_17196"},{"id":"A46","pred":"chebi_id","subj":"T45","obj":"http://purl.obolibrary.org/obo/CHEBI_22653"},{"id":"A47","pred":"chebi_id","subj":"T45","obj":"http://purl.obolibrary.org/obo/CHEBI_50347"},{"id":"A75","pred":"chebi_id","subj":"T75","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A78","pred":"chebi_id","subj":"T78","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A38","pred":"chebi_id","subj":"T38","obj":"http://purl.obolibrary.org/obo/CHEBI_17089"},{"id":"A91","pred":"chebi_id","subj":"T91","obj":"http://purl.obolibrary.org/obo/CHEBI_33984"},{"id":"A70","pred":"chebi_id","subj":"T70","obj":"http://purl.obolibrary.org/obo/CHEBI_33709"},{"id":"A81","pred":"chebi_id","subj":"T81","obj":"http://purl.obolibrary.org/obo/CHEBI_59520"},{"id":"A85","pred":"chebi_id","subj":"T85","obj":"http://purl.obolibrary.org/obo/CHEBI_24870"},{"id":"A43","pred":"chebi_id","subj":"T43","obj":"http://purl.obolibrary.org/obo/CHEBI_133469"},{"id":"A63","pred":"chebi_id","subj":"T63","obj":"http://purl.obolibrary.org/obo/CHEBI_16811"},{"id":"A64","pred":"chebi_id","subj":"T63","obj":"http://purl.obolibrary.org/obo/CHEBI_64558"},{"id":"A77","pred":"chebi_id","subj":"T77","obj":"http://purl.obolibrary.org/obo/CHEBI_8984"},{"id":"A67","pred":"chebi_id","subj":"T67","obj":"http://purl.obolibrary.org/obo/CHEBI_16044"},{"id":"A68","pred":"chebi_id","subj":"T67","obj":"http://purl.obolibrary.org/obo/CHEBI_16643"},{"id":"A69","pred":"chebi_id","subj":"T67","obj":"http://purl.obolibrary.org/obo/CHEBI_16811"},{"id":"A76","pred":"chebi_id","subj":"T76","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A79","pred":"chebi_id","subj":"T79","obj":"http://purl.obolibrary.org/obo/CHEBI_36080"},{"id":"A90","pred":"chebi_id","subj":"T90","obj":"http://purl.obolibrary.org/obo/CHEBI_24870"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    LitCovid-sample-PD-NCBITaxon

    {"project":"LitCovid-sample-PD-NCBITaxon","denotations":[{"id":"T40","span":{"begin":50,"end":60},"obj":"Species"},{"id":"T41","span":{"begin":99,"end":104},"obj":"Species"},{"id":"T42","span":{"begin":154,"end":164},"obj":"Species"},{"id":"T43","span":{"begin":263,"end":268},"obj":"Species"},{"id":"T44","span":{"begin":2115,"end":2126},"obj":"Species"},{"id":"T45","span":{"begin":2151,"end":2161},"obj":"Species"},{"id":"T46","span":{"begin":2591,"end":2601},"obj":"Species"},{"id":"T47","span":{"begin":2650,"end":2655},"obj":"Species"},{"id":"T48","span":{"begin":2678,"end":2688},"obj":"Species"},{"id":"T49","span":{"begin":2713,"end":2718},"obj":"Species"},{"id":"T50","span":{"begin":3070,"end":3080},"obj":"Species"},{"id":"T51","span":{"begin":3117,"end":3122},"obj":"Species"}],"attributes":[{"id":"A46","pred":"ncbi_taxonomy_id","subj":"T46","obj":"NCBItxid:2697049"},{"id":"A49","pred":"ncbi_taxonomy_id","subj":"T49","obj":"NCBItxid:9606"},{"id":"A44","pred":"ncbi_taxonomy_id","subj":"T44","obj":"NCBItxid:186672"},{"id":"A51","pred":"ncbi_taxonomy_id","subj":"T51","obj":"NCBItxid:9606"},{"id":"A47","pred":"ncbi_taxonomy_id","subj":"T47","obj":"NCBItxid:9606"},{"id":"A43","pred":"ncbi_taxonomy_id","subj":"T43","obj":"NCBItxid:9606"},{"id":"A42","pred":"ncbi_taxonomy_id","subj":"T42","obj":"NCBItxid:2697049"},{"id":"A40","pred":"ncbi_taxonomy_id","subj":"T40","obj":"NCBItxid:2697049"},{"id":"A50","pred":"ncbi_taxonomy_id","subj":"T50","obj":"NCBItxid:2697049"},{"id":"A48","pred":"ncbi_taxonomy_id","subj":"T48","obj":"NCBItxid:2697049"},{"id":"A41","pred":"ncbi_taxonomy_id","subj":"T41","obj":"NCBItxid:9606"},{"id":"A45","pred":"ncbi_taxonomy_id","subj":"T45","obj":"NCBItxid:2697049"}],"namespaces":[{"prefix":"NCBItxid","uri":"http://purl.bioontology.org/ontology/NCBITAXON/"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    LitCovid-sample-sentences

    {"project":"LitCovid-sample-sentences","denotations":[{"id":"T49","span":{"begin":0,"end":109},"obj":"Sentence"},{"id":"T50","span":{"begin":110,"end":616},"obj":"Sentence"},{"id":"T51","span":{"begin":617,"end":745},"obj":"Sentence"},{"id":"T52","span":{"begin":746,"end":945},"obj":"Sentence"},{"id":"T53","span":{"begin":946,"end":1174},"obj":"Sentence"},{"id":"T54","span":{"begin":1175,"end":1422},"obj":"Sentence"},{"id":"T55","span":{"begin":1423,"end":1756},"obj":"Sentence"},{"id":"T56","span":{"begin":1757,"end":1976},"obj":"Sentence"},{"id":"T57","span":{"begin":1977,"end":2218},"obj":"Sentence"},{"id":"T58","span":{"begin":2219,"end":2340},"obj":"Sentence"},{"id":"T59","span":{"begin":2341,"end":2546},"obj":"Sentence"},{"id":"T60","span":{"begin":2547,"end":2660},"obj":"Sentence"},{"id":"T61","span":{"begin":2661,"end":2724},"obj":"Sentence"},{"id":"T62","span":{"begin":2725,"end":2831},"obj":"Sentence"},{"id":"T63","span":{"begin":2832,"end":2918},"obj":"Sentence"},{"id":"T64","span":{"begin":2919,"end":3128},"obj":"Sentence"},{"id":"T65","span":{"begin":3129,"end":3159},"obj":"Sentence"},{"id":"T66","span":{"begin":3160,"end":3329},"obj":"Sentence"},{"id":"T67","span":{"begin":3330,"end":3422},"obj":"Sentence"},{"id":"T68","span":{"begin":3423,"end":3584},"obj":"Sentence"},{"id":"T69","span":{"begin":3585,"end":3621},"obj":"Sentence"},{"id":"T70","span":{"begin":3622,"end":3741},"obj":"Sentence"},{"id":"T71","span":{"begin":3742,"end":3854},"obj":"Sentence"},{"id":"T72","span":{"begin":3855,"end":4008},"obj":"Sentence"},{"id":"T73","span":{"begin":4009,"end":4176},"obj":"Sentence"}],"namespaces":[{"prefix":"_base","uri":"http://pubannotation.org/ontology/tao.owl#"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    LitCovid-sample-PD-MONDO

    {"project":"LitCovid-sample-PD-MONDO","denotations":[{"id":"T30","span":{"begin":50,"end":60},"obj":"Disease"},{"id":"T31","span":{"begin":154,"end":164},"obj":"Disease"},{"id":"T32","span":{"begin":2151,"end":2161},"obj":"Disease"},{"id":"T33","span":{"begin":2591,"end":2601},"obj":"Disease"},{"id":"T34","span":{"begin":2678,"end":2688},"obj":"Disease"},{"id":"T35","span":{"begin":3070,"end":3080},"obj":"Disease"}],"attributes":[{"id":"A30","pred":"mondo_id","subj":"T30","obj":"http://purl.obolibrary.org/obo/MONDO_0100096"},{"id":"A31","pred":"mondo_id","subj":"T31","obj":"http://purl.obolibrary.org/obo/MONDO_0100096"},{"id":"A32","pred":"mondo_id","subj":"T32","obj":"http://purl.obolibrary.org/obo/MONDO_0100096"},{"id":"A33","pred":"mondo_id","subj":"T33","obj":"http://purl.obolibrary.org/obo/MONDO_0100096"},{"id":"A34","pred":"mondo_id","subj":"T34","obj":"http://purl.obolibrary.org/obo/MONDO_0100096"},{"id":"A35","pred":"mondo_id","subj":"T35","obj":"http://purl.obolibrary.org/obo/MONDO_0100096"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    LitCovid-sample-UniProt

    {"project":"LitCovid-sample-UniProt","denotations":[{"id":"T1191","span":{"begin":61,"end":79},"obj":"Protein"},{"id":"T1292","span":{"begin":67,"end":79},"obj":"Protein"},{"id":"T1358","span":{"begin":105,"end":109},"obj":"Protein"},{"id":"T1359","span":{"begin":269,"end":273},"obj":"Protein"},{"id":"T1360","span":{"begin":740,"end":743},"obj":"Protein"},{"id":"T1362","span":{"begin":814,"end":818},"obj":"Protein"},{"id":"T1363","span":{"begin":1010,"end":1019},"obj":"Protein"},{"id":"T1396","span":{"begin":1024,"end":1028},"obj":"Protein"},{"id":"T1397","span":{"begin":1179,"end":1197},"obj":"Protein"},{"id":"T1412","span":{"begin":2602,"end":2620},"obj":"Protein"},{"id":"T1513","span":{"begin":2608,"end":2620},"obj":"Protein"},{"id":"T1579","span":{"begin":2656,"end":2660},"obj":"Protein"},{"id":"T1580","span":{"begin":2719,"end":2723},"obj":"Protein"},{"id":"T1581","span":{"begin":3081,"end":3090},"obj":"Protein"},{"id":"T1614","span":{"begin":3123,"end":3127},"obj":"Protein"},{"id":"T1615","span":{"begin":3765,"end":3783},"obj":"Protein"}],"attributes":[{"id":"A1191","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q9QAS2"},{"id":"A1192","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q9QAR5"},{"id":"A1193","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q9QAQ8"},{"id":"A1194","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q9IW04"},{"id":"A1195","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q9IKD1"},{"id":"A1196","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q990M4"},{"id":"A1197","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q990M3"},{"id":"A1198","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q990M2"},{"id":"A1199","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q990M1"},{"id":"A1200","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q91AV1"},{"id":"A1201","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q91A26"},{"id":"A1202","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q8V436"},{"id":"A1203","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q8JSP8"},{"id":"A1204","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q8BB25"},{"id":"A1205","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q86623"},{"id":"A1206","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q85088"},{"id":"A1207","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q85087"},{"id":"A1208","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q80BV6"},{"id":"A1209","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q7TFB1"},{"id":"A1210","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q7TFA2"},{"id":"A1211","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q7TA19"},{"id":"A1212","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q7T6T3"},{"id":"A1213","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q7T696"},{"id":"A1214","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q77NC4"},{"id":"A1215","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q6TNF9"},{"id":"A1216","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q6R1L7"},{"id":"A1217","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q6QU82"},{"id":"A1218","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q6Q1S2"},{"id":"A1219","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q696Q6"},{"id":"A1220","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q66291"},{"id":"A1221","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q66290"},{"id":"A1222","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q66199"},{"id":"A1223","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q66177"},{"id":"A1224","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q66176"},{"id":"A1225","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q66174"},{"id":"A1226","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q65984"},{"id":"A1227","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q5MQD0"},{"id":"A1228","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q5I5X9"},{"id":"A1229","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q5DIY0"},{"id":"A1230","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q5DIX9"},{"id":"A1231","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q5DIX8"},{"id":"A1232","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q5DIX7"},{"id":"A1233","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q52PA3"},{"id":"A1234","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q4ZJS1"},{"id":"A1235","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q4U5G0"},{"id":"A1236","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q3T8J0"},{"id":"A1237","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q3LZX1"},{"id":"A1238","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q3I5J5"},{"id":"A1239","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q14EB0"},{"id":"A1240","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q0ZME7"},{"id":"A1241","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q0Q4F2"},{"id":"A1242","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q0Q475"},{"id":"A1243","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q0Q466"},{"id":"A1244","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q0GNB8"},{"id":"A1245","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q02385"},{"id":"A1246","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q02167"},{"id":"A1247","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q01977"},{"id":"A1248","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/Q008X4"},{"id":"A1249","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P89344"},{"id":"A1250","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P89343"},{"id":"A1251","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P89342"},{"id":"A1252","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P59594"},{"id":"A1253","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P36334"},{"id":"A1254","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P36300"},{"id":"A1255","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P33470"},{"id":"A1256","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P30208"},{"id":"A1257","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P30207"},{"id":"A1258","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P30206"},{"id":"A1259","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P27662"},{"id":"A1260","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P27655"},{"id":"A1261","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P27277"},{"id":"A1262","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P25194"},{"id":"A1263","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P25193"},{"id":"A1264","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P25192"},{"id":"A1265","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P25191"},{"id":"A1266","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P25190"},{"id":"A1267","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P24413"},{"id":"A1268","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P23052"},{"id":"A1269","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P22432"},{"id":"A1270","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P18450"},{"id":"A1271","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P17662"},{"id":"A1272","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P15777"},{"id":"A1273","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P15423"},{"id":"A1274","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P12722"},{"id":"A1275","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P12651"},{"id":"A1276","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P12650"},{"id":"A1277","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P12647"},{"id":"A1278","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P11225"},{"id":"A1279","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P11224"},{"id":"A1280","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P11223"},{"id":"A1281","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P10033"},{"id":"A1282","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P0DTC2"},{"id":"A1283","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P07946"},{"id":"A1284","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P05135"},{"id":"A1285","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/P05134"},{"id":"A1286","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/O90304"},{"id":"A1287","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/O39227"},{"id":"A1288","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/K9N5Q8"},{"id":"A1289","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/A3EXG6"},{"id":"A1290","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/A3EXD0"},{"id":"A1291","pred":"uniprot_id","subj":"T1191","obj":"https://www.uniprot.org/uniprot/A3EX94"},{"id":"A1292","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q82706"},{"id":"A1293","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q82683"},{"id":"A1294","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q82020"},{"id":"A1295","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q787B5"},{"id":"A1296","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q77SK0"},{"id":"A1297","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q77N36"},{"id":"A1298","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q76G52"},{"id":"A1299","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q75T09"},{"id":"A1300","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q6X1D5"},{"id":"A1301","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q6X1D1"},{"id":"A1302","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q6TYA0"},{"id":"A1303","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q6E0W7"},{"id":"A1304","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q66T62"},{"id":"A1305","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5VKP3"},{"id":"A1306","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5VKN9"},{"id":"A1307","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5K2K4"},{"id":"A1308","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5IX93"},{"id":"A1309","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5IX92"},{"id":"A1310","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5IX91"},{"id":"A1311","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5IX90"},{"id":"A1312","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5IX89"},{"id":"A1313","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5IX88"},{"id":"A1314","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5IX87"},{"id":"A1315","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q5GA86"},{"id":"A1316","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q58FH1"},{"id":"A1317","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q4VKV3"},{"id":"A1318","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q4F900"},{"id":"A1319","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q49LL3"},{"id":"A1320","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q49IU2"},{"id":"A1321","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q49IU1"},{"id":"A1322","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q49IT9"},{"id":"A1323","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q49IT8"},{"id":"A1324","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q49AV0"},{"id":"A1325","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q0GBY1"},{"id":"A1326","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q0GBX6"},{"id":"A1327","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/Q08089"},{"id":"A1328","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P32595"},{"id":"A1329","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P32550"},{"id":"A1330","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P19462"},{"id":"A1331","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P16288"},{"id":"A1332","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P15199"},{"id":"A1333","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P13180"},{"id":"A1334","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P0C572"},{"id":"A1335","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P08667"},{"id":"A1336","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P08163"},{"id":"A1337","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P07923"},{"id":"A1338","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P04884"},{"id":"A1339","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P04883"},{"id":"A1340","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P04882"},{"id":"A1341","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P03524"},{"id":"A1342","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/P03522"},{"id":"A1343","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/O92284"},{"id":"A1344","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/O56677"},{"id":"A1345","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/O10236"},{"id":"A1346","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/J7HBH4"},{"id":"A1347","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/D8V075"},{"id":"A1348","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/A7WNB3"},{"id":"A1349","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/A4UHQ6"},{"id":"A1350","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/A4UHQ1"},{"id":"A1351","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/A3RM22"},{"id":"A1352","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/A3F5R8"},{"id":"A1353","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/A3F5R3"},{"id":"A1354","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/A3F5Q8"},{"id":"A1355","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/A3F5N3"},{"id":"A1356","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/A3F5M3"},{"id":"A1357","pred":"uniprot_id","subj":"T1292","obj":"https://www.uniprot.org/uniprot/A3F5L8"},{"id":"A1358","pred":"uniprot_id","subj":"T1358","obj":"https://www.uniprot.org/uniprot/Q9UFZ6"},{"id":"A1359","pred":"uniprot_id","subj":"T1359","obj":"https://www.uniprot.org/uniprot/Q9UFZ6"},{"id":"A1360","pred":"uniprot_id","subj":"T1360","obj":"https://www.uniprot.org/uniprot/Q4VAY7"},{"id":"A1361","pred":"uniprot_id","subj":"T1360","obj":"https://www.uniprot.org/uniprot/P28222"},{"id":"A1362","pred":"uniprot_id","subj":"T1362","obj":"https://www.uniprot.org/uniprot/Q9UFZ6"},{"id":"A1363","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9UIP0"},{"id":"A1364","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9UIN9"},{"id":"A1365","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9UIN8"},{"id":"A1366","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9UIN7"},{"id":"A1367","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9UIN6"},{"id":"A1368","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9UBH8"},{"id":"A1369","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9NRH8"},{"id":"A1370","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9NRH7"},{"id":"A1371","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9NRH6"},{"id":"A1372","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9NRH5"},{"id":"A1373","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9NRH4"},{"id":"A1374","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9NPG5"},{"id":"A1375","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9NPE0"},{"id":"A1376","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q9NP52"},{"id":"A1377","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q95IF9"},{"id":"A1378","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q8N5P3"},{"id":"A1379","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q8IZU6"},{"id":"A1380","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q8IZU5"},{"id":"A1381","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q8IZU4"},{"id":"A1382","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q86Z04"},{"id":"A1383","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q7YR44"},{"id":"A1384","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q7LA71"},{"id":"A1385","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q7LA70"},{"id":"A1386","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q5STD2"},{"id":"A1387","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q5SQ85"},{"id":"A1388","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q1XI16"},{"id":"A1389","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q1XI12"},{"id":"A1390","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/Q15517"},{"id":"A1391","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/O43509"},{"id":"A1392","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/O19084"},{"id":"A1393","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/B0UYZ7"},{"id":"A1394","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/B0S7V2"},{"id":"A1395","pred":"uniprot_id","subj":"T1363","obj":"https://www.uniprot.org/uniprot/A5A6L9"},{"id":"A1396","pred":"uniprot_id","subj":"T1396","obj":"https://www.uniprot.org/uniprot/Q9UFZ6"},{"id":"A1397","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/Q9DB14"},{"id":"A1398","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/Q9D2P6"},{"id":"A1399","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/Q9CYP0"},{"id":"A1400","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/Q9BS38"},{"id":"A1401","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/Q8CCZ3"},{"id":"A1402","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/Q6FHD0"},{"id":"A1403","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/Q06A28"},{"id":"A1404","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/P27682"},{"id":"A1405","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/P18844"},{"id":"A1406","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/P12961"},{"id":"A1407","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/P05408"},{"id":"A1408","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/P01165"},{"id":"A1409","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/P01164"},{"id":"A1410","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/A5HAK0"},{"id":"A1411","pred":"uniprot_id","subj":"T1397","obj":"https://www.uniprot.org/uniprot/A5A6J6"},{"id":"A1412","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q9QAS2"},{"id":"A1413","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q9QAR5"},{"id":"A1414","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q9QAQ8"},{"id":"A1415","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q9IW04"},{"id":"A1416","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q9IKD1"},{"id":"A1417","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q990M4"},{"id":"A1418","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q990M3"},{"id":"A1419","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q990M2"},{"id":"A1420","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q990M1"},{"id":"A1421","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q91AV1"},{"id":"A1422","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q91A26"},{"id":"A1423","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q8V436"},{"id":"A1424","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q8JSP8"},{"id":"A1425","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q8BB25"},{"id":"A1426","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q86623"},{"id":"A1427","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q85088"},{"id":"A1428","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q85087"},{"id":"A1429","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q80BV6"},{"id":"A1430","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q7TFB1"},{"id":"A1431","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q7TFA2"},{"id":"A1432","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q7TA19"},{"id":"A1433","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q7T6T3"},{"id":"A1434","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q7T696"},{"id":"A1435","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q77NC4"},{"id":"A1436","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q6TNF9"},{"id":"A1437","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q6R1L7"},{"id":"A1438","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q6QU82"},{"id":"A1439","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q6Q1S2"},{"id":"A1440","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q696Q6"},{"id":"A1441","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q66291"},{"id":"A1442","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q66290"},{"id":"A1443","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q66199"},{"id":"A1444","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q66177"},{"id":"A1445","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q66176"},{"id":"A1446","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q66174"},{"id":"A1447","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q65984"},{"id":"A1448","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q5MQD0"},{"id":"A1449","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q5I5X9"},{"id":"A1450","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q5DIY0"},{"id":"A1451","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q5DIX9"},{"id":"A1452","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q5DIX8"},{"id":"A1453","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q5DIX7"},{"id":"A1454","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q52PA3"},{"id":"A1455","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q4ZJS1"},{"id":"A1456","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q4U5G0"},{"id":"A1457","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q3T8J0"},{"id":"A1458","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q3LZX1"},{"id":"A1459","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q3I5J5"},{"id":"A1460","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q14EB0"},{"id":"A1461","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q0ZME7"},{"id":"A1462","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q0Q4F2"},{"id":"A1463","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q0Q475"},{"id":"A1464","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q0Q466"},{"id":"A1465","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q0GNB8"},{"id":"A1466","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q02385"},{"id":"A1467","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q02167"},{"id":"A1468","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q01977"},{"id":"A1469","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/Q008X4"},{"id":"A1470","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P89344"},{"id":"A1471","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P89343"},{"id":"A1472","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P89342"},{"id":"A1473","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P59594"},{"id":"A1474","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P36334"},{"id":"A1475","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P36300"},{"id":"A1476","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P33470"},{"id":"A1477","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P30208"},{"id":"A1478","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P30207"},{"id":"A1479","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P30206"},{"id":"A1480","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P27662"},{"id":"A1481","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P27655"},{"id":"A1482","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P27277"},{"id":"A1483","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P25194"},{"id":"A1484","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P25193"},{"id":"A1485","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P25192"},{"id":"A1486","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P25191"},{"id":"A1487","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P25190"},{"id":"A1488","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P24413"},{"id":"A1489","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P23052"},{"id":"A1490","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P22432"},{"id":"A1491","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P18450"},{"id":"A1492","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P17662"},{"id":"A1493","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P15777"},{"id":"A1494","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P15423"},{"id":"A1495","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P12722"},{"id":"A1496","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P12651"},{"id":"A1497","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P12650"},{"id":"A1498","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P12647"},{"id":"A1499","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P11225"},{"id":"A1500","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P11224"},{"id":"A1501","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P11223"},{"id":"A1502","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P10033"},{"id":"A1503","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P0DTC2"},{"id":"A1504","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P07946"},{"id":"A1505","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P05135"},{"id":"A1506","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/P05134"},{"id":"A1507","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/O90304"},{"id":"A1508","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/O39227"},{"id":"A1509","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/K9N5Q8"},{"id":"A1510","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/A3EXG6"},{"id":"A1511","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/A3EXD0"},{"id":"A1512","pred":"uniprot_id","subj":"T1412","obj":"https://www.uniprot.org/uniprot/A3EX94"},{"id":"A1513","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q82706"},{"id":"A1514","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q82683"},{"id":"A1515","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q82020"},{"id":"A1516","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q787B5"},{"id":"A1517","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q77SK0"},{"id":"A1518","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q77N36"},{"id":"A1519","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q76G52"},{"id":"A1520","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q75T09"},{"id":"A1521","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q6X1D5"},{"id":"A1522","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q6X1D1"},{"id":"A1523","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q6TYA0"},{"id":"A1524","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q6E0W7"},{"id":"A1525","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q66T62"},{"id":"A1526","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5VKP3"},{"id":"A1527","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5VKN9"},{"id":"A1528","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5K2K4"},{"id":"A1529","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5IX93"},{"id":"A1530","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5IX92"},{"id":"A1531","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5IX91"},{"id":"A1532","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5IX90"},{"id":"A1533","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5IX89"},{"id":"A1534","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5IX88"},{"id":"A1535","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5IX87"},{"id":"A1536","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q5GA86"},{"id":"A1537","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q58FH1"},{"id":"A1538","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q4VKV3"},{"id":"A1539","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q4F900"},{"id":"A1540","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q49LL3"},{"id":"A1541","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q49IU2"},{"id":"A1542","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q49IU1"},{"id":"A1543","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q49IT9"},{"id":"A1544","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q49IT8"},{"id":"A1545","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q49AV0"},{"id":"A1546","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q0GBY1"},{"id":"A1547","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q0GBX6"},{"id":"A1548","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/Q08089"},{"id":"A1549","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P32595"},{"id":"A1550","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P32550"},{"id":"A1551","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P19462"},{"id":"A1552","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P16288"},{"id":"A1553","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P15199"},{"id":"A1554","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P13180"},{"id":"A1555","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P0C572"},{"id":"A1556","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P08667"},{"id":"A1557","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P08163"},{"id":"A1558","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P07923"},{"id":"A1559","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P04884"},{"id":"A1560","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P04883"},{"id":"A1561","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P04882"},{"id":"A1562","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P03524"},{"id":"A1563","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/P03522"},{"id":"A1564","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/O92284"},{"id":"A1565","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/O56677"},{"id":"A1566","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/O10236"},{"id":"A1567","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/J7HBH4"},{"id":"A1568","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/D8V075"},{"id":"A1569","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/A7WNB3"},{"id":"A1570","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/A4UHQ6"},{"id":"A1571","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/A4UHQ1"},{"id":"A1572","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/A3RM22"},{"id":"A1573","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/A3F5R8"},{"id":"A1574","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/A3F5R3"},{"id":"A1575","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/A3F5Q8"},{"id":"A1576","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/A3F5N3"},{"id":"A1577","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/A3F5M3"},{"id":"A1578","pred":"uniprot_id","subj":"T1513","obj":"https://www.uniprot.org/uniprot/A3F5L8"},{"id":"A1579","pred":"uniprot_id","subj":"T1579","obj":"https://www.uniprot.org/uniprot/Q9UFZ6"},{"id":"A1580","pred":"uniprot_id","subj":"T1580","obj":"https://www.uniprot.org/uniprot/Q9UFZ6"},{"id":"A1581","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9UIP0"},{"id":"A1582","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9UIN9"},{"id":"A1583","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9UIN8"},{"id":"A1584","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9UIN7"},{"id":"A1585","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9UIN6"},{"id":"A1586","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9UBH8"},{"id":"A1587","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9NRH8"},{"id":"A1588","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9NRH7"},{"id":"A1589","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9NRH6"},{"id":"A1590","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9NRH5"},{"id":"A1591","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9NRH4"},{"id":"A1592","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9NPG5"},{"id":"A1593","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9NPE0"},{"id":"A1594","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q9NP52"},{"id":"A1595","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q95IF9"},{"id":"A1596","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q8N5P3"},{"id":"A1597","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q8IZU6"},{"id":"A1598","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q8IZU5"},{"id":"A1599","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q8IZU4"},{"id":"A1600","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q86Z04"},{"id":"A1601","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q7YR44"},{"id":"A1602","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q7LA71"},{"id":"A1603","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q7LA70"},{"id":"A1604","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q5STD2"},{"id":"A1605","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q5SQ85"},{"id":"A1606","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q1XI16"},{"id":"A1607","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q1XI12"},{"id":"A1608","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/Q15517"},{"id":"A1609","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/O43509"},{"id":"A1610","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/O19084"},{"id":"A1611","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/B0UYZ7"},{"id":"A1612","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/B0S7V2"},{"id":"A1613","pred":"uniprot_id","subj":"T1581","obj":"https://www.uniprot.org/uniprot/A5A6L9"},{"id":"A1614","pred":"uniprot_id","subj":"T1614","obj":"https://www.uniprot.org/uniprot/Q9UFZ6"},{"id":"A1615","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/Q9DB14"},{"id":"A1616","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/Q9D2P6"},{"id":"A1617","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/Q9CYP0"},{"id":"A1618","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/Q9BS38"},{"id":"A1619","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/Q8CCZ3"},{"id":"A1620","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/Q6FHD0"},{"id":"A1621","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/Q06A28"},{"id":"A1622","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/P27682"},{"id":"A1623","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/P18844"},{"id":"A1624","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/P12961"},{"id":"A1625","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/P05408"},{"id":"A1626","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/P01165"},{"id":"A1627","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/P01164"},{"id":"A1628","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/A5HAK0"},{"id":"A1629","pred":"uniprot_id","subj":"T1615","obj":"https://www.uniprot.org/uniprot/A5A6J6"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    LitCovid-sample-PD-IDO

    {"project":"LitCovid-sample-PD-IDO","denotations":[{"id":"T45","span":{"begin":435,"end":440},"obj":"http://purl.obolibrary.org/obo/CL_0000000"},{"id":"T46","span":{"begin":2386,"end":2390},"obj":"http://purl.obolibrary.org/obo/BFO_0000029"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    LitCovid-sample-PD-FMA

    {"project":"LitCovid-sample-PD-FMA","denotations":[{"id":"T46","span":{"begin":67,"end":79},"obj":"Body_part"},{"id":"T47","span":{"begin":435,"end":440},"obj":"Body_part"},{"id":"T48","span":{"begin":703,"end":711},"obj":"Body_part"},{"id":"T49","span":{"begin":1012,"end":1019},"obj":"Body_part"},{"id":"T50","span":{"begin":1054,"end":1063},"obj":"Body_part"},{"id":"T51","span":{"begin":1650,"end":1660},"obj":"Body_part"},{"id":"T52","span":{"begin":1683,"end":1693},"obj":"Body_part"},{"id":"T53","span":{"begin":1856,"end":1867},"obj":"Body_part"},{"id":"T54","span":{"begin":2098,"end":2109},"obj":"Body_part"},{"id":"T55","span":{"begin":2608,"end":2620},"obj":"Body_part"},{"id":"T56","span":{"begin":2771,"end":2778},"obj":"Body_part"},{"id":"T57","span":{"begin":3083,"end":3090},"obj":"Body_part"},{"id":"T58","span":{"begin":3381,"end":3388},"obj":"Body_part"},{"id":"T59","span":{"begin":3407,"end":3416},"obj":"Body_part"},{"id":"T60","span":{"begin":3650,"end":3658},"obj":"Body_part"},{"id":"T61","span":{"begin":4113,"end":4119},"obj":"Body_part"},{"id":"T62","span":{"begin":4163,"end":4175},"obj":"Body_part"}],"attributes":[{"id":"A53","pred":"fma_id","subj":"T53","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A61","pred":"fma_id","subj":"T61","obj":"http://purl.org/sig/ont/fma/fma82790"},{"id":"A54","pred":"fma_id","subj":"T54","obj":"http://purl.org/sig/ont/fma/fma82740"},{"id":"A55","pred":"fma_id","subj":"T55","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A52","pred":"fma_id","subj":"T52","obj":"http://purl.org/sig/ont/fma/fma82759"},{"id":"A46","pred":"fma_id","subj":"T46","obj":"http://purl.org/sig/ont/fma/fma62925"},{"id":"A57","pred":"fma_id","subj":"T57","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A60","pred":"fma_id","subj":"T60","obj":"http://purl.org/sig/ont/fma/fma82751"},{"id":"A56","pred":"fma_id","subj":"T56","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A50","pred":"fma_id","subj":"T50","obj":"http://purl.org/sig/ont/fma/fma82752"},{"id":"A47","pred":"fma_id","subj":"T47","obj":"http://purl.org/sig/ont/fma/fma68646"},{"id":"A59","pred":"fma_id","subj":"T59","obj":"http://purl.org/sig/ont/fma/fma82752"},{"id":"A49","pred":"fma_id","subj":"T49","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A48","pred":"fma_id","subj":"T48","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A62","pred":"fma_id","subj":"T62","obj":"http://purl.org/sig/ont/fma/fma82784"},{"id":"A51","pred":"fma_id","subj":"T51","obj":"http://purl.org/sig/ont/fma/fma82759"},{"id":"A58","pred":"fma_id","subj":"T58","obj":"http://purl.org/sig/ont/fma/fma67257"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    LitCovid-sample-PD-MAT

    {"project":"LitCovid-sample-PD-MAT","denotations":[{"id":"T2","span":{"begin":4147,"end":4155},"obj":"http://purl.obolibrary.org/obo/MAT_0000086"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    LitCovid-sample-PD-GO-BP-0

    {"project":"LitCovid-sample-PD-GO-BP-0","denotations":[{"id":"T32","span":{"begin":203,"end":216},"obj":"http://purl.obolibrary.org/obo/GO_0070085"},{"id":"T33","span":{"begin":349,"end":362},"obj":"http://purl.obolibrary.org/obo/GO_0070085"},{"id":"T34","span":{"begin":642,"end":652},"obj":"http://purl.obolibrary.org/obo/GO_0007586"},{"id":"T35","span":{"begin":863,"end":873},"obj":"http://purl.obolibrary.org/obo/GO_0007586"},{"id":"T36","span":{"begin":1320,"end":1330},"obj":"http://purl.obolibrary.org/obo/GO_0007586"},{"id":"T37","span":{"begin":1732,"end":1743},"obj":"http://purl.obolibrary.org/obo/GO_0006412"},{"id":"T38","span":{"begin":2357,"end":2379},"obj":"http://purl.obolibrary.org/obo/GO_0006413"},{"id":"T39","span":{"begin":2357,"end":2368},"obj":"http://purl.obolibrary.org/obo/GO_0006412"},{"id":"T40","span":{"begin":2446,"end":2457},"obj":"http://purl.obolibrary.org/obo/GO_0006412"},{"id":"T41","span":{"begin":2855,"end":2868},"obj":"http://purl.obolibrary.org/obo/GO_0070085"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    LitCovid-sample-GO-BP

    {"project":"LitCovid-sample-GO-BP","denotations":[{"id":"T30","span":{"begin":203,"end":216},"obj":"http://purl.obolibrary.org/obo/GO_0070085"},{"id":"T31","span":{"begin":349,"end":362},"obj":"http://purl.obolibrary.org/obo/GO_0070085"},{"id":"T32","span":{"begin":642,"end":652},"obj":"http://purl.obolibrary.org/obo/GO_0007586"},{"id":"T33","span":{"begin":863,"end":873},"obj":"http://purl.obolibrary.org/obo/GO_0007586"},{"id":"T34","span":{"begin":1320,"end":1330},"obj":"http://purl.obolibrary.org/obo/GO_0007586"},{"id":"T35","span":{"begin":1732,"end":1743},"obj":"http://purl.obolibrary.org/obo/GO_0006412"},{"id":"T36","span":{"begin":2357,"end":2379},"obj":"http://purl.obolibrary.org/obo/GO_0006413"},{"id":"T37","span":{"begin":2357,"end":2368},"obj":"http://purl.obolibrary.org/obo/GO_0006412"},{"id":"T38","span":{"begin":2446,"end":2457},"obj":"http://purl.obolibrary.org/obo/GO_0006412"},{"id":"T39","span":{"begin":2855,"end":2868},"obj":"http://purl.obolibrary.org/obo/GO_0070085"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}

    2_test

    {"project":"2_test","denotations":[{"id":"32841605-30778233-19659502","span":{"begin":1470,"end":1474},"obj":"30778233"}],"text":"Expression, Purification, and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer and Soluble Human ACE2\nA trimer-stabilized, soluble variant of the SARS-CoV-2 S that contains 22 canonical N-linked glycosylation sequons per protomer and a soluble version of human ACE2 that contains six, lacking the most C-terminal seventh, canonical N-linked glycosylation sequons (Figure 1 A) were purified from the media of transfected HEK293 cells, and the quaternary structure confirmed by negative EM staining for the S trimer (Figure 1B) and purity examined by SDS-PAGE Coomassie G-250 stained gels for both (Figure 1C). In addition, proteolytic digestions followed by proteomic analyses confirmed that the proteins were highly purified (Table S12). Finally, the N terminus of both the mature S and the soluble mature ACE2 were empirically determined via proteolytic digestions and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analyses. These results confirmed that both the secreted, mature forms of S protein and ACE2 begin with an N-terminal glutamine that has undergone condensation to form pyroglutamine at residues 14 and 18, respectively (Figures 1D and S1). The N-terminal peptide observed for S also contains a glycan at Asn-0017 (Figure 1D), and mass spectrometry analysis of non-reducing proteolytic digestions confirmed that Cys-0015 of S is in a disulfide linkage with Cys-0136 (Figure S2; Table S2). Given that SignalP (Almagro Armenteros et al., 2019) predicts signal sequence cleavage between Cys-0015 and Val-0016 but we observed cleavage between Ser-0013 and Gln-0014, we examined the possibility that an in-frame upstream methionine to the proposed start methionine (Figure 1A) might be used to initiate translation (Figure S3). If one examines the predicted signal sequence cleavage using the in-frame Met that is encoded nine amino acids upstream, SignalP now predicts cleavage between the Ser and Gln that we observed in our studies (Figure S3). To examine whether this impacted S expression, we expressed constructs that contained or did not contain the upstream 27 nucleotides in a pseudovirus (VSV) system expressing SARS-CoV-2 S (Figure S4) and in our HEK293 system (data not shown). Both expression systems produced a similar amount of S regardless of which expression construct was utilized (Figure S4). Thus, while the translation initiation start site has still not been fully defined, allowing for earlier translation in expression construct design did not have a significant impact on the generation of S.\nFigure 1 Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2\n(A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red.\n(B and C) Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250-stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM, molecular weight markers.\n(D) A representative Step-HCD fragmentation spectrum from mass-spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Figure 2) is overlaid by using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated, and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide."}