Id |
Subject |
Object |
Predicate |
Lexical cue |
T74 |
0-105 |
Sentence |
denotes |
Glycomics-Informed Glycoproteomics Reveals Site-Specific Microheterogeneity of SARS-CoV-2 S Glycosylation |
T75 |
106-190 |
Sentence |
denotes |
We utilized multiple approaches to examine glycosylation of the SARS-CoV-2 S trimer. |
T76 |
191-326 |
Sentence |
denotes |
First, the portfolio of glycans linked to SARS-CoV-2 S trimer immunogen was analyzed after their release from the polypeptide backbone. |
T77 |
327-453 |
Sentence |
denotes |
N-glycans were released from protein by treatment with PNGase F- and O-glycans were subsequently released by beta-elimination. |
T78 |
454-650 |
Sentence |
denotes |
After permethylation to enhance detection sensitivity and structural characterization, released glycans were analyzed by multi-stage mass spectrometry (MSn) (Aoki et al., 2007; Aoki et al., 2008). |
T79 |
651-776 |
Sentence |
denotes |
Mass spectra were processed by GRITS Toolbox, and the resulting annotations were validated manually (Weatherly et al., 2019). |
T80 |
777-933 |
Sentence |
denotes |
Glycan assignments were grouped by type and by additional structural features for relative quantification of profile characteristics (Figure 2 A; Table S3). |
T81 |
934-1102 |
Sentence |
denotes |
This analysis quantified 49 N-glycans and revealed that 55% of the total glycan abundance was of the complex type, 17% was of the hybrid type, and 28% was high mannose. |
T82 |
1103-1252 |
Sentence |
denotes |
Among the complex and hybrid N-glycans, we observed a high degree of core fucosylation and significant abundance of bisected and LacDiNAc structures. |
T83 |
1253-1494 |
Sentence |
denotes |
We also observed sulfated N-linked glycans by using negative mode MSn analyses (Table S13), although signal intensity was too low in positive ion mode (at least 10-fold lower than any of the non-sulfated glycans) for accurate quantification. |
T84 |
1495-1582 |
Sentence |
denotes |
In addition, we detected 15 O-glycans released from the S trimer (Figure S5; Table S4). |
T85 |
1583-1721 |
Sentence |
denotes |
Figure 2 Glycomics-Informed Glycoproteomics Reveals Substantial Site-Specific Microheterogeneity of N-linked Glycosylation on SARS-CoV-2 S |
T86 |
1722-1825 |
Sentence |
denotes |
(A) Glycans released from SARS-CoV-2 S protein trimer immunogen were permethylated and analyzed by MSn. |
T87 |
1826-1947 |
Sentence |
denotes |
Structures were assigned and grouped by type and structural features, and prevalence was determined based on ion current. |
T88 |
1948-2006 |
Sentence |
denotes |
The pie chart shows basic division by broad N-glycan type. |
T89 |
2007-2075 |
Sentence |
denotes |
The bar graph provides additional detail about the glycans detected. |
T90 |
2076-2249 |
Sentence |
denotes |
The most abundant structure with a unique categorization by glycomics for each N-glycan type in the pie chart, or above each feature category in the bar graph, is indicated. |
T91 |
2250-2474 |
Sentence |
denotes |
(B–E) Glycopeptides were prepared from SARS-CoV-2 S protein trimer immunogen by using multiple combinations of proteases, analyzed by LC-MSn, and the resulting data were searched by using several different software packages. |
T92 |
2475-2597 |
Sentence |
denotes |
Four representative sites of N-linked glycosylation with specific features of interest were chosen and are presented here. |
T93 |
2598-2826 |
Sentence |
denotes |
N0074 (B) and N0149 (C) are shown that occur in variable insert regions of S compared to SARS-CoV and other related coronaviruses, and there are emerging variants of SARS-CoV-2 that disrupt these two sites of glycosylation in S. |
T94 |
2827-2885 |
Sentence |
denotes |
N0234 (D) contains the most high-mannose N-linked glycans. |
T95 |
2886-3036 |
Sentence |
denotes |
N0801 (D) is an example of glycosylation in the S2 region of the immunogen and displays a high degree of hybrid glycosylation compared to other sites. |
T96 |
3037-3119 |
Sentence |
denotes |
The abundance of each composition is graphed in terms of assigned spectral counts. |
T97 |
3120-3240 |
Sentence |
denotes |
Representative glycans (as determined by glycomics analysis) for several abundant compositions are shown in SNFG format. |
T98 |
3241-3372 |
Sentence |
denotes |
The abbreviations used here and throughout the manuscript are as follows: N, HexNAc; H, hexose; F, fucose; A, Neu5Ac; S, sulfation. |
T99 |
3373-3537 |
Sentence |
denotes |
Note that the graphs for the other 18 sites and other graphs grouping the microheterogeneity observed by other properties are presented in Supplemental Information. |
T100 |
3538-3778 |
Sentence |
denotes |
To determine occupancy of N-linked glycans at each site, we employed a sequential deglycoslyation approach by using Endoglycosidase H and PNGase F in the presence of 18O-H2O after tryptic digestion of S (Wang et al., 2020; Yu et al., 2018). |
T101 |
3779-3910 |
Sentence |
denotes |
After LC-MS/MS analyses, the resulting data confirmed that 19 of the canonical sequons had occupancies greater than 95% (Table S5). |
T102 |
3911-4075 |
Sentence |
denotes |
One canonical sequence, N0149, had insufficient spectral counts for quantification by this method, but subsequent analyses described below suggested high occupancy. |
T103 |
4076-4181 |
Sentence |
denotes |
The two most C-terminal N-linked sites, N1173 and N1194, had reduced occupancy, 52% and 82% respectively. |
T104 |
4182-4361 |
Sentence |
denotes |
Reduced occupancy at these sites could reflect hindered en bloc transfer by the oligosaccharyltransferase (OST) due to primary amino acid sequences at or near the N-linked sequon. |
T105 |
4362-4694 |
Sentence |
denotes |
Alternatively, this could reflect these two sites being post-translationally modified after release of the protein by the ribosome by a less efficient STT3B-containing OST, either due to activity or initial folding of the polypeptide, as opposed to co-translationally modified by the STT3A-containing OST (Ruiz-Canada et al., 2009). |
T106 |
4695-5021 |
Sentence |
denotes |
None of the non-canonical sequons (three N-X-C sites and four N-G-L/I/V sites; Zielinska et al., 2010) showed significant occupancy (>5%), except for N0501, which showed moderate (19%) conversion to 18O-Asp that could be due to deamidation that is facilitated by glycine at the +1 position (Table S5) (Palmisano et al., 2012). |
T107 |
5022-5177 |
Sentence |
denotes |
Further analysis of this site (see below) by direct glycopeptide analyses allowed us to determine that N0501 undergoes deamidation but is not glycosylated. |
T108 |
5178-5339 |
Sentence |
denotes |
Thus, all, and only the, 22 canonical sequences for N-linked glycosylation (N-X-S/T) are utilized, with only N1173 and N1194 demonstrating occupancies below 95%. |
T109 |
5340-5502 |
Sentence |
denotes |
Next, we applied three different proteolytic digestion strategies to the SARS-CoV-2 S immunogen to maximize glycopeptide coverage by subsequent LC-MS/MS analyses. |
T110 |
5503-5803 |
Sentence |
denotes |
Extended gradient nanoflow reverse-phase LC-MS/MS was carried out on a ThermoFisher Lumos Tribrid instrument using Step-HCD fragmentation on each of the samples (see STAR Methods for details, as well as Duan et al., 2018; Escolano et al., 2019; Wang et al., 2020; Yu et al., 2018; Zhou et al., 2017). |
T111 |
5804-6117 |
Sentence |
denotes |
After data analyses using pGlyco 2.2.2 (Liu et al., 2017), Byonic (Bern et al., 2012), and manual validation of glycan compositions against our released glycomics findings (Figure 2A; Tables S3 and S13), we were able to determine the microheterogeneity at each of the 22 canonical sites (Figures 2B–2E; Table S6). |
T112 |
6118-6226 |
Sentence |
denotes |
Notably, none of the non-canonical consensus sequences, including N0501, displayed any quantifiable glycans. |
T113 |
6227-6354 |
Sentence |
denotes |
The N-glycosites N0074 (Figure 2B) and N0149 (Figure 2C) are highly processed and display a typical mammalian N-glycan profile. |
T114 |
6355-6445 |
Sentence |
denotes |
N0149 is, however, modified with several hybrid N-glycan structures, whereas N0074 is not. |
T115 |
6446-6636 |
Sentence |
denotes |
N0234 (Figure 2D) and N0801 (Figure 2E) have N-glycan profiles more similar to those found on other viruses such as HIV (Watanabe et al., 2019) that are dominated by high-mannose structures. |
T116 |
6637-6791 |
Sentence |
denotes |
N0234 (Figure 2D) displays an abundance of Man7-Man9 high-mannose structures, suggesting stalled processing by early-acting ER and cis-Golgi mannosidases. |
T117 |
6792-6989 |
Sentence |
denotes |
In contrast, N0801 (Figure 2E) is processed more efficiently to Man5 high-mannose and hybrid structures, suggesting that access to the glycan at this site by MGAT1 and α-Mannosidase II is hindered. |
T118 |
6990-7257 |
Sentence |
denotes |
In general, for all 22 sites (Figures 2B–2E; Table S6), we observed underprocessing of complex glycan antennae (i.e., under-galactosylation and under-sialylation) and a high degree of core fucosylation in agreement with released glycan analyses (Figure 2A; Table S3). |
T119 |
7258-7356 |
Sentence |
denotes |
We also observed a small percent of sulfated N-linked glycans at several sites (Tables S6 and S8). |
T120 |
7357-7572 |
Sentence |
denotes |
Based on the assignments and the spectral counts for each topology, we were able to determine the percent of total N-linked glycan types (high-mannose, hybrid, or complex) present at each site (Figure 3 ; Table S7). |
T121 |
7573-7819 |
Sentence |
denotes |
Notably, three of the sites (N0234, N0709, and N0717) displayed more than 50% high-mannose glycans, whereas 11 other sites (N0017, N0074, N0149, N0165, N0282, N0331, N0657, N1134, N1158, N1173, and N1194) were more than 90% complex when occupied. |
T122 |
7820-7886 |
Sentence |
denotes |
The other eight sites were distributed between these two extremes. |
T123 |
7887-8017 |
Sentence |
denotes |
Notably, only one site (N0717 at 45%), which also had greater than 50% high mannose (55%), had greater than 33% hybrid structures. |
T124 |
8018-8288 |
Sentence |
denotes |
To further evaluate the heterogeneity, we grouped all the topologies into the 20 classes recently described by the Crispin laboratory, adding two categories (sulfated and unoccupied) that we refer to here as the Oxford classification (Table S8) (Watanabe et al., 2020a). |
T125 |
8289-8588 |
Sentence |
denotes |
Among other features observed, this classification allowed us to observe that although most sites with high-mannose structures were dominated by the Man5GlcNAc2 structure, N0234 and N0717 were dominated by the higher Man structures of Man8GlcNAc2 and Man7GlcNAc2, respectively (Figure S7; Table S8). |
T126 |
8589-8812 |
Sentence |
denotes |
Limited processing at N0234 is in agreement with a recent report suggesting that high-mannose structures at this site help to stabilize the receptor-binding domain of S (www.biorxiv.org/content/10.1101/2020.06.11.146522v1). |
T127 |
8813-9140 |
Sentence |
denotes |
Furthermore, applying the Oxford classifications to our dataset clearly demonstrates that the three most C-terminal sites (N1158, N1173, and N1194), dominated by complex-type glycans, were more often further processed (i.e., multiple antennae) and elaborated (i.e., galactosylation and sialylation) than other sites (Table S8). |
T128 |
9141-9235 |
Sentence |
denotes |
Figure 3 SARS-CoV-2 S Immunogen N-glycan Sites Are Predominantly Modified by Complex N-glycans |
T129 |
9236-9517 |
Sentence |
denotes |
N-glycan topologies were assigned to all 22 sites of the S protomer and the spectral counts for each of the three types of N-glycans (high-mannose, hybrid, and complex), as well as the unoccupied peptide spectral match counts at each site, were summed and visualized as pie charts. |
T130 |
9518-9605 |
Sentence |
denotes |
Note that only N1173 and N1194 show an appreciable amount of the unoccupied amino acid. |
T131 |
9606-9889 |
Sentence |
denotes |
We also analyzed our generated mass spectrometry data for the presence of O-linked glycans based on our glycomic findings (Figure S5; Table S4) and a recent manuscript suggesting significant levels of O-glycosylation of S1 and S2 when expressed independently (Shajahan et al., 2020). |
T132 |
9890-10026 |
Sentence |
denotes |
We were able to confirm sites of O-glycan modification with microheterogeneity observed for the vast majority of these sites (Table S9). |
T133 |
10027-10230 |
Sentence |
denotes |
However, occupancy at each site, determined by spectral counts, was observed to be very low (below 4%), except for Thr0323, which had a modestly higher but still low 11% occupancy (Figure S6; Table S10). |