PMC:7553147 / 19258-32149 JSONTXT

Annnotations TAB JSON ListView MergeView

    LitCovid-PD-FMA-UBERON

    {"project":"LitCovid-PD-FMA-UBERON","denotations":[{"id":"T25","span":{"begin":126,"end":129},"obj":"Body_part"},{"id":"T26","span":{"begin":261,"end":268},"obj":"Body_part"},{"id":"T27","span":{"begin":363,"end":370},"obj":"Body_part"},{"id":"T28","span":{"begin":929,"end":932},"obj":"Body_part"},{"id":"T29","span":{"begin":1055,"end":1062},"obj":"Body_part"},{"id":"T30","span":{"begin":1250,"end":1253},"obj":"Body_part"},{"id":"T31","span":{"begin":1339,"end":1342},"obj":"Body_part"},{"id":"T32","span":{"begin":1504,"end":1507},"obj":"Body_part"},{"id":"T33","span":{"begin":1876,"end":1879},"obj":"Body_part"},{"id":"T34","span":{"begin":1927,"end":1930},"obj":"Body_part"},{"id":"T35","span":{"begin":2041,"end":2044},"obj":"Body_part"},{"id":"T36","span":{"begin":2092,"end":2095},"obj":"Body_part"},{"id":"T37","span":{"begin":2150,"end":2153},"obj":"Body_part"},{"id":"T38","span":{"begin":2281,"end":2289},"obj":"Body_part"},{"id":"T39","span":{"begin":2387,"end":2390},"obj":"Body_part"},{"id":"T40","span":{"begin":3306,"end":3309},"obj":"Body_part"},{"id":"T41","span":{"begin":3464,"end":3467},"obj":"Body_part"},{"id":"T42","span":{"begin":3768,"end":3771},"obj":"Body_part"},{"id":"T43","span":{"begin":4061,"end":4064},"obj":"Body_part"},{"id":"T44","span":{"begin":4199,"end":4202},"obj":"Body_part"},{"id":"T45","span":{"begin":5003,"end":5008},"obj":"Body_part"},{"id":"T46","span":{"begin":5104,"end":5109},"obj":"Body_part"},{"id":"T47","span":{"begin":5144,"end":5148},"obj":"Body_part"},{"id":"T48","span":{"begin":5326,"end":5329},"obj":"Body_part"},{"id":"T49","span":{"begin":5481,"end":5485},"obj":"Body_part"},{"id":"T50","span":{"begin":5667,"end":5677},"obj":"Body_part"},{"id":"T51","span":{"begin":5952,"end":5956},"obj":"Body_part"},{"id":"T52","span":{"begin":6310,"end":6313},"obj":"Body_part"},{"id":"T53","span":{"begin":6541,"end":6546},"obj":"Body_part"},{"id":"T54","span":{"begin":6968,"end":6979},"obj":"Body_part"},{"id":"T55","span":{"begin":7062,"end":7073},"obj":"Body_part"},{"id":"T56","span":{"begin":7160,"end":7165},"obj":"Body_part"},{"id":"T57","span":{"begin":7678,"end":7681},"obj":"Body_part"},{"id":"T58","span":{"begin":9740,"end":9744},"obj":"Body_part"},{"id":"T59","span":{"begin":10391,"end":10395},"obj":"Body_part"},{"id":"T60","span":{"begin":12311,"end":12318},"obj":"Body_part"},{"id":"T61","span":{"begin":12362,"end":12369},"obj":"Body_part"},{"id":"T62","span":{"begin":12643,"end":12646},"obj":"Body_part"}],"attributes":[{"id":"A25","pred":"fma_id","subj":"T25","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A26","pred":"fma_id","subj":"T26","obj":"http://purl.org/sig/ont/fma/fma82761"},{"id":"A27","pred":"fma_id","subj":"T27","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A28","pred":"fma_id","subj":"T28","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A29","pred":"fma_id","subj":"T29","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A30","pred":"fma_id","subj":"T30","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A31","pred":"fma_id","subj":"T31","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A32","pred":"fma_id","subj":"T32","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A33","pred":"fma_id","subj":"T33","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A34","pred":"fma_id","subj":"T34","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A35","pred":"fma_id","subj":"T35","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A36","pred":"fma_id","subj":"T36","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A37","pred":"fma_id","subj":"T37","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A38","pred":"fma_id","subj":"T38","obj":"http://purl.org/sig/ont/fma/fma303630"},{"id":"A39","pred":"fma_id","subj":"T39","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A40","pred":"fma_id","subj":"T40","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A41","pred":"fma_id","subj":"T41","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A42","pred":"fma_id","subj":"T42","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A43","pred":"fma_id","subj":"T43","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A44","pred":"fma_id","subj":"T44","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A45","pred":"fma_id","subj":"T45","obj":"http://purl.org/sig/ont/fma/fma24938"},{"id":"A46","pred":"fma_id","subj":"T46","obj":"http://purl.org/sig/ont/fma/fma24938"},{"id":"A47","pred":"fma_id","subj":"T47","obj":"http://purl.org/sig/ont/fma/fma12520"},{"id":"A48","pred":"fma_id","subj":"T48","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A49","pred":"fma_id","subj":"T49","obj":"http://purl.org/sig/ont/fma/fma256135"},{"id":"A50","pred":"fma_id","subj":"T50","obj":"http://purl.org/sig/ont/fma/fma9713"},{"id":"A51","pred":"fma_id","subj":"T51","obj":"http://purl.org/sig/ont/fma/fma256135"},{"id":"A52","pred":"fma_id","subj":"T52","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A53","pred":"fma_id","subj":"T53","obj":"http://purl.org/sig/ont/fma/fma24938"},{"id":"A54","pred":"fma_id","subj":"T54","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A55","pred":"fma_id","subj":"T55","obj":"http://purl.org/sig/ont/fma/fma82739"},{"id":"A56","pred":"fma_id","subj":"T56","obj":"http://purl.org/sig/ont/fma/fma24938"},{"id":"A57","pred":"fma_id","subj":"T57","obj":"http://purl.org/sig/ont/fma/fma67847"},{"id":"A58","pred":"fma_id","subj":"T58","obj":"http://purl.org/sig/ont/fma/fma12520"},{"id":"A59","pred":"fma_id","subj":"T59","obj":"http://purl.org/sig/ont/fma/fma256135"},{"id":"A60","pred":"fma_id","subj":"T60","obj":"http://purl.org/sig/ont/fma/fma82761"},{"id":"A61","pred":"fma_id","subj":"T61","obj":"http://purl.org/sig/ont/fma/fma67257"},{"id":"A62","pred":"fma_id","subj":"T62","obj":"http://purl.org/sig/ont/fma/fma67847"}],"text":"3. Results  \nWith the goal set at analyzing spike flexibility, we describe our key results step by step.\n\n3.1. The ensemble map and the way to obtain it  \nIn the following, we describe the analysis of SARS-CoV-2 spike stabilized in the prefusion state by two proline substitutions in S2 (S-2P). We will objectively demonstrate that the flexibility of the spike protein should be understood as a quasi-continuum of conformations, so that when performing a structural analysis on this specimen special care has to be paid to the image-processing workflows, since they may directly impact on the interpretability of the results.\nStarting from the original SARS-CoV-2 S-2P data set of Wrapp et al. (2020 ▸), we have completely reanalyzed the data using our public domain software integration platform Scipion (de la Rosa-Trevín et al., 2016 ▸), breaking the global 3 Å resolution barrier. A representative view of the new ensemble map and its corresponding global FSC curve is shown in Fig. 1 ▸(a) (EMDB entry EMD-11328); the sequence of a monomer of the S protein is shown on the right to facilitate the further discussion of structure–function relationships (from Wrapp et al., 2020 ▸). Figs. 1 ▸(b) and 1 ▸(c) show a comparison between the original map (Wrapp et al., 2020 ▸) with EMDB code EMD-21375 and the newly reconstructed ensemble map corresponding to EMD-11328. Clearly, the local resolution (Vilas et al., 2018 ▸), which is shown on the left in Figs. 1 ▸(b) and 1 ▸(c), is increased in the new map, and the anisotropy, which is shown in the center, is much reduced. Finally, on the right we present plots of the radially averaged tangential resolution, which is related to the quality of the angular alignment (Vilas et al., 2020 ▸); the steeper the slope, the higher the angular assignment error. As can be appreciated, the slope calculated from the newly obtained map is almost zero when compared with that for the map from Wrapp et al. (2020 ▸), indicating that, in relative terms, the particle alignment used to create the new map is better than that used to build the original map. The result is an overall quantitative enhancement in map quality.\nIn terms of tracing, besides modeling several additional residue side chains and improving the geometry of the carbon skeleton (see Supplementary Fig. S2), one of the most noticeable improvements that we observed in the new map is an extension of the glycan chains that were initially built, particularly throughout the S2 fusion subunit (PDB entry 6zow). A quantitative comparison can be made between the length of glycan chains in the new ‘ensemble structure’ with respect to the previous structure (PDB entry 6vsb; see Supplementary Table S2). Although the total number of N-linked glycosylation sequons throughout the SARS-CoV-2 S trimer is essentially the same in the new structure (45) and PDB entry 6vsb (44), we have substantially increased the length of the glycan chains, expanding the total number of glycans by about 50%. We note the importance of this extensive glycosylation for epitope accessibility and how the accurate determination of this glycan shield will facilitate efforts to rapidly develop effective vaccines and therapeutics. Supplementary Fig. S2 shows a representative section of sharpened versions of the ensemble map (EMDB entry EMD-11328) compared with EMDB entry EMD-21375, in which the glycans can now be better traced. However, we should not forget that the ensemble map contains images in which the receptor-binding domain (RBD) and N-terminal domain (NTD) are in different positions (see Section 3.2), and consequently these domains appear to be blurred. Details of how the tracing was performed can be found in Section 2, while in Supplementary Fig. S3 we present two map-to-model quality figures indicating the good fit in general, with the obvious exception of the variable parts.\n\n3.2. Flexibility analysis  \nStarting from a carefully selected set of particles obtained from our consensus and cleaning approaches (see Section 2), together with the ensemble map described previously, we subjected the data to the following flexibility analysis.\nThe original images that were part of the ensemble map went through a ‘consensus classification’ procedure aimed at separating them into two algorithmically stable classes. Essentially, and as described in more detail in Section 2, we performed two independent classifications, further selecting those particles that were consistently together throughout the two classifications. In this way, we obtained the two new classes shown in Fig. 2 ▸(a). We will refer to these as the ‘closed conformation’ [Fig. 2 ▸(a), Class 1, EMDB entry EMD-11336] and the ‘open conformation’ [Fig. 2 ▸(a), Class 2, EMDB entry EMD-11337]. The number of images in each class was reduced to 45 000 in one case and 21 000 in the other, with global FSC-based resolutions of 3.1 and 3.3 Å, respectively.\nThe open and closed structures depict a clear and concerted movement of the ‘thumb’ formed by the RBD and sub­domains 1 and 2 (SD1 and SD2) and the NTD of an adjacent chain. The thumb moves away from the central spike axis, exposing the RBD in the up conformation. In order to make clearer where the changes are at the level of the Class 1 and Class 2 maps, we have made use of the representation of map local strains in Sorzano et al. (2016 ▸), which helps to very clearly visualize the type of strains needed to relate two maps, whether these are rigid-body rotations or more complex deformations (stretching). We have termed the maps resulting from this elastic analysis as ‘1s’ (Class 1, stretching) and ‘1r’ (Class 1, rotations) on the right-hand side of Fig. 2 ▸(a) and the same for Class 2. The color scale for both stretching and rotations goes from blue for small to red for large. Clearly, the differences among the classes with respect to the NTD and RBD have a very substantial component of pure coordinated rigid-body rotations, while the different RBDs present a much more complex pattern of deformations (stretching), indicating an important structural rearrangement in this area that does not occur elsewhere in the specimen. In terms of atomic modeling, we performed a flexible fitting of the ensemble model onto the closed and open forms [see Fig. 2 ▸(a), rightmost map; the PDB code for the open conformation is PDB entry 6zp7, while that for the closed conformation is 6zp5]. Focusing on rotations, which are the most simple element to follow, we can quantify that the degree of rotation of the thumb in these classes is close to 6°, as shown in Fig. 2 ▸(b). Given this flexibility, we consider that the best way to correctly present the experimental results is through the movie provided as Supplementary Movie S1, in which maps and atomic models are presented. Within the approximation to modeling that a flexible fitting represents, we can appreciate two hinge movements of the RBD–SD1–SD2 domains: one located between amino acids 318–326 and 588–595 that produces most of the displacement, and the other between amino acids 330–335 and 527–531 that accompanies a less pronounced ‘up’ movement of the RBD. This thumb motion is completed by the accompanying motion of the NTD from an adjacent chain. Also in a collective way, other NTDs and RBDs in the down conformation move slightly, as can better be appreciated in Supplementary Movie S1, where the transition between fitted models overlaps with the interpolation between observed high-resolution class maps.\nTo further investigate whether or not the flexibility was continuous, we proceeded as follows. Images from the two classes were pooled together and, using the ensemble map, subjected to a 3D principal component analysis (PCA). The approach we followed is based on Tagare et al. (2015 ▸), with some minor modifications of the method. A detailed explanation of the modifications is given in Section 2. We initialized the first principal component (PC) to the difference between the open and closed conformation, while the remaining PCs were initialized randomly. Upon convergence, the eigenvalue of each PC and the scatter of the images in PC space was calculated. The eigenvalues of the PCs are shown in Fig. 3 ▸(a). Clearly, the first three PCs are significant. The scatter plot of the image data in PC1–PC3 space is shown in Fig. 3 ▸(b). Fig. 3 ▸(b) strongly suggests that there is ‘continuous flexibility’ rather than ‘tightly clustered’ flexibility. Fig. 3 ▸(b) also shows the projection of the maps corresponding to the open and closed conformations on the extremes of the first three PCs. It is clear that the open and closed conformations are aligned mostly along the first PC, suggesting that the open/closed classification captures the most significant changes. Fig. 3 ▸(c) shows side views of a pair of structures (mean ± 2 × std, where std is the square root of the eigenvalue) for each PC. Additional details of these structures are available in Supplementary Figs. S4 and S5. Note that PCs are not to be understood as structural pathways with a biological meaning, but as directions that summarize the variance of a data set. For instance, the fact that the RBD appears and disappears at the two extremes of PC3 indicates that there is an important variability in these voxels, which is probably indicative of the up and down conformations of the RBD [to be understood in the context of the elastic analysis shown in Fig. 2 ▸(b)].\nThrough this combination of approaches, we have learnt that the spike conformation fluctuates virtually randomly in a rather continuous manner. Additionally, the approach taken to define the two algorithmically stable ‘classes’ has clearly partitioned the data set according to the main axis of variance, PC1, since the projections of the maps of these classes fall almost exclusively along PC1 and are located towards the extremes of the image-projection cloud. Note that the fraction of structural flexibility owing to PC2 and PC3 is also important in terms of the total variance of the complete image set, but that classification approaches do not seem to properly explore it. Unfortunately, the resolution in PC2 and PC3 is currently limited, so it is difficult to derive clear structural conclusions from these low-resolution maps. However, it is clear from these data that the dynamics of the spike are far richer than just a rigid body closing and opening, and involves more profound rearrangements, especially at the RBD but also at other sites. This observation is similar to that of Ke et al. (2020 ▸) when working with subtomogram averaging.\nAdditionally, the fact that PCA indicates this continuous flexibility to be a key characteristic of the spike dynamics also suggests that many other forms of partitioning (rather than properly ‘classifying’) of this continuous data set could be devised, this fact just being a consequence of the intrinsic instability created by forcing a quasi-continuous data distribution without any clustering structure to fit into a defined set of clusters. In this work, we have clearly forced the classification to go to the extremes of the data distribution, as shown in Fig. 3 ▸, probably by enforcing an algorithmically stable classification, but the key result is that any other degree of movement of the spike in between these extremes of PC1 as well as PC2 and PC3 would also be consistent with the experimental data. In other words, since the continuum of conformations does not have clear ‘cutting/classification’ points, there is a certain algorithmic uncertainty and instability as to the possible results of a classification process. Note that this instability could be exacerbated by the step of particle picking, in the sense that different picking algorithms may have different biases (precisely to minimize this instability, we have performed a ‘consensus’ approach to picking throughout this work).\nClearly, flexibility is key in this system, so that alterations in its dynamics may cause profound effects, including viral neutralization, and this could be one of the reasons for the neutralization mechanism of antibodies directed against the NTD (Chi et al., 2020 ▸).\n\n3.3. Structure of a biochemically stabilized form of the spike  \nWe have also worked with a more recent variant containing six proline substitutions in S2 (HexaPro). This second protein was also studied by Hsieh et al. (2020 ▸). In this case, after going through the same stringent particle-selection process as for the previous specimen, as presented in depth in Section 2, it was impossible to obtain stable classes, so that in Fig. 4 ▸ we present a single map (EMDB entry EMD-11341) together with its global FSC curve and a local resolution analysis. It is clear that the local resolution has increased in the moving parts (mostly the RBD and NTD), although we did not feel confident in further modeling."}

    LitCovid-PD-UBERON

    {"project":"LitCovid-PD-UBERON","denotations":[{"id":"T2","span":{"begin":2281,"end":2289},"obj":"Body_part"},{"id":"T3","span":{"begin":2397,"end":2406},"obj":"Body_part"},{"id":"T4","span":{"begin":5003,"end":5008},"obj":"Body_part"},{"id":"T5","span":{"begin":5104,"end":5109},"obj":"Body_part"},{"id":"T6","span":{"begin":5673,"end":5677},"obj":"Body_part"},{"id":"T7","span":{"begin":5734,"end":5739},"obj":"Body_part"},{"id":"T8","span":{"begin":6541,"end":6546},"obj":"Body_part"},{"id":"T9","span":{"begin":7160,"end":7165},"obj":"Body_part"}],"attributes":[{"id":"A2","pred":"uberon_id","subj":"T2","obj":"http://purl.obolibrary.org/obo/UBERON_0004288"},{"id":"A3","pred":"uberon_id","subj":"T3","obj":"http://purl.obolibrary.org/obo/UBERON_2000106"},{"id":"A4","pred":"uberon_id","subj":"T4","obj":"http://purl.obolibrary.org/obo/UBERON_0001463"},{"id":"A5","pred":"uberon_id","subj":"T5","obj":"http://purl.obolibrary.org/obo/UBERON_0001463"},{"id":"A6","pred":"uberon_id","subj":"T6","obj":"http://purl.obolibrary.org/obo/UBERON_0002398"},{"id":"A7","pred":"uberon_id","subj":"T7","obj":"http://purl.obolibrary.org/obo/UBERON_0002542"},{"id":"A8","pred":"uberon_id","subj":"T8","obj":"http://purl.obolibrary.org/obo/UBERON_0001463"},{"id":"A9","pred":"uberon_id","subj":"T9","obj":"http://purl.obolibrary.org/obo/UBERON_0001463"}],"text":"3. Results  \nWith the goal set at analyzing spike flexibility, we describe our key results step by step.\n\n3.1. The ensemble map and the way to obtain it  \nIn the following, we describe the analysis of SARS-CoV-2 spike stabilized in the prefusion state by two proline substitutions in S2 (S-2P). We will objectively demonstrate that the flexibility of the spike protein should be understood as a quasi-continuum of conformations, so that when performing a structural analysis on this specimen special care has to be paid to the image-processing workflows, since they may directly impact on the interpretability of the results.\nStarting from the original SARS-CoV-2 S-2P data set of Wrapp et al. (2020 ▸), we have completely reanalyzed the data using our public domain software integration platform Scipion (de la Rosa-Trevín et al., 2016 ▸), breaking the global 3 Å resolution barrier. A representative view of the new ensemble map and its corresponding global FSC curve is shown in Fig. 1 ▸(a) (EMDB entry EMD-11328); the sequence of a monomer of the S protein is shown on the right to facilitate the further discussion of structure–function relationships (from Wrapp et al., 2020 ▸). Figs. 1 ▸(b) and 1 ▸(c) show a comparison between the original map (Wrapp et al., 2020 ▸) with EMDB code EMD-21375 and the newly reconstructed ensemble map corresponding to EMD-11328. Clearly, the local resolution (Vilas et al., 2018 ▸), which is shown on the left in Figs. 1 ▸(b) and 1 ▸(c), is increased in the new map, and the anisotropy, which is shown in the center, is much reduced. Finally, on the right we present plots of the radially averaged tangential resolution, which is related to the quality of the angular alignment (Vilas et al., 2020 ▸); the steeper the slope, the higher the angular assignment error. As can be appreciated, the slope calculated from the newly obtained map is almost zero when compared with that for the map from Wrapp et al. (2020 ▸), indicating that, in relative terms, the particle alignment used to create the new map is better than that used to build the original map. The result is an overall quantitative enhancement in map quality.\nIn terms of tracing, besides modeling several additional residue side chains and improving the geometry of the carbon skeleton (see Supplementary Fig. S2), one of the most noticeable improvements that we observed in the new map is an extension of the glycan chains that were initially built, particularly throughout the S2 fusion subunit (PDB entry 6zow). A quantitative comparison can be made between the length of glycan chains in the new ‘ensemble structure’ with respect to the previous structure (PDB entry 6vsb; see Supplementary Table S2). Although the total number of N-linked glycosylation sequons throughout the SARS-CoV-2 S trimer is essentially the same in the new structure (45) and PDB entry 6vsb (44), we have substantially increased the length of the glycan chains, expanding the total number of glycans by about 50%. We note the importance of this extensive glycosylation for epitope accessibility and how the accurate determination of this glycan shield will facilitate efforts to rapidly develop effective vaccines and therapeutics. Supplementary Fig. S2 shows a representative section of sharpened versions of the ensemble map (EMDB entry EMD-11328) compared with EMDB entry EMD-21375, in which the glycans can now be better traced. However, we should not forget that the ensemble map contains images in which the receptor-binding domain (RBD) and N-terminal domain (NTD) are in different positions (see Section 3.2), and consequently these domains appear to be blurred. Details of how the tracing was performed can be found in Section 2, while in Supplementary Fig. S3 we present two map-to-model quality figures indicating the good fit in general, with the obvious exception of the variable parts.\n\n3.2. Flexibility analysis  \nStarting from a carefully selected set of particles obtained from our consensus and cleaning approaches (see Section 2), together with the ensemble map described previously, we subjected the data to the following flexibility analysis.\nThe original images that were part of the ensemble map went through a ‘consensus classification’ procedure aimed at separating them into two algorithmically stable classes. Essentially, and as described in more detail in Section 2, we performed two independent classifications, further selecting those particles that were consistently together throughout the two classifications. In this way, we obtained the two new classes shown in Fig. 2 ▸(a). We will refer to these as the ‘closed conformation’ [Fig. 2 ▸(a), Class 1, EMDB entry EMD-11336] and the ‘open conformation’ [Fig. 2 ▸(a), Class 2, EMDB entry EMD-11337]. The number of images in each class was reduced to 45 000 in one case and 21 000 in the other, with global FSC-based resolutions of 3.1 and 3.3 Å, respectively.\nThe open and closed structures depict a clear and concerted movement of the ‘thumb’ formed by the RBD and sub­domains 1 and 2 (SD1 and SD2) and the NTD of an adjacent chain. The thumb moves away from the central spike axis, exposing the RBD in the up conformation. In order to make clearer where the changes are at the level of the Class 1 and Class 2 maps, we have made use of the representation of map local strains in Sorzano et al. (2016 ▸), which helps to very clearly visualize the type of strains needed to relate two maps, whether these are rigid-body rotations or more complex deformations (stretching). We have termed the maps resulting from this elastic analysis as ‘1s’ (Class 1, stretching) and ‘1r’ (Class 1, rotations) on the right-hand side of Fig. 2 ▸(a) and the same for Class 2. The color scale for both stretching and rotations goes from blue for small to red for large. Clearly, the differences among the classes with respect to the NTD and RBD have a very substantial component of pure coordinated rigid-body rotations, while the different RBDs present a much more complex pattern of deformations (stretching), indicating an important structural rearrangement in this area that does not occur elsewhere in the specimen. In terms of atomic modeling, we performed a flexible fitting of the ensemble model onto the closed and open forms [see Fig. 2 ▸(a), rightmost map; the PDB code for the open conformation is PDB entry 6zp7, while that for the closed conformation is 6zp5]. Focusing on rotations, which are the most simple element to follow, we can quantify that the degree of rotation of the thumb in these classes is close to 6°, as shown in Fig. 2 ▸(b). Given this flexibility, we consider that the best way to correctly present the experimental results is through the movie provided as Supplementary Movie S1, in which maps and atomic models are presented. Within the approximation to modeling that a flexible fitting represents, we can appreciate two hinge movements of the RBD–SD1–SD2 domains: one located between amino acids 318–326 and 588–595 that produces most of the displacement, and the other between amino acids 330–335 and 527–531 that accompanies a less pronounced ‘up’ movement of the RBD. This thumb motion is completed by the accompanying motion of the NTD from an adjacent chain. Also in a collective way, other NTDs and RBDs in the down conformation move slightly, as can better be appreciated in Supplementary Movie S1, where the transition between fitted models overlaps with the interpolation between observed high-resolution class maps.\nTo further investigate whether or not the flexibility was continuous, we proceeded as follows. Images from the two classes were pooled together and, using the ensemble map, subjected to a 3D principal component analysis (PCA). The approach we followed is based on Tagare et al. (2015 ▸), with some minor modifications of the method. A detailed explanation of the modifications is given in Section 2. We initialized the first principal component (PC) to the difference between the open and closed conformation, while the remaining PCs were initialized randomly. Upon convergence, the eigenvalue of each PC and the scatter of the images in PC space was calculated. The eigenvalues of the PCs are shown in Fig. 3 ▸(a). Clearly, the first three PCs are significant. The scatter plot of the image data in PC1–PC3 space is shown in Fig. 3 ▸(b). Fig. 3 ▸(b) strongly suggests that there is ‘continuous flexibility’ rather than ‘tightly clustered’ flexibility. Fig. 3 ▸(b) also shows the projection of the maps corresponding to the open and closed conformations on the extremes of the first three PCs. It is clear that the open and closed conformations are aligned mostly along the first PC, suggesting that the open/closed classification captures the most significant changes. Fig. 3 ▸(c) shows side views of a pair of structures (mean ± 2 × std, where std is the square root of the eigenvalue) for each PC. Additional details of these structures are available in Supplementary Figs. S4 and S5. Note that PCs are not to be understood as structural pathways with a biological meaning, but as directions that summarize the variance of a data set. For instance, the fact that the RBD appears and disappears at the two extremes of PC3 indicates that there is an important variability in these voxels, which is probably indicative of the up and down conformations of the RBD [to be understood in the context of the elastic analysis shown in Fig. 2 ▸(b)].\nThrough this combination of approaches, we have learnt that the spike conformation fluctuates virtually randomly in a rather continuous manner. Additionally, the approach taken to define the two algorithmically stable ‘classes’ has clearly partitioned the data set according to the main axis of variance, PC1, since the projections of the maps of these classes fall almost exclusively along PC1 and are located towards the extremes of the image-projection cloud. Note that the fraction of structural flexibility owing to PC2 and PC3 is also important in terms of the total variance of the complete image set, but that classification approaches do not seem to properly explore it. Unfortunately, the resolution in PC2 and PC3 is currently limited, so it is difficult to derive clear structural conclusions from these low-resolution maps. However, it is clear from these data that the dynamics of the spike are far richer than just a rigid body closing and opening, and involves more profound rearrangements, especially at the RBD but also at other sites. This observation is similar to that of Ke et al. (2020 ▸) when working with subtomogram averaging.\nAdditionally, the fact that PCA indicates this continuous flexibility to be a key characteristic of the spike dynamics also suggests that many other forms of partitioning (rather than properly ‘classifying’) of this continuous data set could be devised, this fact just being a consequence of the intrinsic instability created by forcing a quasi-continuous data distribution without any clustering structure to fit into a defined set of clusters. In this work, we have clearly forced the classification to go to the extremes of the data distribution, as shown in Fig. 3 ▸, probably by enforcing an algorithmically stable classification, but the key result is that any other degree of movement of the spike in between these extremes of PC1 as well as PC2 and PC3 would also be consistent with the experimental data. In other words, since the continuum of conformations does not have clear ‘cutting/classification’ points, there is a certain algorithmic uncertainty and instability as to the possible results of a classification process. Note that this instability could be exacerbated by the step of particle picking, in the sense that different picking algorithms may have different biases (precisely to minimize this instability, we have performed a ‘consensus’ approach to picking throughout this work).\nClearly, flexibility is key in this system, so that alterations in its dynamics may cause profound effects, including viral neutralization, and this could be one of the reasons for the neutralization mechanism of antibodies directed against the NTD (Chi et al., 2020 ▸).\n\n3.3. Structure of a biochemically stabilized form of the spike  \nWe have also worked with a more recent variant containing six proline substitutions in S2 (HexaPro). This second protein was also studied by Hsieh et al. (2020 ▸). In this case, after going through the same stringent particle-selection process as for the previous specimen, as presented in depth in Section 2, it was impossible to obtain stable classes, so that in Fig. 4 ▸ we present a single map (EMDB entry EMD-11341) together with its global FSC curve and a local resolution analysis. It is clear that the local resolution has increased in the moving parts (mostly the RBD and NTD), although we did not feel confident in further modeling."}

    LitCovid-PD-MONDO

    {"project":"LitCovid-PD-MONDO","denotations":[{"id":"T16","span":{"begin":203,"end":211},"obj":"Disease"},{"id":"T17","span":{"begin":655,"end":663},"obj":"Disease"},{"id":"T18","span":{"begin":2785,"end":2793},"obj":"Disease"},{"id":"T19","span":{"begin":3550,"end":3553},"obj":"Disease"},{"id":"T21","span":{"begin":5074,"end":5077},"obj":"Disease"},{"id":"T23","span":{"begin":5880,"end":5883},"obj":"Disease"},{"id":"T25","span":{"begin":7220,"end":7223},"obj":"Disease"},{"id":"T27","span":{"begin":8310,"end":8313},"obj":"Disease"},{"id":"T28","span":{"begin":8314,"end":8317},"obj":"Disease"},{"id":"T29","span":{"begin":9230,"end":9233},"obj":"Disease"},{"id":"T30","span":{"begin":9758,"end":9761},"obj":"Disease"},{"id":"T31","span":{"begin":9844,"end":9847},"obj":"Disease"},{"id":"T32","span":{"begin":9974,"end":9977},"obj":"Disease"},{"id":"T33","span":{"begin":9982,"end":9985},"obj":"Disease"},{"id":"T34","span":{"begin":10166,"end":10169},"obj":"Disease"},{"id":"T35","span":{"begin":10174,"end":10177},"obj":"Disease"},{"id":"T36","span":{"begin":11340,"end":11343},"obj":"Disease"},{"id":"T37","span":{"begin":11355,"end":11358},"obj":"Disease"},{"id":"T38","span":{"begin":11363,"end":11366},"obj":"Disease"},{"id":"T39","span":{"begin":12156,"end":12159},"obj":"Disease"},{"id":"T41","span":{"begin":12830,"end":12833},"obj":"Disease"}],"attributes":[{"id":"A16","pred":"mondo_id","subj":"T16","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A17","pred":"mondo_id","subj":"T17","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A18","pred":"mondo_id","subj":"T18","obj":"http://purl.obolibrary.org/obo/MONDO_0005091"},{"id":"A19","pred":"mondo_id","subj":"T19","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A20","pred":"mondo_id","subj":"T19","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A21","pred":"mondo_id","subj":"T21","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A22","pred":"mondo_id","subj":"T21","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A23","pred":"mondo_id","subj":"T23","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A24","pred":"mondo_id","subj":"T23","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A25","pred":"mondo_id","subj":"T25","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A26","pred":"mondo_id","subj":"T25","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A27","pred":"mondo_id","subj":"T27","obj":"http://purl.obolibrary.org/obo/MONDO_0008173"},{"id":"A28","pred":"mondo_id","subj":"T28","obj":"http://purl.obolibrary.org/obo/MONDO_0014324"},{"id":"A29","pred":"mondo_id","subj":"T29","obj":"http://purl.obolibrary.org/obo/MONDO_0014324"},{"id":"A30","pred":"mondo_id","subj":"T30","obj":"http://purl.obolibrary.org/obo/MONDO_0008173"},{"id":"A31","pred":"mondo_id","subj":"T31","obj":"http://purl.obolibrary.org/obo/MONDO_0008173"},{"id":"A32","pred":"mondo_id","subj":"T32","obj":"http://purl.obolibrary.org/obo/MONDO_0008174"},{"id":"A33","pred":"mondo_id","subj":"T33","obj":"http://purl.obolibrary.org/obo/MONDO_0014324"},{"id":"A34","pred":"mondo_id","subj":"T34","obj":"http://purl.obolibrary.org/obo/MONDO_0008174"},{"id":"A35","pred":"mondo_id","subj":"T35","obj":"http://purl.obolibrary.org/obo/MONDO_0014324"},{"id":"A36","pred":"mondo_id","subj":"T36","obj":"http://purl.obolibrary.org/obo/MONDO_0008173"},{"id":"A37","pred":"mondo_id","subj":"T37","obj":"http://purl.obolibrary.org/obo/MONDO_0008174"},{"id":"A38","pred":"mondo_id","subj":"T38","obj":"http://purl.obolibrary.org/obo/MONDO_0014324"},{"id":"A39","pred":"mondo_id","subj":"T39","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A40","pred":"mondo_id","subj":"T39","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"},{"id":"A41","pred":"mondo_id","subj":"T41","obj":"http://purl.obolibrary.org/obo/MONDO_0008449"},{"id":"A42","pred":"mondo_id","subj":"T41","obj":"http://purl.obolibrary.org/obo/MONDO_0018075"}],"text":"3. Results  \nWith the goal set at analyzing spike flexibility, we describe our key results step by step.\n\n3.1. The ensemble map and the way to obtain it  \nIn the following, we describe the analysis of SARS-CoV-2 spike stabilized in the prefusion state by two proline substitutions in S2 (S-2P). We will objectively demonstrate that the flexibility of the spike protein should be understood as a quasi-continuum of conformations, so that when performing a structural analysis on this specimen special care has to be paid to the image-processing workflows, since they may directly impact on the interpretability of the results.\nStarting from the original SARS-CoV-2 S-2P data set of Wrapp et al. (2020 ▸), we have completely reanalyzed the data using our public domain software integration platform Scipion (de la Rosa-Trevín et al., 2016 ▸), breaking the global 3 Å resolution barrier. A representative view of the new ensemble map and its corresponding global FSC curve is shown in Fig. 1 ▸(a) (EMDB entry EMD-11328); the sequence of a monomer of the S protein is shown on the right to facilitate the further discussion of structure–function relationships (from Wrapp et al., 2020 ▸). Figs. 1 ▸(b) and 1 ▸(c) show a comparison between the original map (Wrapp et al., 2020 ▸) with EMDB code EMD-21375 and the newly reconstructed ensemble map corresponding to EMD-11328. Clearly, the local resolution (Vilas et al., 2018 ▸), which is shown on the left in Figs. 1 ▸(b) and 1 ▸(c), is increased in the new map, and the anisotropy, which is shown in the center, is much reduced. Finally, on the right we present plots of the radially averaged tangential resolution, which is related to the quality of the angular alignment (Vilas et al., 2020 ▸); the steeper the slope, the higher the angular assignment error. As can be appreciated, the slope calculated from the newly obtained map is almost zero when compared with that for the map from Wrapp et al. (2020 ▸), indicating that, in relative terms, the particle alignment used to create the new map is better than that used to build the original map. The result is an overall quantitative enhancement in map quality.\nIn terms of tracing, besides modeling several additional residue side chains and improving the geometry of the carbon skeleton (see Supplementary Fig. S2), one of the most noticeable improvements that we observed in the new map is an extension of the glycan chains that were initially built, particularly throughout the S2 fusion subunit (PDB entry 6zow). A quantitative comparison can be made between the length of glycan chains in the new ‘ensemble structure’ with respect to the previous structure (PDB entry 6vsb; see Supplementary Table S2). Although the total number of N-linked glycosylation sequons throughout the SARS-CoV-2 S trimer is essentially the same in the new structure (45) and PDB entry 6vsb (44), we have substantially increased the length of the glycan chains, expanding the total number of glycans by about 50%. We note the importance of this extensive glycosylation for epitope accessibility and how the accurate determination of this glycan shield will facilitate efforts to rapidly develop effective vaccines and therapeutics. Supplementary Fig. S2 shows a representative section of sharpened versions of the ensemble map (EMDB entry EMD-11328) compared with EMDB entry EMD-21375, in which the glycans can now be better traced. However, we should not forget that the ensemble map contains images in which the receptor-binding domain (RBD) and N-terminal domain (NTD) are in different positions (see Section 3.2), and consequently these domains appear to be blurred. Details of how the tracing was performed can be found in Section 2, while in Supplementary Fig. S3 we present two map-to-model quality figures indicating the good fit in general, with the obvious exception of the variable parts.\n\n3.2. Flexibility analysis  \nStarting from a carefully selected set of particles obtained from our consensus and cleaning approaches (see Section 2), together with the ensemble map described previously, we subjected the data to the following flexibility analysis.\nThe original images that were part of the ensemble map went through a ‘consensus classification’ procedure aimed at separating them into two algorithmically stable classes. Essentially, and as described in more detail in Section 2, we performed two independent classifications, further selecting those particles that were consistently together throughout the two classifications. In this way, we obtained the two new classes shown in Fig. 2 ▸(a). We will refer to these as the ‘closed conformation’ [Fig. 2 ▸(a), Class 1, EMDB entry EMD-11336] and the ‘open conformation’ [Fig. 2 ▸(a), Class 2, EMDB entry EMD-11337]. The number of images in each class was reduced to 45 000 in one case and 21 000 in the other, with global FSC-based resolutions of 3.1 and 3.3 Å, respectively.\nThe open and closed structures depict a clear and concerted movement of the ‘thumb’ formed by the RBD and sub­domains 1 and 2 (SD1 and SD2) and the NTD of an adjacent chain. The thumb moves away from the central spike axis, exposing the RBD in the up conformation. In order to make clearer where the changes are at the level of the Class 1 and Class 2 maps, we have made use of the representation of map local strains in Sorzano et al. (2016 ▸), which helps to very clearly visualize the type of strains needed to relate two maps, whether these are rigid-body rotations or more complex deformations (stretching). We have termed the maps resulting from this elastic analysis as ‘1s’ (Class 1, stretching) and ‘1r’ (Class 1, rotations) on the right-hand side of Fig. 2 ▸(a) and the same for Class 2. The color scale for both stretching and rotations goes from blue for small to red for large. Clearly, the differences among the classes with respect to the NTD and RBD have a very substantial component of pure coordinated rigid-body rotations, while the different RBDs present a much more complex pattern of deformations (stretching), indicating an important structural rearrangement in this area that does not occur elsewhere in the specimen. In terms of atomic modeling, we performed a flexible fitting of the ensemble model onto the closed and open forms [see Fig. 2 ▸(a), rightmost map; the PDB code for the open conformation is PDB entry 6zp7, while that for the closed conformation is 6zp5]. Focusing on rotations, which are the most simple element to follow, we can quantify that the degree of rotation of the thumb in these classes is close to 6°, as shown in Fig. 2 ▸(b). Given this flexibility, we consider that the best way to correctly present the experimental results is through the movie provided as Supplementary Movie S1, in which maps and atomic models are presented. Within the approximation to modeling that a flexible fitting represents, we can appreciate two hinge movements of the RBD–SD1–SD2 domains: one located between amino acids 318–326 and 588–595 that produces most of the displacement, and the other between amino acids 330–335 and 527–531 that accompanies a less pronounced ‘up’ movement of the RBD. This thumb motion is completed by the accompanying motion of the NTD from an adjacent chain. Also in a collective way, other NTDs and RBDs in the down conformation move slightly, as can better be appreciated in Supplementary Movie S1, where the transition between fitted models overlaps with the interpolation between observed high-resolution class maps.\nTo further investigate whether or not the flexibility was continuous, we proceeded as follows. Images from the two classes were pooled together and, using the ensemble map, subjected to a 3D principal component analysis (PCA). The approach we followed is based on Tagare et al. (2015 ▸), with some minor modifications of the method. A detailed explanation of the modifications is given in Section 2. We initialized the first principal component (PC) to the difference between the open and closed conformation, while the remaining PCs were initialized randomly. Upon convergence, the eigenvalue of each PC and the scatter of the images in PC space was calculated. The eigenvalues of the PCs are shown in Fig. 3 ▸(a). Clearly, the first three PCs are significant. The scatter plot of the image data in PC1–PC3 space is shown in Fig. 3 ▸(b). Fig. 3 ▸(b) strongly suggests that there is ‘continuous flexibility’ rather than ‘tightly clustered’ flexibility. Fig. 3 ▸(b) also shows the projection of the maps corresponding to the open and closed conformations on the extremes of the first three PCs. It is clear that the open and closed conformations are aligned mostly along the first PC, suggesting that the open/closed classification captures the most significant changes. Fig. 3 ▸(c) shows side views of a pair of structures (mean ± 2 × std, where std is the square root of the eigenvalue) for each PC. Additional details of these structures are available in Supplementary Figs. S4 and S5. Note that PCs are not to be understood as structural pathways with a biological meaning, but as directions that summarize the variance of a data set. For instance, the fact that the RBD appears and disappears at the two extremes of PC3 indicates that there is an important variability in these voxels, which is probably indicative of the up and down conformations of the RBD [to be understood in the context of the elastic analysis shown in Fig. 2 ▸(b)].\nThrough this combination of approaches, we have learnt that the spike conformation fluctuates virtually randomly in a rather continuous manner. Additionally, the approach taken to define the two algorithmically stable ‘classes’ has clearly partitioned the data set according to the main axis of variance, PC1, since the projections of the maps of these classes fall almost exclusively along PC1 and are located towards the extremes of the image-projection cloud. Note that the fraction of structural flexibility owing to PC2 and PC3 is also important in terms of the total variance of the complete image set, but that classification approaches do not seem to properly explore it. Unfortunately, the resolution in PC2 and PC3 is currently limited, so it is difficult to derive clear structural conclusions from these low-resolution maps. However, it is clear from these data that the dynamics of the spike are far richer than just a rigid body closing and opening, and involves more profound rearrangements, especially at the RBD but also at other sites. This observation is similar to that of Ke et al. (2020 ▸) when working with subtomogram averaging.\nAdditionally, the fact that PCA indicates this continuous flexibility to be a key characteristic of the spike dynamics also suggests that many other forms of partitioning (rather than properly ‘classifying’) of this continuous data set could be devised, this fact just being a consequence of the intrinsic instability created by forcing a quasi-continuous data distribution without any clustering structure to fit into a defined set of clusters. In this work, we have clearly forced the classification to go to the extremes of the data distribution, as shown in Fig. 3 ▸, probably by enforcing an algorithmically stable classification, but the key result is that any other degree of movement of the spike in between these extremes of PC1 as well as PC2 and PC3 would also be consistent with the experimental data. In other words, since the continuum of conformations does not have clear ‘cutting/classification’ points, there is a certain algorithmic uncertainty and instability as to the possible results of a classification process. Note that this instability could be exacerbated by the step of particle picking, in the sense that different picking algorithms may have different biases (precisely to minimize this instability, we have performed a ‘consensus’ approach to picking throughout this work).\nClearly, flexibility is key in this system, so that alterations in its dynamics may cause profound effects, including viral neutralization, and this could be one of the reasons for the neutralization mechanism of antibodies directed against the NTD (Chi et al., 2020 ▸).\n\n3.3. Structure of a biochemically stabilized form of the spike  \nWe have also worked with a more recent variant containing six proline substitutions in S2 (HexaPro). This second protein was also studied by Hsieh et al. (2020 ▸). In this case, after going through the same stringent particle-selection process as for the previous specimen, as presented in depth in Section 2, it was impossible to obtain stable classes, so that in Fig. 4 ▸ we present a single map (EMDB entry EMD-11341) together with its global FSC curve and a local resolution analysis. It is clear that the local resolution has increased in the moving parts (mostly the RBD and NTD), although we did not feel confident in further modeling."}

    LitCovid-PD-CLO

    {"project":"LitCovid-PD-CLO","denotations":[{"id":"T107","span":{"begin":286,"end":288},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T108","span":{"begin":286,"end":288},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T109","span":{"begin":305,"end":316},"obj":"http://purl.obolibrary.org/obo/BFO_0000030"},{"id":"T110","span":{"begin":395,"end":396},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T111","span":{"begin":455,"end":456},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T112","span":{"begin":507,"end":510},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T113","span":{"begin":865,"end":866},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T114","span":{"begin":887,"end":888},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T115","span":{"begin":993,"end":994},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T116","span":{"begin":1036,"end":1037},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T117","span":{"begin":1197,"end":1198},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T118","span":{"begin":1216,"end":1217},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T119","span":{"begin":1416,"end":1420},"obj":"http://purl.obolibrary.org/obo/CLO_0001185"},{"id":"T120","span":{"begin":1465,"end":1466},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T121","span":{"begin":2281,"end":2289},"obj":"http://purl.obolibrary.org/obo/UBERON_0004288"},{"id":"T122","span":{"begin":2314,"end":2316},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T123","span":{"begin":2314,"end":2316},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T124","span":{"begin":2483,"end":2485},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T125","span":{"begin":2483,"end":2485},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T126","span":{"begin":2519,"end":2520},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T127","span":{"begin":2705,"end":2707},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T128","span":{"begin":2705,"end":2707},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T129","span":{"begin":2851,"end":2853},"obj":"http://purl.obolibrary.org/obo/CLO_0053799"},{"id":"T130","span":{"begin":3234,"end":3236},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T131","span":{"begin":3234,"end":3236},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T132","span":{"begin":3243,"end":3244},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T133","span":{"begin":3927,"end":3928},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T134","span":{"begin":4216,"end":4217},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T135","span":{"begin":4587,"end":4593},"obj":"http://purl.obolibrary.org/obo/CLO_0001236"},{"id":"T136","span":{"begin":4653,"end":4659},"obj":"http://purl.obolibrary.org/obo/CLO_0001236"},{"id":"T137","span":{"begin":4726,"end":4732},"obj":"http://purl.obolibrary.org/obo/CLO_0001236"},{"id":"T138","span":{"begin":4816,"end":4818},"obj":"http://purl.obolibrary.org/obo/CLO_0053799"},{"id":"T139","span":{"begin":4909,"end":4910},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T140","span":{"begin":4964,"end":4965},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T141","span":{"begin":5053,"end":5056},"obj":"http://purl.obolibrary.org/obo/CLO_0008975"},{"id":"T142","span":{"begin":5691,"end":5697},"obj":"http://purl.obolibrary.org/obo/CLO_0001236"},{"id":"T143","span":{"begin":5897,"end":5898},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T144","span":{"begin":6001,"end":6002},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T145","span":{"begin":6210,"end":6211},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T146","span":{"begin":6292,"end":6298},"obj":"http://purl.obolibrary.org/obo/CLO_0001236"},{"id":"T147","span":{"begin":6422,"end":6430},"obj":"http://purl.obolibrary.org/obo/CLO_0009985"},{"id":"T148","span":{"begin":6601,"end":6602},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T149","span":{"begin":6758,"end":6760},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T150","span":{"begin":6851,"end":6852},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T151","span":{"begin":6931,"end":6934},"obj":"http://purl.obolibrary.org/obo/CLO_0008975"},{"id":"T152","span":{"begin":7111,"end":7112},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T153","span":{"begin":7256,"end":7257},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T154","span":{"begin":7386,"end":7388},"obj":"http://purl.obolibrary.org/obo/CLO_0050050"},{"id":"T155","span":{"begin":7696,"end":7697},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T156","span":{"begin":7843,"end":7844},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T157","span":{"begin":8222,"end":8223},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T158","span":{"begin":8314,"end":8317},"obj":"http://purl.obolibrary.org/obo/CLO_0008395"},{"id":"T159","span":{"begin":8314,"end":8317},"obj":"http://purl.obolibrary.org/obo/CLO_0051535"},{"id":"T160","span":{"begin":8314,"end":8317},"obj":"http://purl.obolibrary.org/obo/CLO_0052068"},{"id":"T161","span":{"begin":8314,"end":8317},"obj":"http://purl.obolibrary.org/obo/CLO_0052069"},{"id":"T162","span":{"begin":8314,"end":8317},"obj":"http://purl.obolibrary.org/obo/CLO_0052070"},{"id":"T163","span":{"begin":8314,"end":8317},"obj":"http://purl.obolibrary.org/obo/CLO_0052071"},{"id":"T164","span":{"begin":8345,"end":8346},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T165","span":{"begin":8358,"end":8359},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T166","span":{"begin":8472,"end":8473},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T167","span":{"begin":8571,"end":8579},"obj":"http://www.ebi.ac.uk/efo/EFO_0000876"},{"id":"T168","span":{"begin":8812,"end":8813},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T169","span":{"begin":8994,"end":8996},"obj":"http://purl.obolibrary.org/obo/CLO_0008933"},{"id":"T170","span":{"begin":9065,"end":9066},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T171","span":{"begin":9136,"end":9137},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T172","span":{"begin":9218,"end":9226},"obj":"http://www.ebi.ac.uk/efo/EFO_0000876"},{"id":"T173","span":{"begin":9230,"end":9233},"obj":"http://purl.obolibrary.org/obo/CLO_0008395"},{"id":"T174","span":{"begin":9230,"end":9233},"obj":"http://purl.obolibrary.org/obo/CLO_0051535"},{"id":"T175","span":{"begin":9230,"end":9233},"obj":"http://purl.obolibrary.org/obo/CLO_0052068"},{"id":"T176","span":{"begin":9230,"end":9233},"obj":"http://purl.obolibrary.org/obo/CLO_0052069"},{"id":"T177","span":{"begin":9230,"end":9233},"obj":"http://purl.obolibrary.org/obo/CLO_0052070"},{"id":"T178","span":{"begin":9230,"end":9233},"obj":"http://purl.obolibrary.org/obo/CLO_0052071"},{"id":"T179","span":{"begin":9448,"end":9449},"obj":"http://purl.obolibrary.org/obo/CLO_0001021"},{"id":"T180","span":{"begin":9569,"end":9570},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T181","span":{"begin":9681,"end":9684},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"},{"id":"T182","span":{"begin":9876,"end":9884},"obj":"http://www.ebi.ac.uk/efo/EFO_0000876"},{"id":"T183","span":{"begin":9982,"end":9985},"obj":"http://purl.obolibrary.org/obo/CLO_0008395"},{"id":"T184","span":{"begin":9982,"end":9985},"obj":"http://purl.obolibrary.org/obo/CLO_0051535"},{"id":"T185","span":{"begin":9982,"end":9985},"obj":"http://purl.obolibrary.org/obo/CLO_0052068"},{"id":"T186","span":{"begin":9982,"end":9985},"obj":"http://purl.obolibrary.org/obo/CLO_0052069"},{"id":"T187","span":{"begin":9982,"end":9985},"obj":"http://purl.obolibrary.org/obo/CLO_0052070"},{"id":"T188","span":{"begin":9982,"end":9985},"obj":"http://purl.obolibrary.org/obo/CLO_0052071"},{"id":"T189","span":{"begin":10174,"end":10177},"obj":"http://purl.obolibrary.org/obo/CLO_0008395"},{"id":"T190","span":{"begin":10174,"end":10177},"obj":"http://purl.obolibrary.org/obo/CLO_0051535"},{"id":"T191","span":{"begin":10174,"end":10177},"obj":"http://purl.obolibrary.org/obo/CLO_0052068"},{"id":"T192","span":{"begin":10174,"end":10177},"obj":"http://purl.obolibrary.org/obo/CLO_0052069"},{"id":"T193","span":{"begin":10174,"end":10177},"obj":"http://purl.obolibrary.org/obo/CLO_0052070"},{"id":"T194","span":{"begin":10174,"end":10177},"obj":"http://purl.obolibrary.org/obo/CLO_0052071"},{"id":"T195","span":{"begin":10383,"end":10384},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T196","span":{"begin":10682,"end":10683},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T197","span":{"begin":10881,"end":10882},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T198","span":{"begin":10943,"end":10944},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T199","span":{"begin":11025,"end":11026},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T200","span":{"begin":11121,"end":11129},"obj":"http://www.ebi.ac.uk/efo/EFO_0000876"},{"id":"T201","span":{"begin":11328,"end":11336},"obj":"http://www.ebi.ac.uk/efo/EFO_0000876"},{"id":"T202","span":{"begin":11363,"end":11366},"obj":"http://purl.obolibrary.org/obo/CLO_0008395"},{"id":"T203","span":{"begin":11363,"end":11366},"obj":"http://purl.obolibrary.org/obo/CLO_0051535"},{"id":"T204","span":{"begin":11363,"end":11366},"obj":"http://purl.obolibrary.org/obo/CLO_0052068"},{"id":"T205","span":{"begin":11363,"end":11366},"obj":"http://purl.obolibrary.org/obo/CLO_0052069"},{"id":"T206","span":{"begin":11363,"end":11366},"obj":"http://purl.obolibrary.org/obo/CLO_0052070"},{"id":"T207","span":{"begin":11363,"end":11366},"obj":"http://purl.obolibrary.org/obo/CLO_0052071"},{"id":"T208","span":{"begin":11535,"end":11536},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T209","span":{"begin":11615,"end":11616},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T210","span":{"begin":11854,"end":11855},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T211","span":{"begin":12202,"end":12203},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T212","span":{"begin":12274,"end":12275},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T213","span":{"begin":12336,"end":12338},"obj":"http://purl.obolibrary.org/obo/CLO_0008922"},{"id":"T214","span":{"begin":12336,"end":12338},"obj":"http://purl.obolibrary.org/obo/CLO_0050052"},{"id":"T215","span":{"begin":12634,"end":12635},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T216","span":{"begin":12709,"end":12710},"obj":"http://purl.obolibrary.org/obo/CLO_0001020"},{"id":"T217","span":{"begin":12776,"end":12779},"obj":"http://purl.obolibrary.org/obo/CLO_0051582"}],"text":"3. Results  \nWith the goal set at analyzing spike flexibility, we describe our key results step by step.\n\n3.1. The ensemble map and the way to obtain it  \nIn the following, we describe the analysis of SARS-CoV-2 spike stabilized in the prefusion state by two proline substitutions in S2 (S-2P). We will objectively demonstrate that the flexibility of the spike protein should be understood as a quasi-continuum of conformations, so that when performing a structural analysis on this specimen special care has to be paid to the image-processing workflows, since they may directly impact on the interpretability of the results.\nStarting from the original SARS-CoV-2 S-2P data set of Wrapp et al. (2020 ▸), we have completely reanalyzed the data using our public domain software integration platform Scipion (de la Rosa-Trevín et al., 2016 ▸), breaking the global 3 Å resolution barrier. A representative view of the new ensemble map and its corresponding global FSC curve is shown in Fig. 1 ▸(a) (EMDB entry EMD-11328); the sequence of a monomer of the S protein is shown on the right to facilitate the further discussion of structure–function relationships (from Wrapp et al., 2020 ▸). Figs. 1 ▸(b) and 1 ▸(c) show a comparison between the original map (Wrapp et al., 2020 ▸) with EMDB code EMD-21375 and the newly reconstructed ensemble map corresponding to EMD-11328. Clearly, the local resolution (Vilas et al., 2018 ▸), which is shown on the left in Figs. 1 ▸(b) and 1 ▸(c), is increased in the new map, and the anisotropy, which is shown in the center, is much reduced. Finally, on the right we present plots of the radially averaged tangential resolution, which is related to the quality of the angular alignment (Vilas et al., 2020 ▸); the steeper the slope, the higher the angular assignment error. As can be appreciated, the slope calculated from the newly obtained map is almost zero when compared with that for the map from Wrapp et al. (2020 ▸), indicating that, in relative terms, the particle alignment used to create the new map is better than that used to build the original map. The result is an overall quantitative enhancement in map quality.\nIn terms of tracing, besides modeling several additional residue side chains and improving the geometry of the carbon skeleton (see Supplementary Fig. S2), one of the most noticeable improvements that we observed in the new map is an extension of the glycan chains that were initially built, particularly throughout the S2 fusion subunit (PDB entry 6zow). A quantitative comparison can be made between the length of glycan chains in the new ‘ensemble structure’ with respect to the previous structure (PDB entry 6vsb; see Supplementary Table S2). Although the total number of N-linked glycosylation sequons throughout the SARS-CoV-2 S trimer is essentially the same in the new structure (45) and PDB entry 6vsb (44), we have substantially increased the length of the glycan chains, expanding the total number of glycans by about 50%. We note the importance of this extensive glycosylation for epitope accessibility and how the accurate determination of this glycan shield will facilitate efforts to rapidly develop effective vaccines and therapeutics. Supplementary Fig. S2 shows a representative section of sharpened versions of the ensemble map (EMDB entry EMD-11328) compared with EMDB entry EMD-21375, in which the glycans can now be better traced. However, we should not forget that the ensemble map contains images in which the receptor-binding domain (RBD) and N-terminal domain (NTD) are in different positions (see Section 3.2), and consequently these domains appear to be blurred. Details of how the tracing was performed can be found in Section 2, while in Supplementary Fig. S3 we present two map-to-model quality figures indicating the good fit in general, with the obvious exception of the variable parts.\n\n3.2. Flexibility analysis  \nStarting from a carefully selected set of particles obtained from our consensus and cleaning approaches (see Section 2), together with the ensemble map described previously, we subjected the data to the following flexibility analysis.\nThe original images that were part of the ensemble map went through a ‘consensus classification’ procedure aimed at separating them into two algorithmically stable classes. Essentially, and as described in more detail in Section 2, we performed two independent classifications, further selecting those particles that were consistently together throughout the two classifications. In this way, we obtained the two new classes shown in Fig. 2 ▸(a). We will refer to these as the ‘closed conformation’ [Fig. 2 ▸(a), Class 1, EMDB entry EMD-11336] and the ‘open conformation’ [Fig. 2 ▸(a), Class 2, EMDB entry EMD-11337]. The number of images in each class was reduced to 45 000 in one case and 21 000 in the other, with global FSC-based resolutions of 3.1 and 3.3 Å, respectively.\nThe open and closed structures depict a clear and concerted movement of the ‘thumb’ formed by the RBD and sub­domains 1 and 2 (SD1 and SD2) and the NTD of an adjacent chain. The thumb moves away from the central spike axis, exposing the RBD in the up conformation. In order to make clearer where the changes are at the level of the Class 1 and Class 2 maps, we have made use of the representation of map local strains in Sorzano et al. (2016 ▸), which helps to very clearly visualize the type of strains needed to relate two maps, whether these are rigid-body rotations or more complex deformations (stretching). We have termed the maps resulting from this elastic analysis as ‘1s’ (Class 1, stretching) and ‘1r’ (Class 1, rotations) on the right-hand side of Fig. 2 ▸(a) and the same for Class 2. The color scale for both stretching and rotations goes from blue for small to red for large. Clearly, the differences among the classes with respect to the NTD and RBD have a very substantial component of pure coordinated rigid-body rotations, while the different RBDs present a much more complex pattern of deformations (stretching), indicating an important structural rearrangement in this area that does not occur elsewhere in the specimen. In terms of atomic modeling, we performed a flexible fitting of the ensemble model onto the closed and open forms [see Fig. 2 ▸(a), rightmost map; the PDB code for the open conformation is PDB entry 6zp7, while that for the closed conformation is 6zp5]. Focusing on rotations, which are the most simple element to follow, we can quantify that the degree of rotation of the thumb in these classes is close to 6°, as shown in Fig. 2 ▸(b). Given this flexibility, we consider that the best way to correctly present the experimental results is through the movie provided as Supplementary Movie S1, in which maps and atomic models are presented. Within the approximation to modeling that a flexible fitting represents, we can appreciate two hinge movements of the RBD–SD1–SD2 domains: one located between amino acids 318–326 and 588–595 that produces most of the displacement, and the other between amino acids 330–335 and 527–531 that accompanies a less pronounced ‘up’ movement of the RBD. This thumb motion is completed by the accompanying motion of the NTD from an adjacent chain. Also in a collective way, other NTDs and RBDs in the down conformation move slightly, as can better be appreciated in Supplementary Movie S1, where the transition between fitted models overlaps with the interpolation between observed high-resolution class maps.\nTo further investigate whether or not the flexibility was continuous, we proceeded as follows. Images from the two classes were pooled together and, using the ensemble map, subjected to a 3D principal component analysis (PCA). The approach we followed is based on Tagare et al. (2015 ▸), with some minor modifications of the method. A detailed explanation of the modifications is given in Section 2. We initialized the first principal component (PC) to the difference between the open and closed conformation, while the remaining PCs were initialized randomly. Upon convergence, the eigenvalue of each PC and the scatter of the images in PC space was calculated. The eigenvalues of the PCs are shown in Fig. 3 ▸(a). Clearly, the first three PCs are significant. The scatter plot of the image data in PC1–PC3 space is shown in Fig. 3 ▸(b). Fig. 3 ▸(b) strongly suggests that there is ‘continuous flexibility’ rather than ‘tightly clustered’ flexibility. Fig. 3 ▸(b) also shows the projection of the maps corresponding to the open and closed conformations on the extremes of the first three PCs. It is clear that the open and closed conformations are aligned mostly along the first PC, suggesting that the open/closed classification captures the most significant changes. Fig. 3 ▸(c) shows side views of a pair of structures (mean ± 2 × std, where std is the square root of the eigenvalue) for each PC. Additional details of these structures are available in Supplementary Figs. S4 and S5. Note that PCs are not to be understood as structural pathways with a biological meaning, but as directions that summarize the variance of a data set. For instance, the fact that the RBD appears and disappears at the two extremes of PC3 indicates that there is an important variability in these voxels, which is probably indicative of the up and down conformations of the RBD [to be understood in the context of the elastic analysis shown in Fig. 2 ▸(b)].\nThrough this combination of approaches, we have learnt that the spike conformation fluctuates virtually randomly in a rather continuous manner. Additionally, the approach taken to define the two algorithmically stable ‘classes’ has clearly partitioned the data set according to the main axis of variance, PC1, since the projections of the maps of these classes fall almost exclusively along PC1 and are located towards the extremes of the image-projection cloud. Note that the fraction of structural flexibility owing to PC2 and PC3 is also important in terms of the total variance of the complete image set, but that classification approaches do not seem to properly explore it. Unfortunately, the resolution in PC2 and PC3 is currently limited, so it is difficult to derive clear structural conclusions from these low-resolution maps. However, it is clear from these data that the dynamics of the spike are far richer than just a rigid body closing and opening, and involves more profound rearrangements, especially at the RBD but also at other sites. This observation is similar to that of Ke et al. (2020 ▸) when working with subtomogram averaging.\nAdditionally, the fact that PCA indicates this continuous flexibility to be a key characteristic of the spike dynamics also suggests that many other forms of partitioning (rather than properly ‘classifying’) of this continuous data set could be devised, this fact just being a consequence of the intrinsic instability created by forcing a quasi-continuous data distribution without any clustering structure to fit into a defined set of clusters. In this work, we have clearly forced the classification to go to the extremes of the data distribution, as shown in Fig. 3 ▸, probably by enforcing an algorithmically stable classification, but the key result is that any other degree of movement of the spike in between these extremes of PC1 as well as PC2 and PC3 would also be consistent with the experimental data. In other words, since the continuum of conformations does not have clear ‘cutting/classification’ points, there is a certain algorithmic uncertainty and instability as to the possible results of a classification process. Note that this instability could be exacerbated by the step of particle picking, in the sense that different picking algorithms may have different biases (precisely to minimize this instability, we have performed a ‘consensus’ approach to picking throughout this work).\nClearly, flexibility is key in this system, so that alterations in its dynamics may cause profound effects, including viral neutralization, and this could be one of the reasons for the neutralization mechanism of antibodies directed against the NTD (Chi et al., 2020 ▸).\n\n3.3. Structure of a biochemically stabilized form of the spike  \nWe have also worked with a more recent variant containing six proline substitutions in S2 (HexaPro). This second protein was also studied by Hsieh et al. (2020 ▸). In this case, after going through the same stringent particle-selection process as for the previous specimen, as presented in depth in Section 2, it was impossible to obtain stable classes, so that in Fig. 4 ▸ we present a single map (EMDB entry EMD-11341) together with its global FSC curve and a local resolution analysis. It is clear that the local resolution has increased in the moving parts (mostly the RBD and NTD), although we did not feel confident in further modeling."}

    LitCovid-PD-GO-BP

    {"project":"LitCovid-PD-GO-BP","denotations":[{"id":"T5","span":{"begin":2739,"end":2761},"obj":"http://purl.obolibrary.org/obo/GO_0006487"},{"id":"T6","span":{"begin":2748,"end":2761},"obj":"http://purl.obolibrary.org/obo/GO_0070085"},{"id":"T7","span":{"begin":3038,"end":3051},"obj":"http://purl.obolibrary.org/obo/GO_0070085"}],"text":"3. Results  \nWith the goal set at analyzing spike flexibility, we describe our key results step by step.\n\n3.1. The ensemble map and the way to obtain it  \nIn the following, we describe the analysis of SARS-CoV-2 spike stabilized in the prefusion state by two proline substitutions in S2 (S-2P). We will objectively demonstrate that the flexibility of the spike protein should be understood as a quasi-continuum of conformations, so that when performing a structural analysis on this specimen special care has to be paid to the image-processing workflows, since they may directly impact on the interpretability of the results.\nStarting from the original SARS-CoV-2 S-2P data set of Wrapp et al. (2020 ▸), we have completely reanalyzed the data using our public domain software integration platform Scipion (de la Rosa-Trevín et al., 2016 ▸), breaking the global 3 Å resolution barrier. A representative view of the new ensemble map and its corresponding global FSC curve is shown in Fig. 1 ▸(a) (EMDB entry EMD-11328); the sequence of a monomer of the S protein is shown on the right to facilitate the further discussion of structure–function relationships (from Wrapp et al., 2020 ▸). Figs. 1 ▸(b) and 1 ▸(c) show a comparison between the original map (Wrapp et al., 2020 ▸) with EMDB code EMD-21375 and the newly reconstructed ensemble map corresponding to EMD-11328. Clearly, the local resolution (Vilas et al., 2018 ▸), which is shown on the left in Figs. 1 ▸(b) and 1 ▸(c), is increased in the new map, and the anisotropy, which is shown in the center, is much reduced. Finally, on the right we present plots of the radially averaged tangential resolution, which is related to the quality of the angular alignment (Vilas et al., 2020 ▸); the steeper the slope, the higher the angular assignment error. As can be appreciated, the slope calculated from the newly obtained map is almost zero when compared with that for the map from Wrapp et al. (2020 ▸), indicating that, in relative terms, the particle alignment used to create the new map is better than that used to build the original map. The result is an overall quantitative enhancement in map quality.\nIn terms of tracing, besides modeling several additional residue side chains and improving the geometry of the carbon skeleton (see Supplementary Fig. S2), one of the most noticeable improvements that we observed in the new map is an extension of the glycan chains that were initially built, particularly throughout the S2 fusion subunit (PDB entry 6zow). A quantitative comparison can be made between the length of glycan chains in the new ‘ensemble structure’ with respect to the previous structure (PDB entry 6vsb; see Supplementary Table S2). Although the total number of N-linked glycosylation sequons throughout the SARS-CoV-2 S trimer is essentially the same in the new structure (45) and PDB entry 6vsb (44), we have substantially increased the length of the glycan chains, expanding the total number of glycans by about 50%. We note the importance of this extensive glycosylation for epitope accessibility and how the accurate determination of this glycan shield will facilitate efforts to rapidly develop effective vaccines and therapeutics. Supplementary Fig. S2 shows a representative section of sharpened versions of the ensemble map (EMDB entry EMD-11328) compared with EMDB entry EMD-21375, in which the glycans can now be better traced. However, we should not forget that the ensemble map contains images in which the receptor-binding domain (RBD) and N-terminal domain (NTD) are in different positions (see Section 3.2), and consequently these domains appear to be blurred. Details of how the tracing was performed can be found in Section 2, while in Supplementary Fig. S3 we present two map-to-model quality figures indicating the good fit in general, with the obvious exception of the variable parts.\n\n3.2. Flexibility analysis  \nStarting from a carefully selected set of particles obtained from our consensus and cleaning approaches (see Section 2), together with the ensemble map described previously, we subjected the data to the following flexibility analysis.\nThe original images that were part of the ensemble map went through a ‘consensus classification’ procedure aimed at separating them into two algorithmically stable classes. Essentially, and as described in more detail in Section 2, we performed two independent classifications, further selecting those particles that were consistently together throughout the two classifications. In this way, we obtained the two new classes shown in Fig. 2 ▸(a). We will refer to these as the ‘closed conformation’ [Fig. 2 ▸(a), Class 1, EMDB entry EMD-11336] and the ‘open conformation’ [Fig. 2 ▸(a), Class 2, EMDB entry EMD-11337]. The number of images in each class was reduced to 45 000 in one case and 21 000 in the other, with global FSC-based resolutions of 3.1 and 3.3 Å, respectively.\nThe open and closed structures depict a clear and concerted movement of the ‘thumb’ formed by the RBD and sub­domains 1 and 2 (SD1 and SD2) and the NTD of an adjacent chain. The thumb moves away from the central spike axis, exposing the RBD in the up conformation. In order to make clearer where the changes are at the level of the Class 1 and Class 2 maps, we have made use of the representation of map local strains in Sorzano et al. (2016 ▸), which helps to very clearly visualize the type of strains needed to relate two maps, whether these are rigid-body rotations or more complex deformations (stretching). We have termed the maps resulting from this elastic analysis as ‘1s’ (Class 1, stretching) and ‘1r’ (Class 1, rotations) on the right-hand side of Fig. 2 ▸(a) and the same for Class 2. The color scale for both stretching and rotations goes from blue for small to red for large. Clearly, the differences among the classes with respect to the NTD and RBD have a very substantial component of pure coordinated rigid-body rotations, while the different RBDs present a much more complex pattern of deformations (stretching), indicating an important structural rearrangement in this area that does not occur elsewhere in the specimen. In terms of atomic modeling, we performed a flexible fitting of the ensemble model onto the closed and open forms [see Fig. 2 ▸(a), rightmost map; the PDB code for the open conformation is PDB entry 6zp7, while that for the closed conformation is 6zp5]. Focusing on rotations, which are the most simple element to follow, we can quantify that the degree of rotation of the thumb in these classes is close to 6°, as shown in Fig. 2 ▸(b). Given this flexibility, we consider that the best way to correctly present the experimental results is through the movie provided as Supplementary Movie S1, in which maps and atomic models are presented. Within the approximation to modeling that a flexible fitting represents, we can appreciate two hinge movements of the RBD–SD1–SD2 domains: one located between amino acids 318–326 and 588–595 that produces most of the displacement, and the other between amino acids 330–335 and 527–531 that accompanies a less pronounced ‘up’ movement of the RBD. This thumb motion is completed by the accompanying motion of the NTD from an adjacent chain. Also in a collective way, other NTDs and RBDs in the down conformation move slightly, as can better be appreciated in Supplementary Movie S1, where the transition between fitted models overlaps with the interpolation between observed high-resolution class maps.\nTo further investigate whether or not the flexibility was continuous, we proceeded as follows. Images from the two classes were pooled together and, using the ensemble map, subjected to a 3D principal component analysis (PCA). The approach we followed is based on Tagare et al. (2015 ▸), with some minor modifications of the method. A detailed explanation of the modifications is given in Section 2. We initialized the first principal component (PC) to the difference between the open and closed conformation, while the remaining PCs were initialized randomly. Upon convergence, the eigenvalue of each PC and the scatter of the images in PC space was calculated. The eigenvalues of the PCs are shown in Fig. 3 ▸(a). Clearly, the first three PCs are significant. The scatter plot of the image data in PC1–PC3 space is shown in Fig. 3 ▸(b). Fig. 3 ▸(b) strongly suggests that there is ‘continuous flexibility’ rather than ‘tightly clustered’ flexibility. Fig. 3 ▸(b) also shows the projection of the maps corresponding to the open and closed conformations on the extremes of the first three PCs. It is clear that the open and closed conformations are aligned mostly along the first PC, suggesting that the open/closed classification captures the most significant changes. Fig. 3 ▸(c) shows side views of a pair of structures (mean ± 2 × std, where std is the square root of the eigenvalue) for each PC. Additional details of these structures are available in Supplementary Figs. S4 and S5. Note that PCs are not to be understood as structural pathways with a biological meaning, but as directions that summarize the variance of a data set. For instance, the fact that the RBD appears and disappears at the two extremes of PC3 indicates that there is an important variability in these voxels, which is probably indicative of the up and down conformations of the RBD [to be understood in the context of the elastic analysis shown in Fig. 2 ▸(b)].\nThrough this combination of approaches, we have learnt that the spike conformation fluctuates virtually randomly in a rather continuous manner. Additionally, the approach taken to define the two algorithmically stable ‘classes’ has clearly partitioned the data set according to the main axis of variance, PC1, since the projections of the maps of these classes fall almost exclusively along PC1 and are located towards the extremes of the image-projection cloud. Note that the fraction of structural flexibility owing to PC2 and PC3 is also important in terms of the total variance of the complete image set, but that classification approaches do not seem to properly explore it. Unfortunately, the resolution in PC2 and PC3 is currently limited, so it is difficult to derive clear structural conclusions from these low-resolution maps. However, it is clear from these data that the dynamics of the spike are far richer than just a rigid body closing and opening, and involves more profound rearrangements, especially at the RBD but also at other sites. This observation is similar to that of Ke et al. (2020 ▸) when working with subtomogram averaging.\nAdditionally, the fact that PCA indicates this continuous flexibility to be a key characteristic of the spike dynamics also suggests that many other forms of partitioning (rather than properly ‘classifying’) of this continuous data set could be devised, this fact just being a consequence of the intrinsic instability created by forcing a quasi-continuous data distribution without any clustering structure to fit into a defined set of clusters. In this work, we have clearly forced the classification to go to the extremes of the data distribution, as shown in Fig. 3 ▸, probably by enforcing an algorithmically stable classification, but the key result is that any other degree of movement of the spike in between these extremes of PC1 as well as PC2 and PC3 would also be consistent with the experimental data. In other words, since the continuum of conformations does not have clear ‘cutting/classification’ points, there is a certain algorithmic uncertainty and instability as to the possible results of a classification process. Note that this instability could be exacerbated by the step of particle picking, in the sense that different picking algorithms may have different biases (precisely to minimize this instability, we have performed a ‘consensus’ approach to picking throughout this work).\nClearly, flexibility is key in this system, so that alterations in its dynamics may cause profound effects, including viral neutralization, and this could be one of the reasons for the neutralization mechanism of antibodies directed against the NTD (Chi et al., 2020 ▸).\n\n3.3. Structure of a biochemically stabilized form of the spike  \nWe have also worked with a more recent variant containing six proline substitutions in S2 (HexaPro). This second protein was also studied by Hsieh et al. (2020 ▸). In this case, after going through the same stringent particle-selection process as for the previous specimen, as presented in depth in Section 2, it was impossible to obtain stable classes, so that in Fig. 4 ▸ we present a single map (EMDB entry EMD-11341) together with its global FSC curve and a local resolution analysis. It is clear that the local resolution has increased in the moving parts (mostly the RBD and NTD), although we did not feel confident in further modeling."}

    LitCovid-sentences

    {"project":"LitCovid-sentences","denotations":[{"id":"T145","span":{"begin":0,"end":2},"obj":"Sentence"},{"id":"T146","span":{"begin":4,"end":13},"obj":"Sentence"},{"id":"T147","span":{"begin":14,"end":105},"obj":"Sentence"},{"id":"T148","span":{"begin":107,"end":111},"obj":"Sentence"},{"id":"T149","span":{"begin":113,"end":156},"obj":"Sentence"},{"id":"T150","span":{"begin":157,"end":296},"obj":"Sentence"},{"id":"T151","span":{"begin":297,"end":627},"obj":"Sentence"},{"id":"T152","span":{"begin":628,"end":886},"obj":"Sentence"},{"id":"T153","span":{"begin":887,"end":1186},"obj":"Sentence"},{"id":"T154","span":{"begin":1187,"end":1192},"obj":"Sentence"},{"id":"T155","span":{"begin":1193,"end":1370},"obj":"Sentence"},{"id":"T156","span":{"begin":1371,"end":1460},"obj":"Sentence"},{"id":"T157","span":{"begin":1461,"end":1575},"obj":"Sentence"},{"id":"T158","span":{"begin":1576,"end":1807},"obj":"Sentence"},{"id":"T159","span":{"begin":1808,"end":2096},"obj":"Sentence"},{"id":"T160","span":{"begin":2097,"end":2162},"obj":"Sentence"},{"id":"T161","span":{"begin":2163,"end":2518},"obj":"Sentence"},{"id":"T162","span":{"begin":2519,"end":2709},"obj":"Sentence"},{"id":"T163","span":{"begin":2710,"end":2996},"obj":"Sentence"},{"id":"T164","span":{"begin":2997,"end":3214},"obj":"Sentence"},{"id":"T165","span":{"begin":3215,"end":3415},"obj":"Sentence"},{"id":"T166","span":{"begin":3416,"end":3653},"obj":"Sentence"},{"id":"T167","span":{"begin":3654,"end":3882},"obj":"Sentence"},{"id":"T168","span":{"begin":3884,"end":3888},"obj":"Sentence"},{"id":"T169","span":{"begin":3890,"end":3912},"obj":"Sentence"},{"id":"T170","span":{"begin":3913,"end":4147},"obj":"Sentence"},{"id":"T171","span":{"begin":4148,"end":4320},"obj":"Sentence"},{"id":"T172","span":{"begin":4321,"end":4527},"obj":"Sentence"},{"id":"T173","span":{"begin":4528,"end":4594},"obj":"Sentence"},{"id":"T174","span":{"begin":4595,"end":4765},"obj":"Sentence"},{"id":"T175","span":{"begin":4766,"end":4925},"obj":"Sentence"},{"id":"T176","span":{"begin":4926,"end":5099},"obj":"Sentence"},{"id":"T177","span":{"begin":5100,"end":5190},"obj":"Sentence"},{"id":"T178","span":{"begin":5191,"end":5538},"obj":"Sentence"},{"id":"T179","span":{"begin":5539,"end":5723},"obj":"Sentence"},{"id":"T180","span":{"begin":5724,"end":5816},"obj":"Sentence"},{"id":"T181","span":{"begin":5817,"end":6167},"obj":"Sentence"},{"id":"T182","span":{"begin":6168,"end":6421},"obj":"Sentence"},{"id":"T183","span":{"begin":6422,"end":6604},"obj":"Sentence"},{"id":"T184","span":{"begin":6605,"end":6808},"obj":"Sentence"},{"id":"T185","span":{"begin":6809,"end":7154},"obj":"Sentence"},{"id":"T186","span":{"begin":7155,"end":7247},"obj":"Sentence"},{"id":"T187","span":{"begin":7248,"end":7509},"obj":"Sentence"},{"id":"T188","span":{"begin":7510,"end":7604},"obj":"Sentence"},{"id":"T189","span":{"begin":7605,"end":7736},"obj":"Sentence"},{"id":"T190","span":{"begin":7737,"end":7842},"obj":"Sentence"},{"id":"T191","span":{"begin":7843,"end":7909},"obj":"Sentence"},{"id":"T192","span":{"begin":7910,"end":8070},"obj":"Sentence"},{"id":"T193","span":{"begin":8071,"end":8172},"obj":"Sentence"},{"id":"T194","span":{"begin":8173,"end":8225},"obj":"Sentence"},{"id":"T195","span":{"begin":8226,"end":8271},"obj":"Sentence"},{"id":"T196","span":{"begin":8272,"end":8348},"obj":"Sentence"},{"id":"T197","span":{"begin":8349,"end":8462},"obj":"Sentence"},{"id":"T198","span":{"begin":8463,"end":8603},"obj":"Sentence"},{"id":"T199","span":{"begin":8604,"end":8779},"obj":"Sentence"},{"id":"T200","span":{"begin":8780,"end":8910},"obj":"Sentence"},{"id":"T201","span":{"begin":8911,"end":8986},"obj":"Sentence"},{"id":"T202","span":{"begin":8987,"end":8997},"obj":"Sentence"},{"id":"T203","span":{"begin":8998,"end":9147},"obj":"Sentence"},{"id":"T204","span":{"begin":9148,"end":9452},"obj":"Sentence"},{"id":"T205","span":{"begin":9453,"end":9596},"obj":"Sentence"},{"id":"T206","span":{"begin":9597,"end":9915},"obj":"Sentence"},{"id":"T207","span":{"begin":9916,"end":10132},"obj":"Sentence"},{"id":"T208","span":{"begin":10133,"end":10289},"obj":"Sentence"},{"id":"T209","span":{"begin":10290,"end":10506},"obj":"Sentence"},{"id":"T210","span":{"begin":10507,"end":10605},"obj":"Sentence"},{"id":"T211","span":{"begin":10606,"end":11051},"obj":"Sentence"},{"id":"T212","span":{"begin":11052,"end":11419},"obj":"Sentence"},{"id":"T213","span":{"begin":11420,"end":11640},"obj":"Sentence"},{"id":"T214","span":{"begin":11641,"end":11910},"obj":"Sentence"},{"id":"T215","span":{"begin":11911,"end":12181},"obj":"Sentence"},{"id":"T216","span":{"begin":12183,"end":12187},"obj":"Sentence"},{"id":"T217","span":{"begin":12189,"end":12248},"obj":"Sentence"},{"id":"T218","span":{"begin":12249,"end":12349},"obj":"Sentence"},{"id":"T219","span":{"begin":12350,"end":12412},"obj":"Sentence"},{"id":"T220","span":{"begin":12413,"end":12737},"obj":"Sentence"},{"id":"T221","span":{"begin":12738,"end":12891},"obj":"Sentence"}],"namespaces":[{"prefix":"_base","uri":"http://pubannotation.org/ontology/tao.owl#"}],"text":"3. Results  \nWith the goal set at analyzing spike flexibility, we describe our key results step by step.\n\n3.1. The ensemble map and the way to obtain it  \nIn the following, we describe the analysis of SARS-CoV-2 spike stabilized in the prefusion state by two proline substitutions in S2 (S-2P). We will objectively demonstrate that the flexibility of the spike protein should be understood as a quasi-continuum of conformations, so that when performing a structural analysis on this specimen special care has to be paid to the image-processing workflows, since they may directly impact on the interpretability of the results.\nStarting from the original SARS-CoV-2 S-2P data set of Wrapp et al. (2020 ▸), we have completely reanalyzed the data using our public domain software integration platform Scipion (de la Rosa-Trevín et al., 2016 ▸), breaking the global 3 Å resolution barrier. A representative view of the new ensemble map and its corresponding global FSC curve is shown in Fig. 1 ▸(a) (EMDB entry EMD-11328); the sequence of a monomer of the S protein is shown on the right to facilitate the further discussion of structure–function relationships (from Wrapp et al., 2020 ▸). Figs. 1 ▸(b) and 1 ▸(c) show a comparison between the original map (Wrapp et al., 2020 ▸) with EMDB code EMD-21375 and the newly reconstructed ensemble map corresponding to EMD-11328. Clearly, the local resolution (Vilas et al., 2018 ▸), which is shown on the left in Figs. 1 ▸(b) and 1 ▸(c), is increased in the new map, and the anisotropy, which is shown in the center, is much reduced. Finally, on the right we present plots of the radially averaged tangential resolution, which is related to the quality of the angular alignment (Vilas et al., 2020 ▸); the steeper the slope, the higher the angular assignment error. As can be appreciated, the slope calculated from the newly obtained map is almost zero when compared with that for the map from Wrapp et al. (2020 ▸), indicating that, in relative terms, the particle alignment used to create the new map is better than that used to build the original map. The result is an overall quantitative enhancement in map quality.\nIn terms of tracing, besides modeling several additional residue side chains and improving the geometry of the carbon skeleton (see Supplementary Fig. S2), one of the most noticeable improvements that we observed in the new map is an extension of the glycan chains that were initially built, particularly throughout the S2 fusion subunit (PDB entry 6zow). A quantitative comparison can be made between the length of glycan chains in the new ‘ensemble structure’ with respect to the previous structure (PDB entry 6vsb; see Supplementary Table S2). Although the total number of N-linked glycosylation sequons throughout the SARS-CoV-2 S trimer is essentially the same in the new structure (45) and PDB entry 6vsb (44), we have substantially increased the length of the glycan chains, expanding the total number of glycans by about 50%. We note the importance of this extensive glycosylation for epitope accessibility and how the accurate determination of this glycan shield will facilitate efforts to rapidly develop effective vaccines and therapeutics. Supplementary Fig. S2 shows a representative section of sharpened versions of the ensemble map (EMDB entry EMD-11328) compared with EMDB entry EMD-21375, in which the glycans can now be better traced. However, we should not forget that the ensemble map contains images in which the receptor-binding domain (RBD) and N-terminal domain (NTD) are in different positions (see Section 3.2), and consequently these domains appear to be blurred. Details of how the tracing was performed can be found in Section 2, while in Supplementary Fig. S3 we present two map-to-model quality figures indicating the good fit in general, with the obvious exception of the variable parts.\n\n3.2. Flexibility analysis  \nStarting from a carefully selected set of particles obtained from our consensus and cleaning approaches (see Section 2), together with the ensemble map described previously, we subjected the data to the following flexibility analysis.\nThe original images that were part of the ensemble map went through a ‘consensus classification’ procedure aimed at separating them into two algorithmically stable classes. Essentially, and as described in more detail in Section 2, we performed two independent classifications, further selecting those particles that were consistently together throughout the two classifications. In this way, we obtained the two new classes shown in Fig. 2 ▸(a). We will refer to these as the ‘closed conformation’ [Fig. 2 ▸(a), Class 1, EMDB entry EMD-11336] and the ‘open conformation’ [Fig. 2 ▸(a), Class 2, EMDB entry EMD-11337]. The number of images in each class was reduced to 45 000 in one case and 21 000 in the other, with global FSC-based resolutions of 3.1 and 3.3 Å, respectively.\nThe open and closed structures depict a clear and concerted movement of the ‘thumb’ formed by the RBD and sub­domains 1 and 2 (SD1 and SD2) and the NTD of an adjacent chain. The thumb moves away from the central spike axis, exposing the RBD in the up conformation. In order to make clearer where the changes are at the level of the Class 1 and Class 2 maps, we have made use of the representation of map local strains in Sorzano et al. (2016 ▸), which helps to very clearly visualize the type of strains needed to relate two maps, whether these are rigid-body rotations or more complex deformations (stretching). We have termed the maps resulting from this elastic analysis as ‘1s’ (Class 1, stretching) and ‘1r’ (Class 1, rotations) on the right-hand side of Fig. 2 ▸(a) and the same for Class 2. The color scale for both stretching and rotations goes from blue for small to red for large. Clearly, the differences among the classes with respect to the NTD and RBD have a very substantial component of pure coordinated rigid-body rotations, while the different RBDs present a much more complex pattern of deformations (stretching), indicating an important structural rearrangement in this area that does not occur elsewhere in the specimen. In terms of atomic modeling, we performed a flexible fitting of the ensemble model onto the closed and open forms [see Fig. 2 ▸(a), rightmost map; the PDB code for the open conformation is PDB entry 6zp7, while that for the closed conformation is 6zp5]. Focusing on rotations, which are the most simple element to follow, we can quantify that the degree of rotation of the thumb in these classes is close to 6°, as shown in Fig. 2 ▸(b). Given this flexibility, we consider that the best way to correctly present the experimental results is through the movie provided as Supplementary Movie S1, in which maps and atomic models are presented. Within the approximation to modeling that a flexible fitting represents, we can appreciate two hinge movements of the RBD–SD1–SD2 domains: one located between amino acids 318–326 and 588–595 that produces most of the displacement, and the other between amino acids 330–335 and 527–531 that accompanies a less pronounced ‘up’ movement of the RBD. This thumb motion is completed by the accompanying motion of the NTD from an adjacent chain. Also in a collective way, other NTDs and RBDs in the down conformation move slightly, as can better be appreciated in Supplementary Movie S1, where the transition between fitted models overlaps with the interpolation between observed high-resolution class maps.\nTo further investigate whether or not the flexibility was continuous, we proceeded as follows. Images from the two classes were pooled together and, using the ensemble map, subjected to a 3D principal component analysis (PCA). The approach we followed is based on Tagare et al. (2015 ▸), with some minor modifications of the method. A detailed explanation of the modifications is given in Section 2. We initialized the first principal component (PC) to the difference between the open and closed conformation, while the remaining PCs were initialized randomly. Upon convergence, the eigenvalue of each PC and the scatter of the images in PC space was calculated. The eigenvalues of the PCs are shown in Fig. 3 ▸(a). Clearly, the first three PCs are significant. The scatter plot of the image data in PC1–PC3 space is shown in Fig. 3 ▸(b). Fig. 3 ▸(b) strongly suggests that there is ‘continuous flexibility’ rather than ‘tightly clustered’ flexibility. Fig. 3 ▸(b) also shows the projection of the maps corresponding to the open and closed conformations on the extremes of the first three PCs. It is clear that the open and closed conformations are aligned mostly along the first PC, suggesting that the open/closed classification captures the most significant changes. Fig. 3 ▸(c) shows side views of a pair of structures (mean ± 2 × std, where std is the square root of the eigenvalue) for each PC. Additional details of these structures are available in Supplementary Figs. S4 and S5. Note that PCs are not to be understood as structural pathways with a biological meaning, but as directions that summarize the variance of a data set. For instance, the fact that the RBD appears and disappears at the two extremes of PC3 indicates that there is an important variability in these voxels, which is probably indicative of the up and down conformations of the RBD [to be understood in the context of the elastic analysis shown in Fig. 2 ▸(b)].\nThrough this combination of approaches, we have learnt that the spike conformation fluctuates virtually randomly in a rather continuous manner. Additionally, the approach taken to define the two algorithmically stable ‘classes’ has clearly partitioned the data set according to the main axis of variance, PC1, since the projections of the maps of these classes fall almost exclusively along PC1 and are located towards the extremes of the image-projection cloud. Note that the fraction of structural flexibility owing to PC2 and PC3 is also important in terms of the total variance of the complete image set, but that classification approaches do not seem to properly explore it. Unfortunately, the resolution in PC2 and PC3 is currently limited, so it is difficult to derive clear structural conclusions from these low-resolution maps. However, it is clear from these data that the dynamics of the spike are far richer than just a rigid body closing and opening, and involves more profound rearrangements, especially at the RBD but also at other sites. This observation is similar to that of Ke et al. (2020 ▸) when working with subtomogram averaging.\nAdditionally, the fact that PCA indicates this continuous flexibility to be a key characteristic of the spike dynamics also suggests that many other forms of partitioning (rather than properly ‘classifying’) of this continuous data set could be devised, this fact just being a consequence of the intrinsic instability created by forcing a quasi-continuous data distribution without any clustering structure to fit into a defined set of clusters. In this work, we have clearly forced the classification to go to the extremes of the data distribution, as shown in Fig. 3 ▸, probably by enforcing an algorithmically stable classification, but the key result is that any other degree of movement of the spike in between these extremes of PC1 as well as PC2 and PC3 would also be consistent with the experimental data. In other words, since the continuum of conformations does not have clear ‘cutting/classification’ points, there is a certain algorithmic uncertainty and instability as to the possible results of a classification process. Note that this instability could be exacerbated by the step of particle picking, in the sense that different picking algorithms may have different biases (precisely to minimize this instability, we have performed a ‘consensus’ approach to picking throughout this work).\nClearly, flexibility is key in this system, so that alterations in its dynamics may cause profound effects, including viral neutralization, and this could be one of the reasons for the neutralization mechanism of antibodies directed against the NTD (Chi et al., 2020 ▸).\n\n3.3. Structure of a biochemically stabilized form of the spike  \nWe have also worked with a more recent variant containing six proline substitutions in S2 (HexaPro). This second protein was also studied by Hsieh et al. (2020 ▸). In this case, after going through the same stringent particle-selection process as for the previous specimen, as presented in depth in Section 2, it was impossible to obtain stable classes, so that in Fig. 4 ▸ we present a single map (EMDB entry EMD-11341) together with its global FSC curve and a local resolution analysis. It is clear that the local resolution has increased in the moving parts (mostly the RBD and NTD), although we did not feel confident in further modeling."}

    LitCovid-PubTator

    {"project":"LitCovid-PubTator","denotations":[{"id":"107","span":{"begin":45,"end":50},"obj":"Gene"},{"id":"113","span":{"begin":290,"end":294},"obj":"Gene"},{"id":"114","span":{"begin":357,"end":362},"obj":"Gene"},{"id":"115","span":{"begin":214,"end":219},"obj":"Gene"},{"id":"116","span":{"begin":203,"end":213},"obj":"Species"},{"id":"117","span":{"begin":261,"end":268},"obj":"Chemical"},{"id":"120","span":{"begin":666,"end":670},"obj":"Gene"},{"id":"121","span":{"begin":655,"end":665},"obj":"Species"},{"id":"132","span":{"begin":2796,"end":2797},"obj":"Gene"},{"id":"133","span":{"begin":2785,"end":2795},"obj":"Species"},{"id":"134","span":{"begin":2274,"end":2280},"obj":"Chemical"},{"id":"135","span":{"begin":2414,"end":2420},"obj":"Chemical"},{"id":"136","span":{"begin":2579,"end":2585},"obj":"Chemical"},{"id":"137","span":{"begin":2739,"end":2740},"obj":"Chemical"},{"id":"138","span":{"begin":2930,"end":2936},"obj":"Chemical"},{"id":"139","span":{"begin":2975,"end":2982},"obj":"Chemical"},{"id":"140","span":{"begin":3121,"end":3127},"obj":"Chemical"},{"id":"141","span":{"begin":3382,"end":3389},"obj":"Chemical"},{"id":"143","span":{"begin":4670,"end":4690},"obj":"Disease"},{"id":"149","span":{"begin":5053,"end":5056},"obj":"Gene"},{"id":"150","span":{"begin":5258,"end":5277},"obj":"Gene"},{"id":"151","span":{"begin":5138,"end":5143},"obj":"Gene"},{"id":"152","span":{"begin":6931,"end":6934},"obj":"Gene"},{"id":"153","span":{"begin":7280,"end":7284},"obj":"Disease"},{"id":"156","span":{"begin":9230,"end":9233},"obj":"Gene"},{"id":"157","span":{"begin":8310,"end":8317},"obj":"CellLine"},{"id":"166","span":{"begin":9982,"end":9985},"obj":"Gene"},{"id":"167","span":{"begin":10174,"end":10177},"obj":"Gene"},{"id":"168","span":{"begin":9974,"end":9977},"obj":"Gene"},{"id":"169","span":{"begin":10352,"end":10357},"obj":"Gene"},{"id":"170","span":{"begin":9517,"end":9522},"obj":"Gene"},{"id":"171","span":{"begin":10166,"end":10169},"obj":"Gene"},{"id":"172","span":{"begin":9758,"end":9761},"obj":"CellLine"},{"id":"173","span":{"begin":9844,"end":9847},"obj":"CellLine"},{"id":"179","span":{"begin":11363,"end":11366},"obj":"Gene"},{"id":"180","span":{"begin":11305,"end":11310},"obj":"Gene"},{"id":"181","span":{"begin":10710,"end":10715},"obj":"Gene"},{"id":"182","span":{"begin":11355,"end":11358},"obj":"Gene"},{"id":"183","span":{"begin":11340,"end":11343},"obj":"CellLine"},{"id":"185","span":{"begin":12241,"end":12246},"obj":"Gene"},{"id":"188","span":{"begin":12311,"end":12318},"obj":"Chemical"},{"id":"189","span":{"begin":12390,"end":12395},"obj":"Disease"}],"attributes":[{"id":"A107","pred":"tao:has_database_id","subj":"107","obj":"Gene:43740568"},{"id":"A113","pred":"tao:has_database_id","subj":"113","obj":"Gene:51360"},{"id":"A114","pred":"tao:has_database_id","subj":"114","obj":"Gene:43740568"},{"id":"A115","pred":"tao:has_database_id","subj":"115","obj":"Gene:43740568"},{"id":"A116","pred":"tao:has_database_id","subj":"116","obj":"Tax:2697049"},{"id":"A117","pred":"tao:has_database_id","subj":"117","obj":"MESH:D011392"},{"id":"A120","pred":"tao:has_database_id","subj":"120","obj":"Gene:51360"},{"id":"A121","pred":"tao:has_database_id","subj":"121","obj":"Tax:2697049"},{"id":"A132","pred":"tao:has_database_id","subj":"132","obj":"Gene:43740568"},{"id":"A133","pred":"tao:has_database_id","subj":"133","obj":"Tax:2697049"},{"id":"A134","pred":"tao:has_database_id","subj":"134","obj":"MESH:D002244"},{"id":"A135","pred":"tao:has_database_id","subj":"135","obj":"MESH:D011134"},{"id":"A136","pred":"tao:has_database_id","subj":"136","obj":"MESH:D011134"},{"id":"A137","pred":"tao:has_database_id","subj":"137","obj":"MESH:D009584"},{"id":"A138","pred":"tao:has_database_id","subj":"138","obj":"MESH:D011134"},{"id":"A139","pred":"tao:has_database_id","subj":"139","obj":"MESH:D011134"},{"id":"A140","pred":"tao:has_database_id","subj":"140","obj":"MESH:D011134"},{"id":"A141","pred":"tao:has_database_id","subj":"141","obj":"MESH:D011134"},{"id":"A143","pred":"tao:has_database_id","subj":"143","obj":"MESH:C557826"},{"id":"A149","pred":"tao:has_database_id","subj":"149","obj":"Gene:57306"},{"id":"A151","pred":"tao:has_database_id","subj":"151","obj":"Gene:43740568"},{"id":"A152","pred":"tao:has_database_id","subj":"152","obj":"Gene:57306"},{"id":"A156","pred":"tao:has_database_id","subj":"156","obj":"Gene:57332"},{"id":"A157","pred":"tao:has_database_id","subj":"157","obj":"CVCL:0152"},{"id":"A166","pred":"tao:has_database_id","subj":"166","obj":"Gene:57332"},{"id":"A167","pred":"tao:has_database_id","subj":"167","obj":"Gene:57332"},{"id":"A168","pred":"tao:has_database_id","subj":"168","obj":"Gene:8535"},{"id":"A169","pred":"tao:has_database_id","subj":"169","obj":"Gene:43740568"},{"id":"A170","pred":"tao:has_database_id","subj":"170","obj":"Gene:43740568"},{"id":"A171","pred":"tao:has_database_id","subj":"171","obj":"Gene:8535"},{"id":"A172","pred":"tao:has_database_id","subj":"172","obj":"CVCL:0152"},{"id":"A173","pred":"tao:has_database_id","subj":"173","obj":"CVCL:0152"},{"id":"A179","pred":"tao:has_database_id","subj":"179","obj":"Gene:57332"},{"id":"A180","pred":"tao:has_database_id","subj":"180","obj":"Gene:43740568"},{"id":"A181","pred":"tao:has_database_id","subj":"181","obj":"Gene:43740568"},{"id":"A182","pred":"tao:has_database_id","subj":"182","obj":"Gene:8535"},{"id":"A183","pred":"tao:has_database_id","subj":"183","obj":"CVCL:0152"},{"id":"A185","pred":"tao:has_database_id","subj":"185","obj":"Gene:43740568"},{"id":"A188","pred":"tao:has_database_id","subj":"188","obj":"MESH:D011392"}],"namespaces":[{"prefix":"Tax","uri":"https://www.ncbi.nlm.nih.gov/taxonomy/"},{"prefix":"MESH","uri":"https://id.nlm.nih.gov/mesh/"},{"prefix":"Gene","uri":"https://www.ncbi.nlm.nih.gov/gene/"},{"prefix":"CVCL","uri":"https://web.expasy.org/cellosaurus/CVCL_"}],"text":"3. Results  \nWith the goal set at analyzing spike flexibility, we describe our key results step by step.\n\n3.1. The ensemble map and the way to obtain it  \nIn the following, we describe the analysis of SARS-CoV-2 spike stabilized in the prefusion state by two proline substitutions in S2 (S-2P). We will objectively demonstrate that the flexibility of the spike protein should be understood as a quasi-continuum of conformations, so that when performing a structural analysis on this specimen special care has to be paid to the image-processing workflows, since they may directly impact on the interpretability of the results.\nStarting from the original SARS-CoV-2 S-2P data set of Wrapp et al. (2020 ▸), we have completely reanalyzed the data using our public domain software integration platform Scipion (de la Rosa-Trevín et al., 2016 ▸), breaking the global 3 Å resolution barrier. A representative view of the new ensemble map and its corresponding global FSC curve is shown in Fig. 1 ▸(a) (EMDB entry EMD-11328); the sequence of a monomer of the S protein is shown on the right to facilitate the further discussion of structure–function relationships (from Wrapp et al., 2020 ▸). Figs. 1 ▸(b) and 1 ▸(c) show a comparison between the original map (Wrapp et al., 2020 ▸) with EMDB code EMD-21375 and the newly reconstructed ensemble map corresponding to EMD-11328. Clearly, the local resolution (Vilas et al., 2018 ▸), which is shown on the left in Figs. 1 ▸(b) and 1 ▸(c), is increased in the new map, and the anisotropy, which is shown in the center, is much reduced. Finally, on the right we present plots of the radially averaged tangential resolution, which is related to the quality of the angular alignment (Vilas et al., 2020 ▸); the steeper the slope, the higher the angular assignment error. As can be appreciated, the slope calculated from the newly obtained map is almost zero when compared with that for the map from Wrapp et al. (2020 ▸), indicating that, in relative terms, the particle alignment used to create the new map is better than that used to build the original map. The result is an overall quantitative enhancement in map quality.\nIn terms of tracing, besides modeling several additional residue side chains and improving the geometry of the carbon skeleton (see Supplementary Fig. S2), one of the most noticeable improvements that we observed in the new map is an extension of the glycan chains that were initially built, particularly throughout the S2 fusion subunit (PDB entry 6zow). A quantitative comparison can be made between the length of glycan chains in the new ‘ensemble structure’ with respect to the previous structure (PDB entry 6vsb; see Supplementary Table S2). Although the total number of N-linked glycosylation sequons throughout the SARS-CoV-2 S trimer is essentially the same in the new structure (45) and PDB entry 6vsb (44), we have substantially increased the length of the glycan chains, expanding the total number of glycans by about 50%. We note the importance of this extensive glycosylation for epitope accessibility and how the accurate determination of this glycan shield will facilitate efforts to rapidly develop effective vaccines and therapeutics. Supplementary Fig. S2 shows a representative section of sharpened versions of the ensemble map (EMDB entry EMD-11328) compared with EMDB entry EMD-21375, in which the glycans can now be better traced. However, we should not forget that the ensemble map contains images in which the receptor-binding domain (RBD) and N-terminal domain (NTD) are in different positions (see Section 3.2), and consequently these domains appear to be blurred. Details of how the tracing was performed can be found in Section 2, while in Supplementary Fig. S3 we present two map-to-model quality figures indicating the good fit in general, with the obvious exception of the variable parts.\n\n3.2. Flexibility analysis  \nStarting from a carefully selected set of particles obtained from our consensus and cleaning approaches (see Section 2), together with the ensemble map described previously, we subjected the data to the following flexibility analysis.\nThe original images that were part of the ensemble map went through a ‘consensus classification’ procedure aimed at separating them into two algorithmically stable classes. Essentially, and as described in more detail in Section 2, we performed two independent classifications, further selecting those particles that were consistently together throughout the two classifications. In this way, we obtained the two new classes shown in Fig. 2 ▸(a). We will refer to these as the ‘closed conformation’ [Fig. 2 ▸(a), Class 1, EMDB entry EMD-11336] and the ‘open conformation’ [Fig. 2 ▸(a), Class 2, EMDB entry EMD-11337]. The number of images in each class was reduced to 45 000 in one case and 21 000 in the other, with global FSC-based resolutions of 3.1 and 3.3 Å, respectively.\nThe open and closed structures depict a clear and concerted movement of the ‘thumb’ formed by the RBD and sub­domains 1 and 2 (SD1 and SD2) and the NTD of an adjacent chain. The thumb moves away from the central spike axis, exposing the RBD in the up conformation. In order to make clearer where the changes are at the level of the Class 1 and Class 2 maps, we have made use of the representation of map local strains in Sorzano et al. (2016 ▸), which helps to very clearly visualize the type of strains needed to relate two maps, whether these are rigid-body rotations or more complex deformations (stretching). We have termed the maps resulting from this elastic analysis as ‘1s’ (Class 1, stretching) and ‘1r’ (Class 1, rotations) on the right-hand side of Fig. 2 ▸(a) and the same for Class 2. The color scale for both stretching and rotations goes from blue for small to red for large. Clearly, the differences among the classes with respect to the NTD and RBD have a very substantial component of pure coordinated rigid-body rotations, while the different RBDs present a much more complex pattern of deformations (stretching), indicating an important structural rearrangement in this area that does not occur elsewhere in the specimen. In terms of atomic modeling, we performed a flexible fitting of the ensemble model onto the closed and open forms [see Fig. 2 ▸(a), rightmost map; the PDB code for the open conformation is PDB entry 6zp7, while that for the closed conformation is 6zp5]. Focusing on rotations, which are the most simple element to follow, we can quantify that the degree of rotation of the thumb in these classes is close to 6°, as shown in Fig. 2 ▸(b). Given this flexibility, we consider that the best way to correctly present the experimental results is through the movie provided as Supplementary Movie S1, in which maps and atomic models are presented. Within the approximation to modeling that a flexible fitting represents, we can appreciate two hinge movements of the RBD–SD1–SD2 domains: one located between amino acids 318–326 and 588–595 that produces most of the displacement, and the other between amino acids 330–335 and 527–531 that accompanies a less pronounced ‘up’ movement of the RBD. This thumb motion is completed by the accompanying motion of the NTD from an adjacent chain. Also in a collective way, other NTDs and RBDs in the down conformation move slightly, as can better be appreciated in Supplementary Movie S1, where the transition between fitted models overlaps with the interpolation between observed high-resolution class maps.\nTo further investigate whether or not the flexibility was continuous, we proceeded as follows. Images from the two classes were pooled together and, using the ensemble map, subjected to a 3D principal component analysis (PCA). The approach we followed is based on Tagare et al. (2015 ▸), with some minor modifications of the method. A detailed explanation of the modifications is given in Section 2. We initialized the first principal component (PC) to the difference between the open and closed conformation, while the remaining PCs were initialized randomly. Upon convergence, the eigenvalue of each PC and the scatter of the images in PC space was calculated. The eigenvalues of the PCs are shown in Fig. 3 ▸(a). Clearly, the first three PCs are significant. The scatter plot of the image data in PC1–PC3 space is shown in Fig. 3 ▸(b). Fig. 3 ▸(b) strongly suggests that there is ‘continuous flexibility’ rather than ‘tightly clustered’ flexibility. Fig. 3 ▸(b) also shows the projection of the maps corresponding to the open and closed conformations on the extremes of the first three PCs. It is clear that the open and closed conformations are aligned mostly along the first PC, suggesting that the open/closed classification captures the most significant changes. Fig. 3 ▸(c) shows side views of a pair of structures (mean ± 2 × std, where std is the square root of the eigenvalue) for each PC. Additional details of these structures are available in Supplementary Figs. S4 and S5. Note that PCs are not to be understood as structural pathways with a biological meaning, but as directions that summarize the variance of a data set. For instance, the fact that the RBD appears and disappears at the two extremes of PC3 indicates that there is an important variability in these voxels, which is probably indicative of the up and down conformations of the RBD [to be understood in the context of the elastic analysis shown in Fig. 2 ▸(b)].\nThrough this combination of approaches, we have learnt that the spike conformation fluctuates virtually randomly in a rather continuous manner. Additionally, the approach taken to define the two algorithmically stable ‘classes’ has clearly partitioned the data set according to the main axis of variance, PC1, since the projections of the maps of these classes fall almost exclusively along PC1 and are located towards the extremes of the image-projection cloud. Note that the fraction of structural flexibility owing to PC2 and PC3 is also important in terms of the total variance of the complete image set, but that classification approaches do not seem to properly explore it. Unfortunately, the resolution in PC2 and PC3 is currently limited, so it is difficult to derive clear structural conclusions from these low-resolution maps. However, it is clear from these data that the dynamics of the spike are far richer than just a rigid body closing and opening, and involves more profound rearrangements, especially at the RBD but also at other sites. This observation is similar to that of Ke et al. (2020 ▸) when working with subtomogram averaging.\nAdditionally, the fact that PCA indicates this continuous flexibility to be a key characteristic of the spike dynamics also suggests that many other forms of partitioning (rather than properly ‘classifying’) of this continuous data set could be devised, this fact just being a consequence of the intrinsic instability created by forcing a quasi-continuous data distribution without any clustering structure to fit into a defined set of clusters. In this work, we have clearly forced the classification to go to the extremes of the data distribution, as shown in Fig. 3 ▸, probably by enforcing an algorithmically stable classification, but the key result is that any other degree of movement of the spike in between these extremes of PC1 as well as PC2 and PC3 would also be consistent with the experimental data. In other words, since the continuum of conformations does not have clear ‘cutting/classification’ points, there is a certain algorithmic uncertainty and instability as to the possible results of a classification process. Note that this instability could be exacerbated by the step of particle picking, in the sense that different picking algorithms may have different biases (precisely to minimize this instability, we have performed a ‘consensus’ approach to picking throughout this work).\nClearly, flexibility is key in this system, so that alterations in its dynamics may cause profound effects, including viral neutralization, and this could be one of the reasons for the neutralization mechanism of antibodies directed against the NTD (Chi et al., 2020 ▸).\n\n3.3. Structure of a biochemically stabilized form of the spike  \nWe have also worked with a more recent variant containing six proline substitutions in S2 (HexaPro). This second protein was also studied by Hsieh et al. (2020 ▸). In this case, after going through the same stringent particle-selection process as for the previous specimen, as presented in depth in Section 2, it was impossible to obtain stable classes, so that in Fig. 4 ▸ we present a single map (EMDB entry EMD-11341) together with its global FSC curve and a local resolution analysis. It is clear that the local resolution has increased in the moving parts (mostly the RBD and NTD), although we did not feel confident in further modeling."}