PMC:539967 JSONTXT

Annnotations TAB JSON ListView MergeView

{"target":"https://pubannotation.org/docs/sourcedb/PMC/sourceid/539967","sourcedb":"PMC","sourceid":"539967","source_url":"http://www.ncbi.nlm.nih.gov/pmc/539967","text":"Carbohydrate Structure Suite (CSS): analysis of carbohydrate 3D structures derived from the PDB \n\nAbstract\nKnowledge of the 3D structure of glycoproteins and protein–carbohydrate complexes is indispensable to fully understand the biological processes they are involved in. Carbohydrate Structure Suite is an attempt to automatically analyse carbohydrate structures contained in the PDB and make the results publicly available on the internet. Characteristic torsion angles, glycoprotein sequences and carbohydrate–protein interactions are analysed. Furthermore, tools to crosslink the PDB and carbohydrate databases and to check the integrity of carbohydrate 3D structures are included. The service is available at (www.dkfz.de/spec/css/).\n\nINTRODUCTION\nCarbohydrate–protein interactions are implicated in a variety of cell–cell and cell–matrix recognition events, ranging from fertilization, cellular differentiation and development to pathological situations like inflammation, viral and bacterial infections, immune response and metastasis (1–3). These events require a specific recognition of different carbohydrate structures by carbohydrate-binding proteins, the lectins (4).\nMost of the carbohydrates involved in these recognition processes are part of glycoproteins. Protein glycosylation is probably the most common and complex co- and post-translational modification. In many cases, the glycan chains are mandatory for the correct folding process (5). They also alter the properties of the proteins, e.g. by increasing solubility, protecting them from proteolysis or reducing backbone flexibility (2). Variations in the terminal residues of glycans attached to a protein enable a cell- and tissue-specific fine-tuning of protein properties without a change in the amino acid sequence (5).\nTo be able to understand these processes in detail, it is often indispensable to know the 3D structures of the glycoproteins or protein–carbohydrate complexes, respectively. The largest publicly available source of such 3D structures is the PDB (6). Unfortunately, there is no standard nomenclature for carbohydrate residues (2,7), which makes the analysis of these data difficult. Moreover, many of the structural characteristics of interest are secondary data, given only implicitly in the structures, e.g. linkage torsion angles. It is impossible to extract this information efficiently based on the nomenclature that is present in the PDB entries alone.\nCarbohydrate Structure Suite (CSS) is an attempt to automatically detect and assign carbohydrate structures in the PDB in a nomenclature-independent manner, analyse them and make the results publicly available on the internet.\n\npdb2linucs: detection of carbohydrate structures in PDB entries\npdb2linucs (7) detects carbohydrate structures using an algorithm that is based on element types and atom coordinates only. The algorithm is independent of residue notation and thus bridges the gap of a missing standard nomenclature for carbohydrates in PDB entries. The program can be queried by PDB ID or by uploading a structure from a local computer. The results are displayed using the LINUCS nomenclature, a linear, unique notation for carbohydrate structures (8). This notation is also used to query GlycosciencesDB, the former SweetDB (9). In case there is a related entry in that database, a direct link is offered (Figure 1). Thereby, pdb2linucs establishes a cross-linking between a protein database and a carbohydrate database.\nUsing an extended version of pdb2linucs, three data sets of carbohydrate- and glycoprotein-related data were generated from the PDB. These data sets, entitled GlyTorsionDB, GlySeqDB and GlyVicinityDB, are stored in extensible markup language (XML)-based text files and queried by specialized software, which can be accessed through web interfaces.\n\nGlyTorsionDB: a database of carbohydrate torsion angles\n3D protein structures are mainly characterized by their backbone torsion angles (10). Using a similar approach, carbohydrate structures can be described by the torsion angles of the glycosidic linkages connecting monosaccharide residues. GlyTorsionDB is a database containing the torsion angle values of all carbohydrate linkages present in the PDB. Furthermore, it includes the ring torsion angles of the single monosaccharides, omega torsion angles for exocyclic hydroymethyl-groups, side chain torsion angles of asparagine residues involved in N-linked glycosylation and the torsion angles of N-acetyl groups attached to carbohydrate rings. The database is queried by the software GlyTorsion.\nIn most cases, a statistic evaluation of the distribution of angles for a specific structural feature is the most informative way to look at the data. Therefore, GlyTorsion performs a graphical output of results by generating diagrams or plots (histograms). From these, the user can see at a glance, which angles are preferred by the residues of interest. In a subsequent step, the numerical values of the angles together with the information about the PDB entries they originate from can be displayed for the entire data, which have been used to generate the diagram. Additionally, the detailed data of user-defined ranges can be displayed.\n\ncarp: CArbohydrate Ramachandran Plot\nAnother possibility to query GlyTorsionDB is carp, the CArbohydrate Ramachandran Plot. The ‘Ramachandran Plot’, where protein backbone torsion angles Φ and Ψ are plotted against each other, is a frequently used tool to evaluate the quality of a protein 3D structure (11). For carbohydrate structures, linkage torsions can be evaluated in a similar way. In contrast to proteins, however, the frequency of populated torsion angles depends on the monosaccharide residues involved in the linkage as well as the kind of linkage (1–3, 1–4, etc.). carp analyses carbohydrate data given in PDB files using the pdb2linucs algorithm and compares them to the data available in GlyTorsionDB (Figure 2a). For each different combination of monosaccharides and linkage type, a separate plot is generated.\nFor seldom occurring linkages, there is only few data available in the PDB. Therefore, carp offers the possibility to query another database, the GlycoMapsDB, available at www.dkfz.de/spec/glycomaps/. That database contains conformational maps derived from molecular dynamics simulations and is therefore independent of the number of entries found for a given linkage in the PDB (Figure 2b).\n\nGlySeqDB: glycoprotein sequences database\nAbout 70% of all proteins contain potential N-glycosylation sites (Asn-Xxx-Ser/Thr, Xxx not Pro). For reasons that are still unclear, not all of them are in fact glycosylated (12). Unfortunately, only very few experimental data are available, where it has been unambiguously shown, that a specific glycosylation site is occupied by a defined glycan. This lack in experimental data hampers to decipher the structural parameters, which are required for glycosylation. Therefore, glycoprotein sequences taken from the PDB and from carefully assigned SwissProt entries have been used to gain deeper insights into factors that regulate glycan attachment to a N-glycosylation site (13, 14). GlySeqDB makes these data now available, which had to be extracted manually until recently. GlySeqDB provides online access to glycoprotein sequences originating from PDB and SwissProt. The query software GlySeq analyses the statistics of amino acids in the sequential neighbourhood of N- and O-glycosylation sites. In addition to the results in text mode, GlySeq is able to display graphical results by generating diagrams.\nWithin the PDB, there are many redundant sequences. If, for example, a protein is a trimer consisting of three identical chains, and the structure of that protein was resolved at four different conditions, then there are 12 identical protein chains in the database. Therefore, GlySeqDB also contains datasets where redundant information is removed.\nBesides the selection of the source dataset and the type of glycosylation site, GlySeq offers the use of additional filters, e.g. to analyse only those N-glycosylation sites where the residue at position +2 is Ser and not Thr or vice versa.\n\nGlyVicinityDB: a database about carbohydrate–protein interactions\nBesides the amino acids in sequential neighbourhood of glycosylation sites, those in the spatial vicinity of carbohydrate residues determine the characteristics of glycoproteins. The latter ones are of special interest for the examination of the interactions between carbohydrates and carbohydrate-binding proteins. Since carbohydrate ligands are not covalently bound, sequence analysis comparable to that for glycosylation sites is not applicable here.\nGlyVicinityDB contains lists of distances about the amino acids in the spatial vicinity of carbohydrate residues present in the PDB. The database is queried by the software GlyVicinity. This program performs a statistical analysis of the frequency of amino acids within a user-definable distance (up to 10 Å) of carbohydrate residues. Like GlySeq, GlyVicinity is also able to display results as diagrams. In a subsequent step, examination of the atoms forming the closest contacts between carbohydrate and amino acid residues is also possible. Moreover, GlyVicinity enables the user to define subsets of the database by selecting the PDB entries to be considered, e.g. to investigate virus-related proteins only.\n\npdb-care: integrity check of carbohydrate structures\nAn analysis of the entire carbohydrate data in the PDB revealed that about 30% of all carbohydrate-containing PDB entries comprise one or several errors, most of which are due to inconsistencies in residue nomenclature or erroneous connection data (7). This results from a lack of check software for carbohydrate structures like it exists for the protein parts of PDB entries (11). pdb-care (PDB CArbohydrate REsidue check) (15) provides such a check by comparing the residue names derived by the pdb2linucs algorithm with those present in the file to be checked; and inconsistencies are reported. Furthermore, the program performs a check of the information given in the CONECT records of PDB files. Frequent use of this software will increase the quality of carbohydrate information within the PDB.\n\nUpdates\nThe PDB is updated weekly with new structures. The update process of the databases contained in CSS is mainly automatic; therefore the service always represents the actually available data in the PDB.\n\nImplementation\nCSS is running on a Linux PC with Apache web server software. Interaction with the user is realized by PHP interfaces. The data sets are stored in XML-formatted text files, and the programs to query these data are written in C.\nVisualization of protein and carbohydrate structures ( pdb2lincus, GlyVicinity) or torsion angles (GlyTorsion, carp) is performed by the java applet Jmol (http://jmol.sourceforge.net) or the plugin Chime (www.mdlchime.com). Diagrams and plots are generated in scalable vector graphics (SVG) format. For browsers that cannot display SVG files, they are converted to graphics interchange format (GIF) files using ImageMagick (http://imagemagick.sourceforge.net).\nThe service is hosted at and maintained by the German Cancer Research Centre in Heidelberg, Germany. It can be accessed online at (www.dkfz.de/spec/css/). The databases and tools, which CSS consist of, are also integrated into the Glycosciences web portal (www.glycosciences.de).\n\nOutlook\nThe download of retrieved data is currently only possible using copy/paste mechanisms. We are currently working to provide XML formats for all retrievable data, so that an efficient way of data exchange will be established. Additionally, we will provide access to structural parameters of the protein backbone in the vicinity of glycosylation sites. With the GlyProt application (available at the glycosciences web portal), an efficient tool is available to perform in silico glycosylation of a given 3D protein structure. ","divisions":[{"label":"Title","span":{"begin":0,"end":96}},{"label":"Abstract","span":{"begin":107,"end":740}},{"label":"Body","span":{"begin":741,"end":12012}},{"label":"Section","span":{"begin":741,"end":12011}},{"label":"Title","span":{"begin":741,"end":753}},{"label":"Section","span":{"begin":2685,"end":3836}},{"label":"Title","span":{"begin":2685,"end":2748}},{"label":"Section","span":{"begin":3838,"end":5231}},{"label":"Title","span":{"begin":3838,"end":3893}},{"label":"Section","span":{"begin":5233,"end":6451}},{"label":"Title","span":{"begin":5233,"end":5269}},{"label":"Section","span":{"begin":6453,"end":8194}},{"label":"Title","span":{"begin":6453,"end":6494}},{"label":"Section","span":{"begin":8196,"end":9428}},{"label":"Title","span":{"begin":8196,"end":8261}},{"label":"Section","span":{"begin":9430,"end":10283}},{"label":"Title","span":{"begin":9430,"end":9482}},{"label":"Section","span":{"begin":10285,"end":10493}},{"label":"Title","span":{"begin":10285,"end":10292}},{"label":"Section","span":{"begin":10495,"end":11478}},{"label":"Title","span":{"begin":10495,"end":10509}},{"label":"Section","span":{"begin":11480,"end":12010}},{"label":"Title","span":{"begin":11480,"end":11487}}],"tracks":[{"project":"2_test","denotations":[{"id":"15608187-11841247-77246187","span":{"begin":1044,"end":1045},"obj":"11841247"},{"id":"15608187-8577698-77246187","span":{"begin":1044,"end":1045},"obj":"8577698"},{"id":"15608187-12223270-77246188","span":{"begin":1178,"end":1179},"obj":"12223270"},{"id":"15608187-11269317-77246189","span":{"begin":1458,"end":1459},"obj":"11269317"},{"id":"15608187-11841247-77246190","span":{"begin":1608,"end":1609},"obj":"11841247"},{"id":"15608187-11269317-77246191","span":{"begin":1795,"end":1796},"obj":"11269317"},{"id":"15608187-10592235-77246192","span":{"begin":2045,"end":2046},"obj":"10592235"},{"id":"15608187-11841247-77246193","span":{"begin":2125,"end":2126},"obj":"11841247"},{"id":"15608187-15010309-77246194","span":{"begin":2127,"end":2128},"obj":"15010309"},{"id":"15608187-15010309-77246195","span":{"begin":2761,"end":2762},"obj":"15010309"},{"id":"15608187-11675023-77246196","span":{"begin":3216,"end":3217},"obj":"11675023"},{"id":"15608187-11752350-77246197","span":{"begin":3293,"end":3294},"obj":"11752350"},{"id":"15608187-8568893-77246198","span":{"begin":3975,"end":3977},"obj":"8568893"},{"id":"15608187-11056479-77246199","span":{"begin":5537,"end":5539},"obj":"11056479"},{"id":"15608187-10580125-77246200","span":{"begin":6671,"end":6673},"obj":"10580125"},{"id":"15608187-14514716-77246201","span":{"begin":7171,"end":7173},"obj":"14514716"},{"id":"15608187-14514714-77246202","span":{"begin":7175,"end":7177},"obj":"14514714"},{"id":"15608187-15010309-77246203","span":{"begin":9732,"end":9733},"obj":"15010309"},{"id":"15608187-11056479-77246204","span":{"begin":9860,"end":9862},"obj":"11056479"},{"id":"15608187-15180909-77246205","span":{"begin":9908,"end":9910},"obj":"15180909"}],"attributes":[{"subj":"15608187-11841247-77246187","pred":"source","obj":"2_test"},{"subj":"15608187-8577698-77246187","pred":"source","obj":"2_test"},{"subj":"15608187-12223270-77246188","pred":"source","obj":"2_test"},{"subj":"15608187-11269317-77246189","pred":"source","obj":"2_test"},{"subj":"15608187-11841247-77246190","pred":"source","obj":"2_test"},{"subj":"15608187-11269317-77246191","pred":"source","obj":"2_test"},{"subj":"15608187-10592235-77246192","pred":"source","obj":"2_test"},{"subj":"15608187-11841247-77246193","pred":"source","obj":"2_test"},{"subj":"15608187-15010309-77246194","pred":"source","obj":"2_test"},{"subj":"15608187-15010309-77246195","pred":"source","obj":"2_test"},{"subj":"15608187-11675023-77246196","pred":"source","obj":"2_test"},{"subj":"15608187-11752350-77246197","pred":"source","obj":"2_test"},{"subj":"15608187-8568893-77246198","pred":"source","obj":"2_test"},{"subj":"15608187-11056479-77246199","pred":"source","obj":"2_test"},{"subj":"15608187-10580125-77246200","pred":"source","obj":"2_test"},{"subj":"15608187-14514716-77246201","pred":"source","obj":"2_test"},{"subj":"15608187-14514714-77246202","pred":"source","obj":"2_test"},{"subj":"15608187-15010309-77246203","pred":"source","obj":"2_test"},{"subj":"15608187-11056479-77246204","pred":"source","obj":"2_test"},{"subj":"15608187-15180909-77246205","pred":"source","obj":"2_test"}]}],"config":{"attribute types":[{"pred":"source","value type":"selection","values":[{"id":"2_test","color":"#93ecb7","default":true}]}]}}