| URL: | http://csdb.glycoscience.ru |
| Full name: | Carbohydrate Structure Database |
| Description: | The Carbohydrate Structure Databases (CSDBs) store structural, bibliographic, taxonomic, NMR spectroscopic, and other data on natural carbohydrates, carbohydrate derivatives and glycosyltransferases published in the scientific literature. As of 2018, it has full coverage (>90% of published data) on bioglycans from prokaryotes, and also includes data those from fungi and plants. The time lag between publishing and deposition is ca. one year. CSDB serves as aplatform for multiple services of glycoinformatics, such as NMR spectra simulation, NMR-based structure elucidation, molecular geometry prediction, taxon clustering, glycobiological statistical tools, etc. |
| Year founded: | 2008 |
| Last update: | 2019 |
| Version: | 1 |
| Accessibility: |
Accessible
|
| Country/Region: | Russian Federation |
| Data type: | |
| Data object: | |
| Database category: | |
| Major species: | |
| Keywords: |
| University/Institution: | N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences |
| Address: | Leninsky prospect 47, Moscow 119991, Russia |
| City: | Moscow |
| Province/State: | |
| Country/Region: | Russian Federation |
| Contact name (PI/Team): | Philip V. Toukach |
| Contact email (PI/Helpdesk): | netbox@toukach.ru |
|
Source files of the Carbohydrate Structure Database: the way to sophisticated analysis of natural glycans. [PMID: 35354826]
The Carbohydrate Structure Database (CSDB, http://csdb.glycoscience.ru/ ) is a free curated repository storing various data on glycans of bacterial, fungal and plant origins. Currently, it maintains a close-to-full coverage on bacterial and fungal carbohydrates up to the year 2020. The CSDB web-interface provides free access to the database content and dedicated tools. Still, the number of these tools and the types of the corresponding analyses is limited, whereas the database itself contains data that can be used in a broader scope of analytical studies. In this paper, we present CSDB source data files and a self-contained SQL dump, and exemplify their possible application in glycan-related studies. By using CSDB in an SQL format, the user can gain access to the chain length distribution or charge distribution (as an example) in a given set of glycans defined according to specific structural, taxonomic, or other parameters, whereas the source text dump files can be imported to any dedicated database with a specific internal architecture differing from that of CSDB. |
|
Expanding CSDB_GT glycosyltransferase database with Escherichia coli. [PMID: 30759212]
In 2017, we reported a new database on glycosyltransferase (GT) activities, CSDB_GT (http://csdb.glycoscience.ru/gt.html), which was built at the platform of the Carbohydrate Structure Database (CSDB, http://csdb.glycoscience.ru/database/index.html) and contained data on experimentally confirmed GT activities from Arabidopsis thaliana. All entries in CSDB_GT are curated manually upon the analysis of scientific publications, and the key features of the database are accurate structural, genetic, protein and bibliographic references and close-to-complete coverage on experimentally proven GT activities in selected species. In 2018, CSDB_GT was supplemented with data on Escherichia coli GT activities. Now it contains ca. 800 entries on E. coli GTs, including ca. 550 entries with functions predicted in silico. This information was extracted from research papers published up to the year 2018 or was obtained by the authors' efforts on GT annotation. Thus, CSDB_GT was extended to provide not only experimentally confirmed GT activities, but also those predicted on the basis of gene or protein sequence homology that could carry valuable information. Accordingly, a new confirmation status-predicted in silico-was introduced. In addition, the coverage on A. thaliana was extended up to ca. 900 entries, all of which had experimental confirmation. Currently, CSDB_GT provides close-to-complete coverage on experimentally confirmed GT activities from A. thaliana and E. coli presented up to the year 2018. |
|
Glycoinformatics: Bridging Isolated Islands in the Sea of Data. [PMID: 29786940]
Glycoinformatics is an actively developing scientific discipline, which provides scientists with the means of access to the data on natural glycans and with various tools of their processing. However, the informatization of glycomics has a long way to go before catching up with genomics and proteomics. In this Viewpoint, we review the current situation in glycoinformatics and discuss its achievements and shortcomings, emphasizing the major drawbacks: the lack of recognized standards, protocols, data indices and tools, and the informational isolation of the existing projects. We reiterate possible solutions of the persistent issues and describe our vision of an ideal glycoinformatics project. |
|
REStLESS: automated translation of glycan sequences from residue-based notation to SMILES and atomic coordinates. [PMID: 29547883]
Motivation: Glycans and glycoconjugates are usually recorded in dedicated databases in residue-based notations. Only a few of them can be converted into chemical (atom-based) formats highly demanded in conformational and biochemical studies. In this work, we present a tool for translation from a residue-based glycan notation to SMILES. |
|
GRASS: semi-automated NMR-based structure elucidation of saccharides. [PMID: 29092007]
Motivation: Carbohydrates play crucial roles in various biochemical processes and are useful for developing drugs and vaccines. However, in case of carbohydrates, the primary structure elucidation is usually a sophisticated task. Therefore, they remain the least structurally characterized class of biomolecules, and it hampers the progress in glycochemistry and glycobiology. Creating a usable instrument designed to assist researchers in natural carbohydrate structure determination would advance glycochemistry in biomedical and pharmaceutical applications. |
|
Carbohydrate structure database merged from bacterial, archaeal, plant and fungal parts. [PMID: 26286194]
The Carbohydrate Structure Databases (CSDBs, http://csdb.glycoscience.ru) store structural, bibliographic, taxonomic, NMR spectroscopic, and other data on natural carbohydrates and their derivatives published in the scientific literature. The CSDB project was launched in 2005 for bacterial saccharides (as BCSDB). Currently, it includes two parts, the Bacterial CSDB and the Plant&Fungal CSDB. In March 2015, these databases were merged to the single CSDB. The combined CSDB includes information on bacterial and archaeal glycans and derivatives (the coverage is close to complete), as well as on plant and fungal glycans and glycoconjugates (almost all structures published up to 1998). CSDB is regularly updated via manual expert annotation of original publications. Both newly annotated data and data imported from other databases are manually curated. The CSDB data are exportable in a number of modern formats, such as GlycoRDF. CSDB provides additional services for simulation of (1)H, (13)C and 2D NMR spectra of saccharides, NMR-based structure prediction, glycan-based taxon clustering and other. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. |
|
Carbohydrate Structure Database: tools for statistical analysis of bacterial, plant and fungal glycomes. [PMID: 26337239]
Carbohydrates are biological blocks participating in diverse and crucial processes both at cellular and organism levels. They protect individual cells, establish intracellular interactions, take part in the immune reaction and participate in many other processes. Glycosylation is considered as one of the most important modifications of proteins and other biologically active molecules. Still, the data on the enzymatic machinery involved in the carbohydrate synthesis and processing are scattered, and the advance on its study is hindered by the vast bulk of accumulated genetic information not supported by any experimental evidences for functions of proteins that are encoded by these genes. In this article, we present novel instruments for statistical analysis of glycomes in taxa. These tools may be helpful for investigating carbohydrate-related enzymatic activities in various groups of organisms and for comparison of their carbohydrate content. The instruments are developed on the Carbohydrate Structure Database (CSDB) platform and are available freely on the CSDB web-site at http://csdb.glycoscience.ru. Database URL: http://csdb.glycoscience.ru. |
|
Expansion of coverage of Carbohydrate Structure Database (CSDB). [PMID: 24680503]
The Bacterial Carbohydrate Structure Database (BCSDB), which has been maintained since 2005, was expanded to cover glycans from plants and fungi. The current coverage on plant and fungal glycans includes several thousands of the CarbBank records, as well as data published before 1996 but not deposited in CarbBank. Prior to deposition, the data were verified against the original publications and supplemented with additional information, such as NMR spectra. Both the Bacterial and Plant and Fungal Carbohydrate Structure Databases are freely available at http://csdb.glycoscience.ru. |
|
Carbohydrate structure generalization scheme for database-driven simulation of experimental observables, such as NMR chemical shifts. [PMID: 25020143]
Carbohydrates play an immense role in different aspects of life. NMR spectroscopy is the most powerful tool for investigation of these compounds. Nowadays, progress in computational procedures has opened up novel opportunities giving an impulse to the development of new instruments intended to make the research simpler and more efficient. In this paper, we present a new approach for simulating (13)C NMR chemical shifts of carbohydrates. The approach is suitable for any atomic observables, which could be stored in a database. The method is based on sequential generalization of the chemical surroundings of the atom under prediction and heuristic averaging of database data. Unlike existing applications, the generalization scheme is tuned for carbohydrates, including those containing phosphates, amino acids, alditols, and other non-carbohydrate constituents. It was implemented in the Glycan-Optimized Dual Empirical Spectrum Simulation (GODESS) software, which is freely available on the Internet. In the field of carbohydrates, our approach was shown to outperform all other existing methods of NMR spectrum prediction (including quantum-mechanical calculations) in accuracy. Only this approach supports NMR spectrum simulation for a number of structural features in polymeric structures. |
|
Bacterial carbohydrate structure database 3: principles and realization. [PMID: 21155523]
Bacterial carbohydrate structure database (BCSDB) is an open-access project that collects primary publication data on carbohydrate structures originating from bacteria, their biological properties, bibliographic and taxonomic annotations, NMR spectra, etc. Almost complete coverage and outstanding data consistency are achieved. BCSDB version 3 and the principles lying behind it, including glycan description language, are reported. |
|
Statistical analysis of the Bacterial Carbohydrate Structure Data Base (BCSDB): characteristics and diversity of bacterial carbohydrates in comparison with mammalian glycans. [PMID: 18694500]
BACKGROUND: There are considerable differences between bacterial and mammalian glycans. In contrast to most eukaryotic carbohydrates, bacterial glycans are often composed of repeating units with diverse functions ranging from structural reinforcement to adhesion, colonization and camouflage. Since bacterial glycans are typically displayed at the cell surface, they can interact with the environment and, therefore, have significant biomedical importance. |