| URL: | http://allccs.zhulab.cn |
| Full name: | AllCCS2 |
| Description: | AllCCS2 is an enhanced database designed for the universal prediction of ion mobility collision cross-section (CCS) values of small molecules. It incorporates 10,384 records and 7713 unified CCS values as training data and uses a neural network trained on diverse molecular representations to achieve exceptional prediction accuracy. |
| Year founded: | 2019 |
| Last update: | 2023-05-22 |
| Version: | v2.0 |
| Accessibility: |
Accessible
|
| Country/Region: | China |
| Data type: | |
| Data object: |
NA
|
| Database category: | |
| Major species: |
NA
|
| Keywords: |
| University/Institution: | Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences |
| Address: | |
| City: | |
| Province/State: | |
| Country/Region: | China |
| Contact name (PI/Team): | Zheng-Jiang Zhu |
| Contact email (PI/Helpdesk): | jiangzhu@sioc.ac.cn |
|
AllCCS2: Curation of Ion Mobility Collision Cross-Section Atlas for Small Molecules Using Comprehensive Molecular Representations. [PMID: 37664900]
The development of ion mobility-mass spectrometry (IM-MS) has revolutionized the analysis of small molecules, such as metabolomics, lipidomics, and exposome studies. The curation of comprehensive reference collision cross-section (CCS) databases plays a pivotal role in the successful application of IM-MS for small-molecule analysis. In this study, we presented AllCCS2, an enhanced version of AllCCS, designed for the universal prediction of the ion mobility CCS values of small molecules. AllCCS2 incorporated newly available experimental CCS data, including 10,384 records and 7713 unified values, as training data. By leveraging a neural network trained on diverse molecular representations encompassing mass spectrometry features, molecular descriptors, and graph features extracted using a graph convolutional network, AllCCS2 achieved exceptional prediction accuracy. AllCCS2 achieved median relative error (MedRE) values of 0.31, 0.72, and 1.64% in the training, validation, and testing sets, respectively, surpassing existing CCS prediction tools in terms of accuracy and coverage. Furthermore, AllCCS2 exhibited excellent compatibility with different instrument platforms (DTIMS, TWIMS, and TIMS). The prediction uncertainties in AllCCS2 from the training data and the prediction model were comprehensively investigated by using representative structure similarity and model prediction variation. Notably, small molecules with high structural similarities to the training set and lower model prediction variation exhibited improved accuracy and lower relative errors. In summary, AllCCS2 serves as a valuable resource to support applications of IM-MS technologies. The AllCCS2 database and tools are freely accessible at http://allccs.zhulab.cn/. |
|
Ion mobility collision cross-section atlas for known and unknown metabolite annotation in untargeted metabolomics. [PMID: 32859911]
The metabolome includes not just known but also unknown metabolites; however, metabolite annotation remains the bottleneck in untargeted metabolomics. Ion mobility - mass spectrometry (IM-MS) has emerged as a promising technology by providing multi-dimensional characterizations of metabolites. Here, we curate an ion mobility CCS atlas, namely AllCCS, and develop an integrated strategy for metabolite annotation using known or unknown chemical structures. The AllCCS atlas covers vast chemical structures with >5000 experimental CCS records and ~12 million calculated CCS values for >1.6 million small molecules. We demonstrate the high accuracy and wide applicability of AllCCS with medium relative errors of 0.5-2% for a broad spectrum of small molecules. AllCCS combined with in silico MS/MS spectra facilitates multi-dimensional match and substantially improves the accuracy and coverage of both known and unknown metabolite annotation from biological samples. Together, AllCCS is a versatile resource that enables confident metabolite annotation, revealing comprehensive chemical and metabolic insights towards biological processes. |