| URL: | https://zenodo.org/record/29887?ln%C2%BCen#.VsL3yDLWR_V |
| Full name: | |
| Description: | Drug toxicity is a major concern for both regulatory agencies and the pharmaceutical industry. A new system for identification of drug side effects from the literature was presented that combines three approaches: machine learning, rule- and knowledge-based approaches. This system has been developed to address the Task 3.B of Biocreative V challenge (BC5) dealing with Chemical-induced Disease (CID) relations. |
| Year founded: | 2016 |
| Last update: | |
| Version: | |
| Accessibility: |
Accessible
|
| Country/Region: | Spain |
| Data type: | |
| Data object: |
NA
|
| Database category: | |
| Major species: |
NA
|
| Keywords: |
| University/Institution: | Pompeu Fabra University |
| Address: | Research Programme on Biomedical Informatics (GRIB), IMIM, UPF, Barcelona, Spain |
| City: | |
| Province/State: | |
| Country/Region: | Spain |
| Contact name (PI/Team): | Laura I. Furlong |
| Contact email (PI/Helpdesk): | lfurlong@imim.es |
|
cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling. [PMID: 37952182]
MOTIVATION: The precise characterization of cell-type transcriptomes is pivotal to understanding cellular lineages, deconvolution of bulk transcriptomes, and clinical applications. Single-cell RNA sequencing resources like the Human Cell Atlas have revolutionised cell-type profiling. However, challenges persist due to data heterogeneity and discrepancies across different studies. One limitation of prevailing tools such as CIBERSORTx is their inability to address hierarchical data structures and handle nonoverlapping gene sets across samples, relying on filtering or imputation. |
|
DOSE-L1000: unveiling the intricate landscape of compound-induced transcriptional changes. [PMID: 37952162]
MOTIVATION: The LINCS L1000 project has collected gene expression profiles for thousands of compounds across a wide array of concentrations, cell lines, and time points. However, conventional analysis methods often fall short in capturing the rich information encapsulated within the L1000 transcriptional dose-response data. |
|
Combining machine learning, crowdsourcing and expert knowledge to detect chemical-induced diseases in text. [PMID: 27307137]
Drug toxicity is a major concern for both regulatory agencies and the pharmaceutical industry. In this context, text-mining methods for the identification of drug side effects from free text are key for the development of up-to-date knowledge sources on drug adverse reactions. We present a new system for identification of drug side effects from the literature that combines three approaches: machine learning, rule- and knowledge-based approaches. This system has been developed to address the Task 3.B of Biocreative V challenge (BC5) dealing with Chemical-induced Disease (CID) relations. The first two approaches focus on identifying relations at the sentence-level, while the knowledge-based approach is applied both at sentence and abstract levels. The machine learning method is based on the BeFree system using two corpora as training data: the annotated data provided by the CID task organizers and a new CID corpus developed by crowdsourcing. Different combinations of results from the three strategies were selected for each run of the challenge. In the final evaluation setting, the system achieved the highest Recall of the challenge (63%). By performing an error analysis, we identified the main causes of misclassifications and areas for improving of our system, and highlighted the need of consistent gold standard data sets for advancing the state of the art in text mining of drug side effects.Database URL: https://zenodo.org/record/29887?ln¼en#.VsL3yDLWR_V. |