Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

iSimp

General information

URL: http://research.bioinformatics.udel.edu/isimp/
Full name: A Sentence Simplification System for Biomedical Text
Description: Sentence simplification is a technique designed to detect the various types of clauses and constructs used in a complex sentence, in an effort to produce two or more simple sentences while maintaining both coherence and the communicated message.
Year founded: 2014
Last update: 2014-02-01
Version: v1.0
Accessibility:
Accessible
Country/Region: United States

Classification & Tag

Data type:
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: University of Delaware
Address: 18 Amstel Ave,Newark, DE 19716,USA
City: Newark
Province/State:
Country/Region: United States
Contact name (PI/Team): Yifan Peng
Contact email (PI/Helpdesk): yfpeng@udel.edu

Publications

24850848
iSimp in BioC standard format: enhancing the interoperability of a sentence simplification system. [PMID: 24850848]
Peng Y, Tudor CO, Torii M, Wu CH, Vijay-Shanker K.

This article reports the use of the BioC standard format in our sentence simplification system, iSimp, and demonstrates its general utility. iSimp is designed to simplify complex sentences commonly found in the biomedical text, and has been shown to improve existing text mining applications that rely on the analysis of sentence structures. By adopting the BioC format, we aim to make iSimp readily interoperable with other applications in the biomedical domain. To examine the utility of iSimp in BioC, we implemented a rule-based relation extraction system that uses iSimp as a preprocessing module and BioC for data exchange. Evaluation on the training corpus of BioNLP-ST 2011 GENIA Event Extraction (GE) task showed that iSimp sentence simplification improved the recall by 3.2% without reducing precision. The iSimp simplification-annotated corpora, both our previously used corpus and the GE corpus in the current study, have been converted into the BioC format and made publicly available at the project's Web site: http://research.bioinformatics.udel.edu/isimp/. Database URL:http://research.bioinformatics.udel.edu/isimp/ © The Author(s) 2014. Published by Oxford University Press.

Database (Oxford). 2014:2014() | 9 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
5616/6895 (18.564%)
Literature:
471/577 (18.544%)
5616
Total Rank
9
Citations
0.818
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2015-06-20
Curated by:
Chunlei Yu [2016-03-31]
Chunlei Yu [2015-11-20]
Jian Sang [2015-07-01]
Zhang Zhang [2015-06-25]