Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

tagtog

General information

URL: https://www.tagtog.net/
Full name: tagtog
Description: Annotation tool for biomedical text mining and corpora creation. Interactive and text-mining-assisted annotation of gene mentions in PLOS full-text articles
Year founded: 2014
Last update: 2016-01-01
Version: v1.0
Accessibility:
Accessible
Country/Region: United Kingdom

Classification & Tag

Data type:
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: University of Cambridge
Address: Cambridge CB2 3EH, UK
City: Cambridge
Province/State:
Country/Region: United Kingdom
Contact name (PI/Team): Peter McQuilton
Contact email (PI/Helpdesk): pam51@gen.cam.ac.uk

Publications

24715220
tagtog: interactive and text-mining-assisted annotation of gene mentions in PLOS full-text articles. [PMID: 24715220]
Cejuela JM, McQuilton P, Ponting L, Marygold SJ, Stefancsik R, Millburn GH, Rost B, FlyBase Consortium.

The breadth and depth of biomedical literature are increasing year upon year. To keep abreast of these increases, FlyBase, a database for Drosophila genomic and genetic information, is constantly exploring new ways to mine the published literature to increase the efficiency and accuracy of manual curation and to automate some aspects, such as triaging and entity extraction. Toward this end, we present the 'tagtog' system, a web-based annotation framework that can be used to mark up biological entities (such as genes) and concepts (such as Gene Ontology terms) in full-text articles. tagtog leverages manual user annotation in combination with automatic machine-learned annotation to provide accurate identification of gene symbols and gene names. As part of the BioCreative IV Interactive Annotation Task, FlyBase has used tagtog to identify and extract mentions of Drosophila melanogaster gene symbols and names in full-text biomedical articles from the PLOS stable of journals. We show here the results of three experiments with different sized corpora and assess gene recognition performance and curation speed. We conclude that tagtog-named entity recognition improves with a larger corpus and that tagtog-assisted curation is quicker than manual curation. DATABASE URL: www.tagtog.net, www.flybase.org.

Database (Oxford). 2014:2014(0) | 28 Citations (from Europe PMC, 2026-04-04)

Ranking

All databases:
3585/6932 (48.298%)
Literature:
309/577 (46.62%)
3585
Total Rank
28
Citations
2.333
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2015-06-20
Curated by:
Guangyu Wang [2016-03-31]
Guangyu Wang [2015-11-23]
Guangyu Wang [2015-06-26]