Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

General information

URL: https://unite.ut.ee/
Full name: UNITE database
Description: The development of UNITE is a Northern European initiative presently involving 10 core institutions. The project links experts in fungal taxonomy, ecology and bioinformatics.
Year founded: 2005
Last update: 2019
Version: 7.2
Accessibility:
Manual:
Accessible
Real time : Checking...
Country/Region: Estonia

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: University of Tartu
Address: Department of Botany, Institute of Ecology and Earth Sciences, University of Tartu, 40 Lai St, 51005 Tartu, Estonia
City: Tartu
Province/State:
Country/Region: Estonia
Contact name (PI/Team): Urmas Kõljalg
Contact email (PI/Helpdesk): urmas.koljalg@ut.ee

Publications

30371820
The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications. [PMID: 30371820]
Nilsson RH, Larsson KH, Taylor AFS, Bengtsson-Palme J, Jeppesen TS, Schigel D, Kennedy P, Picard K, Glöckner FO, Tedersoo L, Saar I, Kõljalg U, Abarenkov K.

UNITE (https://unite.ut.ee/) is a web-based database and sequence management environment for the molecular identification of fungi. It targets the formal fungal barcode-the nuclear ribosomal internal transcribed spacer (ITS) region-and offers all ?1 000 000 public fungal ITS sequences for reference. These are clustered into ?459 000 species hypotheses and assigned digital object identifiers (DOIs) to promote unambiguous reference across studies. In-house and web-based third-party sequence curation and annotation have resulted in more than 275 000 improvements to the data over the past 15 years. UNITE serves as a data provider for a range of metabarcoding software pipelines and regularly exchanges data with all major fungal sequence databases and other community resources. Recent improvements include redesigned handling of unclassifiable species hypotheses, integration with the taxonomic backbone of the Global Biodiversity Information Facility, and support for an unlimited number of parallel taxonomic classification systems.

Nucleic Acids Res. 2019:47(D1) | 781 Citations (from Europe PMC, 2024-04-06)
29559822
Taxonomic annotation of public fungal ITS sequences from the built environment - a report from an April 10-11, 2017 workshop (Aberdeen, UK). [PMID: 29559822]
Nilsson RH, Taylor AFS, Adams RI, Baschien C, Johan Bengtsson-Palme, Cangren P, Coleine C, Heide-Marie Daniel, Glassman SI, Hirooka Y, Irinyi L, Reda Iršėnaitė, Pedro M Martin-Sanchez, Meyer W, Seung-Yoon Oh, Jose Paulo Sampaio, Seifert KA, Sklenář F, Dirk Stubbe, Suh SO, Summerbell R, Svantesson S, Martin Unterseher, Cobus M Visagie, Weiss M, Woudenberg JH, Christian Wurzbacher, den Wyngaert SV, Yilmaz N, Andrey Yurkov, Kõljalg U, Abarenkov K.

Recent DNA-based studies have shown that the built environment is surprisingly rich in fungi. These indoor fungi - whether transient visitors or more persistent residents - may hold clues to the rising levels of human allergies and other medical and building-related health problems observed globally. The taxonomic identity of these fungi is crucial in such pursuits. Molecular identification of the built mycobiome is no trivial undertaking, however, given the large number of unidentified, misidentified, and technically compromised fungal sequences in public sequence databases. In addition, the sequence metadata required to make informed taxonomic decisions - such as country and host/substrate of collection - are often lacking even from reference and ex-type sequences. Here we report on a taxonomic annotation workshop (April 10-11, 2017) organized at the James Hutton Institute/University of Aberdeen (UK) to facilitate reproducible studies of the built mycobiome. The 32 participants went through public fungal ITS barcode sequences related to the built mycobiome for taxonomic and nomenclatural correctness, technical quality, and metadata availability. A total of 19,508 changes - including 4,783 name changes, 14,121 metadata annotations, and the removal of 99 technically compromised sequences - were implemented in the UNITE database for molecular identification of fungi (https://unite.ut.ee/) and shared with a range of other databases and downstream resources. Among the genera that saw the largest number of changes were , , , and , all of them of significant importance in both culture-based and culture-independent surveys of the built environment.

MycoKeys. 2018:(28) | 10 Citations (from Europe PMC, 2024-04-06)
25786896
A Comprehensive, Automatically Updated Fungal ITS Sequence Dataset for Reference-Based Chimera Control in Environmental Sequencing Efforts. [PMID: 25786896]
Nilsson RH, Tedersoo L, Ryberg M, Kristiansson E, Hartmann M, Unterseher M, Porter TM, Bengtsson-Palme J, Walker DM, de Sousa F, Gamper HA, Larsson E, Larsson KH, Kõljalg U, Edgar RC, Abarenkov K.

The nuclear ribosomal internal transcribed spacer (ITS) region is the most commonly chosen genetic marker for the molecular identification of fungi in environmental sequencing and molecular ecology studies. Several analytical issues complicate such efforts, one of which is the formation of chimeric-artificially joined-DNA sequences during PCR amplification or sequence assembly. Several software tools are currently available for chimera detection, but rely to various degrees on the presence of a chimera-free reference dataset for optimal performance. However, no such dataset is available for use with the fungal ITS region. This study introduces a comprehensive, automatically updated reference dataset for fungal ITS sequences based on the UNITE database for the molecular identification of fungi. This dataset supports chimera detection throughout the fungal kingdom and for full-length ITS sequences as well as partial (ITS1 or ITS2 only) datasets. The performance of the dataset on a large set of artificial chimeras was above 99.5%, and we subsequently used the dataset to remove nearly 1,000 compromised fungal ITS sequences from public circulation. The dataset is available at http://unite.ut.ee/repository.php and is subject to web-based third-party curation.

Microbes Environ. 2015:30(2) | 90 Citations (from Europe PMC, 2024-04-06)
21949797
Tidying up international nucleotide sequence databases: ecological, geographical and sequence quality annotation of its sequences of mycorrhizal fungi. [PMID: 21949797]
Tedersoo L, Abarenkov K, Nilsson RH, Schüssler A, Grelet GA, Kohout P, Oja J, Bonito GM, Veldre V, Jairus T, Ryberg M, Larsson KH, Kõljalg U.

Sequence analysis of the ribosomal RNA operon, particularly the internal transcribed spacer (ITS) region, provides a powerful tool for identification of mycorrhizal fungi. The sequence data deposited in the International Nucleotide Sequence Databases (INSD) are, however, unfiltered for quality and are often poorly annotated with metadata. To detect chimeric and low-quality sequences and assign the ectomycorrhizal fungi to phylogenetic lineages, fungal ITS sequences were downloaded from INSD, aligned within family-level groups, and examined through phylogenetic analyses and BLAST searches. By combining the fungal sequence database UNITE and the annotation and search tool PlutoF, we also added metadata from the literature to these accessions. Altogether 35,632 sequences belonged to mycorrhizal fungi or originated from ericoid and orchid mycorrhizal roots. Of these sequences, 677 were considered chimeric and 2,174 of low read quality. Information detailing country of collection, geographical coordinates, interacting taxon and isolation source were supplemented to cover 78.0%, 33.0%, 41.7% and 96.4% of the sequences, respectively. These annotated sequences are publicly available via UNITE (http://unite.ut.ee/) for downstream biogeographic, ecological and taxonomic analyses. In European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena/), the annotated sequences have a special link-out to UNITE. We intend to expand the data annotation to additional genes and all taxonomic groups and functional guilds of fungi.

PLoS One. 2011:6(9) | 20 Citations (from Europe PMC, 2024-04-06)
20409185
The UNITE database for molecular identification of fungi--recent updates and future perspectives. [PMID: 20409185]
Abarenkov K, Henrik Nilsson R, Larsson KH, Alexander IJ, Eberhardt U, Erland S, Høiland K, Kjøller R, Larsson E, Pennanen T, Sen R, Taylor AF, Tedersoo L, Ursing BM, Vrålstad T, Liimatainen K, Peintner U, Kõljalg U.

null

New Phytol. 2010:186(2) | 683 Citations (from Europe PMC, 2024-04-06)
18211473
Fungal networks made of humans: UNITE, FESIN, and frontiers in fungal ecology. [PMID: 18211473]
Bruns TD, Arnold AE, Hughes KW.

null

New Phytol. 2008:177(3) | 10 Citations (from Europe PMC, 2024-04-06)
15869663
UNITE: a database providing web-based methods for the molecular identification of ectomycorrhizal fungi. [PMID: 15869663]
Kõljalg U, Larsson KH, Abarenkov K, Nilsson RH, Alexander IJ, Eberhardt U, Erland S, Høiland K, Kjøller R, Larsson E, Pennanen T, Sen R, Taylor AF, Tedersoo L, Vrålstad T, Ursing BM.

Identification of ectomycorrhizal (ECM) fungi is often achieved through comparisons of ribosomal DNA internal transcribed spacer (ITS) sequences with accessioned sequences deposited in public databases. A major problem encountered is that annotation of the sequences in these databases is not always complete or trustworthy. In order to overcome this deficiency, we report on UNITE, an open-access database. UNITE comprises well annotated fungal ITS sequences from well defined herbarium specimens that include full herbarium reference identification data, collector/source and ecological data. At present UNITE contains 758 ITS sequences from 455 species and 67 genera of ECM fungi. UNITE can be searched by taxon name, via sequence similarity using blastn, and via phylogenetic sequence identification using galaxie. Following implementation, galaxie performs a phylogenetic analysis of the query sequence after alignment either to pre-existing generic alignments, or to matches retrieved from a blast search on the UNITE data. It should be noted that the current version of UNITE is dedicated to the reliable identification of ECM fungi. The UNITE database is accessible through the URL http://unite.zbi.ee

New Phytol. 2005:166(3) | 434 Citations (from Europe PMC, 2024-04-06)

Ranking

All databases:
110/6000 (98.183%)
Standard ontology and nomenclature:
11/221 (95.475%)
110
Total Rank
2,006
Citations
105.579
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2018-01-28
Curated by:
Lina Ma [2022-04-26]
Shoaib Saleem [2019-10-28]
Dong Zou [2019-01-04]
Lina Ma [2018-06-07]
Sidra Younas [2018-04-24]
Sidra Younas [2018-04-04]
Meiye Jiang [2018-01-27]