Database Commons

a catalog of worldwide biological databases

Database Profile

MetaCOXI

General information

URL: https://github.com/bachob5/MetaCOXI
Full name:
Description: MetaCOXI is an integrated collection of curated metazoan COXI DNA sequences with their associated harmonized taxonomy and metadata. This collection was built on the two most extensive available data resources, namely the European Nucleotide Archive (ENA) and the Barcode of Life Data System (BOLD).
Year founded: 2022
Last update:
Version:
Accessibility: Accessible
Country/Region: Italy

Classification & Tag

Data type: DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: National Research Council of Italy
Address:
City:
Province/State:
Country/Region: Italy
Contact name (PI/Team): Bachir Balech
Contact email (PI/Helpdesk): b.balech@ibiom.cnr.it

Publications

MetaCOXI: an integrated collection of metazoan mitochondrial cytochrome oxidase subunit-I DNA sequences. [PMID: 35134858]
Bachir Balech, Anna Sandionigi, Marinella Marzano, Graziano Pesole, Monica Santamaria

Nucleotide sequence reference collections and databases are fundamental components of DNA barcoding and metabarcoding data analysis pipelines. In such analyses, accurate taxonomic assignment is crucial and relies directly on the availability of a comprehensive, curated reference sequence collection and its taxonomy information. The wide use of the mitochondrial cytochrome oxidase subunit-I (COXI) as a standard DNA barcode marker in metazoan biodiversity studies highlights the need to clarify what relevant information is available from different data sources and how it can be integrated. Addressing data integration adequately requires attention to several aspects: DNA sequence curation, taxonomy alignment against a solid reference backbone, and metadata harmonization according to universal standards. Here, we present MetaCOXI, an integrated collection of curated metazoan COXI DNA sequences with their associated harmonized taxonomy and metadata. This collection was built on the two most extensive available data resources, namely the European Nucleotide Archive (ENA) and the Barcode of Life Data System (BOLD). The current release contains more than 5.6 million entries (39.1% unique to BOLD, 3.6% unique to ENA, and 57.2% shared between both), their related taxonomic classification based on the NCBI reference taxonomy, and their main available metadata relevant to environmental DNA studies, such as geographical coordinates, sampling country and host species. MetaCOXI is available in standard universal formats ('fasta' for sequences and 'tsv' for taxonomy and metadata), which can be easily incorporated into standard or specific DNA barcoding and/or metabarcoding data analysis pipelines. Database URL: https://github.com/bachob5/MetaCOXI.

Database (Oxford). 2022:2022() | 10 Citations (from Europe PMC, 2025-12-13)
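
The abstract notes that the 'fasta' and 'tsv' files can be incorporated into analysis pipelines. A minimal Python sketch of such a join is given here for illustration only. The column names (seq_id, species, country), the identifiers, and the toy records are hypothetical and are not taken from the MetaCOXI release; check the actual file headers in the repository before adapting it.

```python
import csv
import io


def read_fasta(handle):
    """Yield (identifier, sequence) pairs from a FASTA-formatted text stream."""
    seq_id, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if seq_id is not None:
                yield seq_id, "".join(chunks)
            # Keep only the first whitespace-delimited token as the identifier.
            header = line[1:].split()
            seq_id, chunks = (header[0] if header else ""), []
        else:
            chunks.append(line)
    if seq_id is not None:
        yield seq_id, "".join(chunks)


def read_taxonomy(handle, id_column="seq_id"):
    """Index the rows of a tab-separated table by an identifier column.

    The default column name "seq_id" is a placeholder, not the real header.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    return {row[id_column]: row for row in reader}


def join_records(fasta_handle, tsv_handle, id_column="seq_id"):
    """Yield (identifier, sequence, metadata row or None) for each sequence."""
    taxonomy = read_taxonomy(tsv_handle, id_column)
    for seq_id, sequence in read_fasta(fasta_handle):
        yield seq_id, sequence, taxonomy.get(seq_id)


# Toy in-memory data standing in for the real files (hypothetical content).
FASTA_TEXT = ">seq1 example\nACGT\nTTGA\n>seq2\nGGCC\n"
TSV_TEXT = "seq_id\tspecies\tcountry\nseq1\tApis mellifera\tItaly\n"

records = list(join_records(io.StringIO(FASTA_TEXT), io.StringIO(TSV_TEXT)))
for seq_id, sequence, meta in records:
    species = meta["species"] if meta else "unclassified"
    print(seq_id, len(sequence), species)
```

With real files, the io.StringIO objects would be replaced by open file handles. Loading the whole table into a dictionary is the simplest choice for a sketch; for a collection of several million entries, an indexed store or a streaming merge may be more appropriate.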

Ranking

All databases: 3110/6895 (54.909%)
Gene genome and annotation: 970/2021 (52.053%)
Total rank: 3110
Citations: 10
z-index: 3.333

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation:
System accessibility & reliability:

Record metadata

Created on: 2022-04-25
Curated by:
Lina Ma [2022-05-31]
sun yongqing [2022-05-15]
Qianpeng Li [2022-04-25]