Database Commons

a catalog of worldwide biological databases

Database Profile

NCycDB

General information

URL: https://github.com/qichao1984/NCyc
Full name: NCycDB
Description: NCycDB is a manually curated integrative database for fast and accurate profiling of nitrogen (N) cycle gene (sub)families from shotgun metagenome sequencing data. It contains a total of 68 gene (sub)families and covers eight N cycle processes, with 84,759 and 219,146 representative sequences at 95% and 100% identity cutoffs, respectively.
Year founded: 2019
Last update:
Version:
Accessibility: Accessible
Country/Region: China

Classification & Tag

Data type: DNA
Data object:
Database category:
Major species: NA
Keywords:

Contact information

University/Institution: Shandong University
Address: Institute of Marine Science and Technology, Shandong University, Qingdao, China
City: Qingdao
Province/State: Shandong
Country/Region: China
Contact name (PI/Team): Qichao Tu
Contact email (PI/Helpdesk): tuqichao@outlook.com

Publications

NCycDB: a curated integrative database for fast and accurate metagenomic profiling of nitrogen cycling genes. [PMID: 30165481]
Qichao Tu, Lu Lin, Lei Cheng, Ye Deng, Zhili He

MOTIVATION: The nitrogen (N) cycle is a collection of important biogeochemical pathways in the Earth ecosystem and has attracted extensive attention in ecology and environmental studies. Currently, shotgun metagenome sequencing is widely applied to explore gene families responsible for N cycle processes. However, there are problems in applying publicly available orthology databases to profile N cycle gene families in shotgun metagenomes, such as inefficient database searching, unspecific orthology groups and low coverage of N cycle genes and/or gene (sub)families.
RESULTS: To solve these issues, this study built a manually curated integrative database (NCycDB) for fast and accurate profiling of N cycle gene (sub)families from shotgun metagenome sequencing data. NCycDB contains a total of 68 gene (sub)families and covers eight N cycle processes with 84 759 and 219 146 representative sequences at 95 and 100% identity cutoffs, respectively. We also identified 1958 homologous orthology groups and included corresponding sequences in the database to avoid false positive assignments due to 'small database' issues. We applied NCycDB to characterize N cycle gene (sub)families in 52 shotgun metagenomes from the Global Ocean Sampling expedition. Further analysis showed that the structure and composition of N cycle gene families were most strongly correlated with latitude and temperature. NCycDB is expected to facilitate N cycle studies via shotgun metagenome sequencing approaches in various environments. The framework developed in this study can serve as a good reference for building similar knowledge-based functional gene databases for various processes and pathways.
AVAILABILITY AND IMPLEMENTATION: NCycDB database files are available at https://github.com/qichao1984/NCyc.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Bioinformatics. 2019:35(6) | 222 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases: 484/6895 (92.995%)
Gene genome and annotation: 171/2021 (91.588%)
Total Rank: 484
Citations: 195
z-index: 32.5
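
The percentages shown next to each rank are not defined on this page. A minimal sketch of one reading that reproduces them is given below: the share of entries ranked at or below the given position, i.e. (total - rank + 1) / total. The function names are hypothetical, and the interpretation of the z-index as citations per year (195 citations over an assumed 6 years since the 2019 publication) is an assumption that happens to match the displayed value, not a documented definition.

```python
def rank_percentile(rank: int, total: int) -> float:
    """Percentage of entries ranked at or below `rank` (assumed formula)."""
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return round((total - rank + 1) / total * 100, 3)


def citations_per_year(citations: int, years: int) -> float:
    """Average citations per year (assumed meaning of the z-index)."""
    if years <= 0:
        raise ValueError("years must be positive")
    return citations / years


if __name__ == "__main__":
    # Values taken from the Ranking section of this record.
    print(rank_percentile(484, 6895))   # all databases
    print(rank_percentile(171, 2021))   # gene genome and annotation
    # Assumption: 6 years between publication (2019) and the citation snapshot.
    print(citations_per_year(195, 6))
```

Under these assumptions the three printed values agree with the 92.995%, 91.588% and 32.5 listed above.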

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation:
System accessibility & reliability:

Record metadata

Created on: 2019-09-24
Curated by:
Ghulam Abbas [2019-10-12]
Ghulam Abbas [2019-10-08]
furrukh mehmood [2019-09-24]