Database Commons

a catalog of worldwide biological databases

Database Profile

Cfam

General information

URL: http://bidd2.cse.nus.edu.sg/cfam
Full name: Chemical Families Database
Description: Cfam is a chemical families database based on iterative selection of functional seeds and seed-directed compound clustering.
Year founded: 2015
Last update: 8/29/2014
Version: v1.0
Accessibility: Accessible
Country/Region: Singapore

Classification & Tag

Data type:
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: National University of Singapore
Address: Singapore 117543
City: Singapore City
Province/State:
Country/Region: Singapore
Contact name (PI/Team): Yu Zong Chen
Contact email (PI/Helpdesk): yzchen@cz3.nus.edu.sg

Publications

CFam: a chemical families database based on iterative selection of functional seeds and seed-directed compound clustering. [PMID: 25414339]
Zhang C, Tao L, Qin C, Zhang P, Chen S, Zeng X, Xu F, Chen Z, Yang SY, Chen YZ.

Similarity-based clustering and classification of compounds enable the search for drug leads and the structural and chemogenomic studies that facilitate chemical, biomedical, agricultural, material and other industrial applications. A database that organizes compounds into similarity-based as well as scaffold-based and property-based families is useful for facilitating these tasks. The CFam Chemical Family database (http://bidd2.cse.nus.edu.sg/cfam) was developed to hierarchically cluster drugs, bioactive molecules, human metabolites, natural products, patented agents and other molecules into functional families, superfamilies and classes of structurally similar compounds based on the literature-reported high, intermediate and remote similarity measures. The compounds were represented by molecular fingerprints, and molecular similarity was measured by the Tanimoto coefficient. The functional seeds of CFam families were drawn from hierarchically clustered drugs, bioactive molecules, human metabolites, natural products and patented agents, and were used to characterize families and cluster compounds into families, superfamilies and classes. CFam currently contains 11,643 classes, 34,880 superfamilies and 87,136 families of 490,279 compounds (1691 approved drugs, 1228 clinical trial drugs, 12,386 investigative drugs, 262,881 highly active molecules, 15,055 human metabolites, 80,255 ZINC-processed natural products and 116,783 patented agents). Efforts will be made to further expand the CFam database and add more functional categories and families based on other types of molecular representations. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
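
The abstract describes compounds represented as molecular fingerprints, compared with the Tanimoto coefficient, and grouped around functional seeds. A minimal sketch of that idea follows, assuming each fingerprint is stored as a set of on-bit indices. The function names, the seed names and the 0.5 cutoff are hypothetical and chosen only for illustration; the abstract does not state the fingerprint type or the similarity thresholds CFam uses, and this is not the authors' implementation.

```python
from typing import Dict, Optional, Set


def tanimoto(fp_a: Set[int], fp_b: Set[int]) -> float:
    """Tanimoto coefficient: shared on-bits divided by on-bits in either fingerprint."""
    union = len(fp_a | fp_b)
    if union == 0:
        return 0.0  # two empty fingerprints: treat as dissimilar
    return len(fp_a & fp_b) / union


def assign_to_seed(
    compound: Set[int],
    seeds: Dict[str, Set[int]],
    threshold: float,
) -> Optional[str]:
    """Return the most similar seed if it reaches the threshold, else None.

    `threshold` is a hypothetical cutoff, not a value taken from CFam.
    """
    best_name, best_score = None, 0.0
    for name, seed_fp in seeds.items():
        score = tanimoto(compound, seed_fp)
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else None


# Toy fingerprints (illustrative only, not real compound data).
seeds = {"seed_A": {1, 2, 3, 4}, "seed_B": {10, 11, 12}}
print(assign_to_seed({1, 2, 3, 5}, seeds, threshold=0.5))  # shares 3 of 5 bits with seed_A
print(assign_to_seed({20, 21}, seeds, threshold=0.5))      # no overlap with any seed
```

Running the same assignment with progressively lower thresholds would give a rough analogue of the family, superfamily and class levels, although the abstract does not specify how CFam builds its hierarchy in detail.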

Nucleic Acids Res. 2015:43(Database issue) | 5 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases: 6278/6895 (8.963%)
Health and medicine: 1594/1738 (8.343%)
Total rank: 6278
Citations: 4
z-index: 0.4

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation:
System accessibility & reliability:

Record metadata

Created on: 2015-06-20
Curated by:
Mengwei Li [2016-03-31]
Mengwei Li [2015-12-01]
Mengwei Li [2015-06-27]