Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

BioLiP

General information

URL: https://zhanggroup.org/BioLiP
Full name:
Description: BioLiP is a semi-manually curated database for high-quality, biologically relevant ligand-protein binding interactions. The structure data are collected primarily from the Protein Data Bank (PDB), with biological insights mined from literature and other specific databases. BioLiP aims to construct the most comprehensive and accurate database for serving the needs of ligand-protein docking, virtual ligand screening and protein function annotation.
Year founded: 2013
Last update: 2023-02-01
Version: v2
Accessibility:
Accessible
Country/Region: United States

Classification & Tag

Data type:
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: University of Michigan
Address: 100 Washtenaw Avenue, Ann Arbor, MI 48109-2218, USA
City: Ann Arbor
Province/State: MI
Country/Region: United States
Contact name (PI/Team): Yang Zhang
Contact email (PI/Helpdesk): zhng@umich.edu

Publications

37522378
BioLiP2: an updated structure database for biologically relevant ligand-protein interactions. [PMID: 37522378]
Chengxin Zhang, Xi Zhang, Peter L Freddolino, Yang Zhang

With the progress of structural biology, the Protein Data Bank (PDB) has witnessed rapid accumulation of experimentally solved protein structures. Since many structures are determined with purification and crystallization additives that are unrelated to a protein's in vivo function, it is nontrivial to identify the subset of protein-ligand interactions that are biologically relevant. We developed the BioLiP2 database (https://zhanggroup.org/BioLiP) to extract biologically relevant protein-ligand interactions from the PDB database. BioLiP2 assesses the functional relevance of the ligands by geometric rules and experimental literature validations. The ligand binding information is further enriched with other function annotations, including Enzyme Commission numbers, Gene Ontology terms, catalytic sites, and binding affinities collected from other databases and a manual literature survey. Compared to its predecessor BioLiP, BioLiP2 offers significantly greater coverage of nucleic acid-protein interactions, and interactions involving large complexes that are unavailable in PDB format. BioLiP2 also integrates cutting-edge structural alignment algorithms with state-of-the-art structure prediction techniques, which for the first time enables composite protein structure and sequence-based searching and significantly enhances the usefulness of the database in structure-based function annotations. With these new developments, BioLiP2 will continue to be an important and comprehensive database for docking, virtual screening, and structure-based protein function analyses.

Nucleic Acids Res. 2024:52(D1) | 57 Citations (from Europe PMC, 2025-12-06)
23087378
BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions. [PMID: 23087378]
Yang J, Roy A, Zhang Y.

BioLiP (http://zhanglab.ccmb.med.umich.edu/BioLiP/) is a semi-manually curated database for biologically relevant ligand-protein interactions. Establishing interactions between protein and biologically relevant ligands is an important step toward understanding the protein functions. Most ligand-binding sites prediction methods use the protein structures from the Protein Data Bank (PDB) as templates. However, not all ligands present in the PDB are biologically relevant, as small molecules are often used as additives for solving the protein structures. To facilitate template-based ligand-protein docking, virtual ligand screening and protein function annotations, we develop a hierarchical procedure for assessing the biological relevance of ligands present in the PDB structures, which involves a four-step biological feature filtering followed by careful manual verifications. This procedure is used for BioLiP construction. Each entry in BioLiP contains annotations on: ligand-binding residues, ligand-binding affinity, catalytic sites, Enzyme Commission numbers, Gene Ontology terms and cross-links to the other databases. In addition, to facilitate the use of BioLiP for function annotation of uncharacterized proteins, a new consensus-based algorithm COACH is developed to predict ligand-binding sites from protein sequence or using 3D structure. The BioLiP database is updated weekly and the current release contains 204 223 entries.

Nucleic Acids Res. 2013:41(Database issue) | 502 Citations (from Europe PMC, 2025-12-06)

Ranking

All databases:
371/6895 (94.634%)
Interaction:
64/1194 (94.724%)
Structure:
39/967 (96.07%)
371
Total Rank
521
Citations
43.417
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2015-06-20
Curated by:
Lina Ma [2024-07-16]
Yuxin Qin [2023-09-06]
Mengwei Li [2016-03-31]
Mengwei Li [2015-11-29]
Mengwei Li [2015-06-28]
Mengwei Li [2015-06-26]