Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

ProSeg

General information

URL: http://riodb.ibase.aist.go.jp/proseg
Full name: a database of local structures of protein segments
Description: a database called ProSeg, which consists of two sub-databases, Segment DB and Cluster DB. Segment DB contains tens of thousands of segments that were prepared by dividing the primary sequences of 370 proteins using a sliding L-residue window (L = 5, 9, 11, 15). These segments were classified into several thousands of clusters according to their three-dimensional structural resemblance.
Year founded: 2009
Last update:
Version:
Accessibility:
Unaccessible
Country/Region: Japan

Classification & Tag

Data type:
Data object:
Database category:
Major species:
NA
Keywords:

Contact information

University/Institution: National Institute of Advanced Industrial Science and Technology
Address:
City:
Province/State:
Country/Region: Japan
Contact name (PI/Team): Shinya Honda
Contact email (PI/Helpdesk): s.honda@aist.go.jp

Publications

18931918
ProSeg: a database of local structures of protein segments. [PMID: 18931918]
Sawada Y, Honda S.

Integration of knowledge on the sequence-structure correlation of proteins provides a basis for the structural design of artificial novel proteins. As one of strategies, it is effective to consider a short segment, whose size is in between an amino acid and a domain, as a correlation unit for exploring the structure-to-sequence relationship. Here we report the development of a database called ProSeg, which consists of two sub-databases, Segment DB and Cluster DB. Segment DB contains tens of thousands of segments that were prepared by dividing the primary sequences of 370 proteins using a sliding L-residue window (L = 5, 9, 11, 15). These segments were classified into several thousands of clusters according to their three-dimensional structural resemblance. Cluster DB contains much cluster-related information, which includes image, rank, frequency, secondary structure assignment, sequence profile, etc. Users can search for a suitable cluster by inputting an appropriate parameter (i.e., PDB ID, dihedral angles, or DSSP symbols), which identifies the backbone structure of a query segment. Analogous to a language, ProSeg could be regarded as a 'structure-sequence dictionary' that contains over 10,000 'protein words'. ProSeg is freely accessible through the Internet ( http://riodb.ibase.aist.go.jp/proseg/ ).

J Comput Aided Mol Des. 2009:23(3) | 2 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
6763/6895 (1.929%)
Structure:
944/967 (2.482%)
6763
Total Rank
2
Citations
0.125
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2018-01-27
Curated by:
Lin Liu [2022-08-20]
Zhuang Xiong [2018-02-23]