Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

SIMAP

General information

URL: http://liferay.csb.univie.ac.at/portal/web/simap/
Full name: The Similarity Matrix of Proteins
Description: The database of all-against-all protein sequence similarities and annotations with new interfaces and increased coverage
Year founded: 2014
Last update: 2015-05-02
Version: v1.0
Accessibility:
Accessible
Country/Region: Canada

Classification & Tag

Data type:
Data object:
NA
Database category:
Major species:
NA
Keywords:

Contact information

University/Institution: University of Toronto
Address: ON M5S 3E1,Canada
City: Toronto
Province/State:
Country/Region: Canada
Contact name (PI/Team): Thomas Rattei
Contact email (PI/Helpdesk): thomas.rattei@univie.ac.at

Publications

24165881
SIMAP--the database of all-against-all protein sequence similarities and annotations with new interfaces and increased coverage. [PMID: 24165881]
Arnold R, Goldenberg F, Mewes HW, Rattei T.

The Similarity Matrix of Proteins (SIMAP, http://mips.gsf.de/simap/) database has been designed to massively accelerate computationally expensive protein sequence analysis tasks in bioinformatics. It provides pre-calculated sequence similarities interconnecting the entire known protein sequence universe, complemented by pre-calculated protein features and domains, similarity clusters and functional annotations. SIMAP covers all major public protein databases as well as many consistently re-annotated metagenomes from different repositories. As of September 2013, SIMAP contains >163 million proteins corresponding to ?70 million non-redundant sequences. SIMAP uses the sensitive FASTA search heuristics, the Smith-Waterman alignment algorithm, the InterPro database of protein domain models and the BLAST2GO functional annotation algorithm. SIMAP assists biologists by facilitating the interactive exploration of the protein sequence universe. Web-Service and DAS interfaces allow connecting SIMAP with any other bioinformatic tool and resource. All-against-all protein sequence similarity matrices of project-specific protein collections are generated on request. Recent improvements allow SIMAP to cover the rapidly growing sequenced protein sequence universe. New Web-Service interfaces enhance the connectivity of SIMAP. Novel tools for interactive extraction of protein similarity networks have been added. Open access to SIMAP is provided through the web portal; the portal also contains instructions and links for software access and flat file downloads.

Nucleic Acids Res. 2014:42(Database issue) | 15 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
4755/6895 (31.051%)
Phylogeny and homology:
205/302 (32.45%)
4755
Total Rank
15
Citations
1.364
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2015-06-20
Curated by:
Lina Ma [2018-07-04]
Lin Xia [2016-03-28]
Lin Xia [2015-06-28]