| URL: | http://csc.columbusstate.edu/carroll/MDB |
| Full name: | MultiDomainBenchmark |
| Description: | Domains are the primary building blocks of protein structure and function. MultiDomainBenchmark was designed to provide robust evaluation of genetic database searching with query sequences that contain multiple domains. |
| Year founded: | 2019 |
| Last update: | |
| Version: | |
| Accessibility: | Accessible |
| Country/Region: | United States |
| Data type: | |
| Data object: | NA |
| Database category: | |
| Major species: | |
| Keywords: | |
| University/Institution: | Columbus State University |
| Address: | TSYS School of Computer Science, Columbus State University, 4225 University Avenue, Columbus, 31907, GA, USA |
| City: | |
| Province/State: | |
| Country/Region: | United States |
| Contact name (PI/Team): | Hyrum D. Carroll |
| Contact email (PI/Helpdesk): | carroll_hyrum@columbusstate.edu |
MultiDomainBenchmark: a multi-domain query and subject database suite. [PMID: 30764761]
BACKGROUND: Genetic sequence database retrieval benchmarks play an essential role in evaluating the performance of sequence searching tools. To date, all phylogenetically diverse benchmarks known to the authors include only query sequences with single protein domains. Domains are the primary building blocks of protein structure and function. Independently, each domain can fulfill a single function, but most proteins (>80% in Metazoa) exist as multi-domain proteins. Multiple domain units combine in various arrangements or architectures to create different functions and are often under evolutionary pressures to yield new ones. Thus, it is crucial to create gold standards reflecting the multi-domain complexity of real proteins to more accurately evaluate sequence searching tools.