| URL: | https://metatryp.whoi.edu |
| Full name: | |
| Description: | METATRYP version 2 software identifies shared peptides across the predicted proteomes of organisms within environmental metaproteomics studies to enable accurate taxonomic attribution of peptides during protein inference. METATRYP was designed to facilitate ocean metaproteomics and has been integrated into the Ocean Protein Portal; however, it can be readily applied to other domains, such as the rapid deployment of a coronavirus-specific web portal to aid in use of proteomics on coronavirus research during the ongoing pandemic. |
| Year founded: | 2015 |
| Last update: | |
| Version: | version 2 |
| Accessibility: |
Accessible
|
| Country/Region: | United States |
| Data type: | |
| Data object: | |
| Database category: | |
| Major species: | |
| Keywords: |
| University/Institution: | Woods Hole Oceanographic Institution |
| Address: | 266 Woods Hole Road Mailstop #51, Woods Hole, Massachusetts 02543, United States |
| City: | |
| Province/State: | Massachusetts |
| Country/Region: | United States |
| Contact name (PI/Team): | Mak A. Saito |
| Contact email (PI/Helpdesk): | msaito@whoi.edu |
|
METATRYP v 2.0: Metaproteomic Least Common Ancestor Analysis for Taxonomic Inference Using Specialized Sequence Assemblies - Standalone Software and Web Servers for Marine Microorganisms and Coronaviruses. [PMID: 32897080]
We present METATRYP version-2 software that identifies shared peptides across the predicted proteomes of organisms within environmental metaproteomics studies to enable accurate taxonomic attribution of peptides during protein inference. Improvements include: ingestion of complex sequence assembly data categories (metagenomic and metatranscriptomic assemblies, single cell amplified genomes, and metagenome assembled genomes), prediction of the Least Common Ancestor (LCA) for a peptide shared across multiple organisms, increased performance through updates to the backend architecture, and development of a web portal (https://metatryp.whoi.edu). Major expansion of the marine METATRYP database with predicted proteomes from environmental sequencing confirms low occurrence of shared tryptic peptides among disparate marine microorganisms, implying tractability for targeted metaproteomics. METATRYP was designed to facilitate ocean metaproteomics and has been integrated into the Ocean Protein Portal (https://oceanproteinportal.org); however, it can be readily applied to other domains. We describe the rapid deployment of a coronavirus-specific web portal (https://metatryp-coronavirus.whoi.edu/) to aid in use of proteomics on coronavirus research during the ongoing pandemic. A Coronavirus-focused METATRYP database identified potential SARS-CoV-2 peptide biomarkers and indicated very few shared tryptic peptides between SARS-CoV-2 and other disparate taxa analyzed, sharing 1% peptides with taxa outside of the Betacoronavirus group, establishing that taxonomic specificity is achievable using tryptic peptide-based proteomic diagnostic approaches. |
|
Needles in the blue sea: sub-species specificity in targeted protein biomarker analyses within the vast oceanic microbial metaproteome. [PMID: 26097212]
Proteomics has great potential for studies of marine microbial biogeochemistry, yet high microbial diversity in many locales presents us with unique challenges. We addressed this challenge with a targeted metaproteomics workflow for NtcA and P-II, two nitrogen regulatory proteins, and demonstrated its application for cyanobacterial taxa within microbial samples from the Central Pacific Ocean. Using METATRYP, an open-source Python toolkit, we examined the number of shared (redundant) tryptic peptides in representative marine microbes, with the number of tryptic peptides shared between different species typically being 1% or less. The related cyanobacteria Prochlorococcus and Synechococcus shared an average of 4.8 ± 1.9% of their tryptic peptides, while shared intraspecies peptides were higher, 13 ± 15% shared peptides between 12 Prochlorococcus genomes. An NtcA peptide was found to target multiple cyanobacteria species, whereas a P-II peptide showed specificity to the high-light Prochlorococcus ecotype. Distributions of NtcA and P-II in the Central Pacific Ocean were similar except at the Equator likely due to differential nitrogen stress responses between Prochlorococcus and Synechococcus. The number of unique tryptic peptides coded for within three combined oceanic microbial metagenomes was estimated to be ∼4 × 10(7) , 1000-fold larger than an individual microbial proteome and 27-fold larger than the human proteome, yet still 20 orders of magnitude lower than the peptide diversity possible in all protein space, implying that peptide mapping algorithms should be able to withstand the added level of complexity in metaproteomic samples. |