Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

cropPAL

General information

URL: https://crop-pal.org
Full name: crop Proteins with Annotated Locations
Description: The cropPAL2020 dataset provides a comprehensive subcellular proteomics resource and user interface for exploring global protein distributions within crop cells. It identifies species-specific protein subcellular location divergence and defines the best species for comparisons to drive compartmentation-based approaches to improve yield, protein composition and resilience in future crop varieties.
Year founded: 2016
Last update:
Version:
Accessibility:
Accessible
Country/Region: Australia

Classification & Tag

Data type:
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: University of Western Australia
Address: 35 Stirling Highway
City: Crawley
Province/State:
Country/Region: Australia
Contact name (PI/Team): Cornelia M. Hooper
Contact email (PI/Helpdesk): cornelia.hooper@uwa.edu.au

Publications

32780488
CropPAL for discovering divergence in protein subcellular location in crops to support strategies for molecular crop breeding. [PMID: 32780488]
Cornelia M Hooper, Ian R Castleden, Nader Aryamanesh, Kylie Black, Sally V Grasso, A Harvey Millar

Agriculture faces increasing demand for yield, higher plant-derived protein content and diversity while facing pressure to achieve sustainability. Although the genomes of many of the important crops have been sequenced, the subcellular locations of most of the encoded proteins remain unknown or are only predicted. Protein subcellular location is crucial in determining protein function and accumulation patterns in plants, and is critical for targeted improvements in yield and resilience. Integrating location data from over 800 studies for 12 major crop species into the cropPAL2020 data collection showed that while >80% of proteins in most species are not localised by experimental data, combining species data or integrating predictions can help bridge gaps at similar accuracy. The collation and integration of over 61 505 experimental localisations and more than 6 million predictions showed that the relative sizes of the protein catalogues located in different subcellular compartments are comparable between crops and Arabidopsis. A comprehensive cross-species comparison showed that between 50% and 80% of the subcellulomes are conserved across species and that conservation only depends to some degree on the phylogenetic relationship of the species. Protein subcellular locations in major biosynthesis pathways are more often conserved than in metabolic pathways. Underlying this conservation is a clear potential for subcellular diversity in protein location between species by means of gene duplication and alternative splicing. Our cropPAL data set and search platform (https://crop-pal.org) provide a comprehensive subcellular proteomics resource to drive compartmentation-based approaches for improving yield, protein composition and resilience in future crop varieties.

Plant J.. 2020:() | 13 Citations (from Europe PMC, 2025-12-13)
26556651
Finding the Subcellular Location of Barley, Wheat, Rice and Maize Proteins: The Compendium of Crop Proteins with Annotated Locations (cropPAL). [PMID: 26556651]
Cornelia M Hooper, Ian R Castleden, Nader Aryamanesh, Richard P Jacoby, A Harvey Millar,

Barley, wheat, rice and maize provide the bulk of human nutrition and have extensive industrial use as agricultural products. The genomes of these crops each contains >40,000 genes encoding proteins; however, the major genome databases for these species lack annotation information of protein subcellular location for >80% of these gene products. We address this gap, by constructing the compendium of crop protein subcellular locations called crop Proteins with Annotated Locations (cropPAL). Subcellular location is most commonly determined by fluorescent protein tagging of live cells or mass spectrometry detection in subcellular purifications, but can also be predicted from amino acid sequence or protein expression patterns. The cropPAL database collates 556 published studies, from >300 research institutes in >30 countries that have been previously published, as well as compiling eight pre-computed subcellular predictions for all Hordeum vulgare, Triticum aestivum, Oryza sativa and Zea mays protein sequences. The data collection including metadata for proteins and published studies can be accessed through a search portal http://crop-PAL.org. The subcellular localization information housed in cropPAL helps to depict plant cells as compartmentalized protein networks that can be investigated for improving crop yield and quality, and developing new biotechnological solutions to agricultural challenges.

Plant Cell Physiol.. 2016:57(1) | 44 Citations (from Europe PMC, 2025-12-13)

Ranking

All databases:
2101/6895 (69.543%)
Phylogeny and homology:
95/302 (68.874%)
Interaction:
408/1194 (65.913%)
2101
Total Rank
54
Citations
6
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2020-11-09
Curated by:
Lin Liu [2021-02-20]
Xiaonan Liu [2020-11-26]
Xiaonan Liu [2020-11-20]
Ming Chen [2020-11-09]