PGP repository: a plant phenomics and genomics data publication infrastructure.

Daniel Arend, Astrid Junker, Uwe Scholz, Danuta Schüler, Juliane Wylie, Matthias Lange
Author Information
  1. Daniel Arend: Leibniz Institute for Plant Genetics and Crop Plant Research (IPK), OT Gatersleben, Corrensstraße 3, Stadt Seeland, 06466, Gatersleben, Germany arendd@ipk-gatersleben.de.
  2. Astrid Junker: Leibniz Institute for Plant Genetics and Crop Plant Research (IPK), OT Gatersleben, Corrensstraße 3, Stadt Seeland, 06466, Gatersleben, Germany.
  3. Uwe Scholz: Leibniz Institute for Plant Genetics and Crop Plant Research (IPK), OT Gatersleben, Corrensstraße 3, Stadt Seeland, 06466, Gatersleben, Germany.
  4. Danuta Schüler: Leibniz Institute for Plant Genetics and Crop Plant Research (IPK), OT Gatersleben, Corrensstraße 3, Stadt Seeland, 06466, Gatersleben, Germany.
  5. Juliane Wylie: Leibniz Institute for Plant Genetics and Crop Plant Research (IPK), OT Gatersleben, Corrensstraße 3, Stadt Seeland, 06466, Gatersleben, Germany.
  6. Matthias Lange: Leibniz Institute for Plant Genetics and Crop Plant Research (IPK), OT Gatersleben, Corrensstraße 3, Stadt Seeland, 06466, Gatersleben, Germany.

Abstract

Plant genomics and phenomics represent the most promising tools for accelerating yield gains and overcoming emerging crop productivity bottlenecks. However, accessing the wealth of plant diversity requires the characterization of plant material using state-of-the-art genomic, phenomic and molecular technologies and the release of the resulting research data via a long-term stable, open-access portal. Although several international consortia and public resource centres offer services for plant research data management, valuable digital assets remain unpublished and thus inaccessible to the scientific community. Recently, the Leibniz Institute of Plant Genetics and Crop Plant Research and the German Plant Phenotyping Network jointly initiated the Plant Genomics and Phenomics Research Data Repository (PGP) as an infrastructure for comprehensively publishing plant research data. It covers in particular cross-domain datasets that are not published in central repositories because of their volume or unsupported data scope, such as image collections from plant phenotyping and microscopy, unfinished genomes, genotyping data, visualizations of morphological plant models, mass spectrometry data, as well as software and documents. The repository is hosted at the Leibniz Institute of Plant Genetics and Crop Plant Research, using e!DAL as the software infrastructure and a Hierarchical Storage Management System as the data archival backend. A newly developed data submission tool, made available to the consortium, features a high level of automation to lower the barriers to data publication. After an internal review process, data are published with citable digital object identifiers, and a core set of technical metadata is registered at DataCite. The e!DAL-embedded Web frontend generates a landing page for each dataset and supports interactive exploration. PGP is registered as a research data repository at BioSharing.org, re3data.org and OpenAIRE as a valid EU Horizon 2020 open data archive. These features, together with the programmatic interface and the support of standard metadata formats, enable PGP to fulfil the FAIR data principles: findable, accessible, interoperable, reusable.

Database URL: http://edal.ipk-gatersleben.de/repos/pgp/

MeSH Term

Database Management Systems
Databases, Factual
Genome, Plant
Genomics
Internet
Plant Physiological Phenomena
Plants
Publications

Links to CNCB-NGDC Resources

Database Commons: DBC001940 (PGP)
