OPIA: an open archive of plant images and related phenotypic traits.

Yongrong Cao, Dongmei Tian, Zhixin Tang, Xiaonan Liu, Weijuan Hu, Zhang Zhang, Shuhui Song
Author Information
  1. Yongrong Cao: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China. ORCID
  2. Dongmei Tian: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China.
  3. Zhixin Tang: University of Chinese Academy of Sciences, Beijing 100049, China.
  4. Xiaonan Liu: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China. ORCID
  5. Weijuan Hu: Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China.
  6. Zhang Zhang: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China. ORCID
  7. Shuhui Song: National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing 100101, China. ORCID

Abstract

High-throughput plant phenotype acquisition technologies have been extensively utilized in plant phenomics studies, leading to vast quantities of images and image-based phenotypic traits (i-traits) that are critically essential for accelerating germplasm screening, plant diseases identification and biotic & abiotic stress classification. Here, we present the Open Plant Image Archive (OPIA, https://ngdc.cncb.ac.cn/opia/), an open archive of plant images and i-traits derived from high-throughput phenotyping platforms. Currently, OPIA houses 56 datasets across 11 plants, comprising a total of 566 225 images with 2 417 186 labeled instances. Notably, it incorporates 56 i-traits of 93 rice and 105 wheat cultivars based on 18 644 individual RGB images, and these i-traits are further annotated based on the Plant Phenotype and Trait Ontology (PPTO) and cross-linked with GWAS Atlas. Additionally, each dataset in OPIA is assigned an evaluation score that takes account of image data volume, image resolution, and the number of labeled instances. More importantly, OPIA is equipped with useful tools for online image pre-processing and intelligent prediction. Collectively, OPIA provides open access to valuable datasets, pre-trained models, and phenotypic traits across diverse plants and thus bears great potential to play a crucial role in facilitating artificial intelligence-assisted breeding research.

References

  1. Gigascience. 2017 Oct 1;6(10):1-10 [PMID: 29020747]
  2. Genomics Proteomics Bioinformatics. 2023 Oct 6;: [PMID: 37806555]
  3. Plant Methods. 2017 Nov 01;13:79 [PMID: 29118821]
  4. Trends Plant Sci. 2023 Feb;28(2):154-184 [PMID: 36167648]
  5. Plant Phenomics. 2020 Aug 21;2020:1375957 [PMID: 33313541]
  6. Plant Methods. 2019 Dec 11;15:150 [PMID: 31857821]
  7. Plant Methods. 2013 Oct 09;9(1):38 [PMID: 24107223]
  8. Front Plant Sci. 2019 Aug 28;10:1050 [PMID: 31555312]
  9. Planta. 2018 Nov;248(5):1307-1318 [PMID: 30101399]
  10. Plant Physiol. 2015 Mar;167(3):617-27 [PMID: 25614065]
  11. Plant Phenomics. 2023;5:0020 [PMID: 37040495]
  12. Plant Cell Environ. 2023 Feb;46(2):549-566 [PMID: 36354160]
  13. Mol Plant. 2023 Feb 6;16(2):354-373 [PMID: 36447436]
  14. Trends Plant Sci. 2017 Jul;22(7):559-566 [PMID: 28571940]
  15. Nucleic Acids Res. 2020 Jan 8;48(D1):D927-D932 [PMID: 31566222]
  16. New Phytol. 2020 Jul;227(1):260-273 [PMID: 32171029]
  17. Mol Plant. 2023 Jan 2;16(1):279-293 [PMID: 36366781]
  18. Trends Plant Sci. 2021 Nov;26(11):1171-1185 [PMID: 34404587]
  19. Nucleic Acids Res. 2023 Jan 6;51(D1):D969-D976 [PMID: 36263826]
  20. Data Brief. 2021 Dec 31;40:107756 [PMID: 35028343]
  21. Gigascience. 2012 Jul 12;1(1):11 [PMID: 23587345]
  22. Front Plant Sci. 2022 Sep 13;13:963302 [PMID: 36176678]
  23. J Biosci. 2020;45: [PMID: 32975238]
  24. Plant Methods. 2019 Feb 06;15:13 [PMID: 30774703]
  25. Nat Commun. 2014 Oct 08;5:5087 [PMID: 25295980]
  26. Plant Phenomics. 2023 Jun 08;5:0058 [PMID: 37304154]

Grants

  1. 2022ZD04017/The Science and Technology Innovation 2030 - Major Project
  2. 32000475/National Natural Science Foundation of China
  3. XDA24040201/Strategic Priority Research Program of the Chinese Academy of Sciences
  4. Y2021038/Youth Innovation Promotion Association of the Chinese Academy of Sciences

MeSH Term

Artificial Intelligence
Image Processing, Computer-Assisted
Phenotype
Plant Breeding
Plants
Databases, Factual

Links to CNCB-NGDC Resources

Database Commons: DBC009592 (OPIA)

Word Cloud

Similar Articles

Cited By