Big Data in Plant Science: Resources and Data Mining Tools for Plant Genomics and Proteomics.

George V Popescu, Christos Noutsos, Sorina C Popescu
Author Information
  1. George V Popescu: National Institute for Laser, Plasma & Radiation Physics, Str. Atomistilor, Nr. 409, Magurele, 077125, Bucharest, Romania. george.popescu@acm.org.
  2. Christos Noutsos: Cold Spring Harbor Laboratories, 1 Bungtown Street, Cold Spring Harbor, NY, 11731, USA.
  3. Sorina C Popescu: Department of Biochemistry, Molecular Biology, Plant Pathology and Entomology, Mississippi State University, 411 Dorman Hall, PO Box 9655, Mississippi State, MS, 39762, USA.

Abstract

In modern plant biology, progress is increasingly defined by the scientists' ability to gather and analyze data sets of high volume and complexity, otherwise known as "big data". Arguably, the largest increase in the volume of plant data sets over the last decade is a consequence of the application of the next-generation sequencing and mass-spectrometry technologies to the study of experimental model and crop plants. The increase in quantity and complexity of biological data brings challenges, mostly associated with data acquisition, processing, and sharing within the scientific community. Nonetheless, big data in plant science create unique opportunities in advancing our understanding of complex biological processes at a level of accuracy without precedence, and establish a base for the plant systems biology. In this chapter, we summarize the major drivers of big data in plant science and big data initiatives in life sciences with a focus on the scope and impact of iPlant, a representative cyberinfrastructure platform for plant science.

Keywords

MeSH Term

Botany
Computational Biology
Data Mining
Genomics
High-Throughput Nucleotide Sequencing
Internet
Mass Spectrometry
Plants
Proteomics
Systems Biology

Word Cloud

Created with Highcharts 10.0.0dataplantbigsciencebiologysetsvolumecomplexityincreasesequencingbiologicaliPlantBigDataPlantGenomicsProteomicsmodernprogressincreasinglydefinedscientists'abilitygatheranalyzehighotherwiseknown"bigdata"Arguablylargestlastdecadeconsequenceapplicationnext-generationmass-spectrometrytechnologiesstudyexperimentalmodelcropplantsquantitybringschallengesmostlyassociatedacquisitionprocessingsharingwithinscientificcommunityNonethelesscreateuniqueopportunitiesadvancingunderstandingcomplexprocesseslevelaccuracywithoutprecedenceestablishbasesystemschaptersummarizemajordriversinitiativeslifesciencesfocusscopeimpactrepresentativecyberinfrastructureplatformScience:ResourcesMiningToolsDatabasesMassspectrometryNext-generation

Similar Articles

Cited By