Using RNA-Seq to profile soybean seed development from fertilization to maturity.

Sarah I Jones, Lila O Vodkin
Author Information
  1. Sarah I Jones: Department of Crop Sciences, University of Illinois, Urbana, IL, USA.

Abstract

To understand gene expression networks leading to functional properties and compositional traits of the soybean seed, we have undertaken a detailed examination of soybean seed development from a few days post-fertilization to the mature seed using Illumina high-throughput transcriptome sequencing (RNA-Seq). RNA was sequenced from seven different stages of seed development, yielding between 12 million and 78 million sequenced transcripts. These have been aligned to the 79,000 gene models predicted from the soybean genome recently sequenced by the Department of Energy Joint Genome Institute. Over one hundred gene models were identified with high expression exclusively in young seed stages, starting at just four days after fertilization. These were annotated as being related to many basic components and processes such as histones and proline-rich proteins. Genes encoding storage proteins such as glycinin and beta-conglycinin had their highest expression levels at the stages of largest fresh weight, confirming previous knowledge that these storage products are being rapidly accumulated before the seed begins the desiccation process. Other gene models showed high expression in the dry, mature seeds, perhaps indicating the preparation of pathways needed later, in the early stages of imbibition. Many highly-expressed gene models at the dry seed stage are, as expected, annotated as hydrophilic proteins associated with low water conditions, such as late embryogenesis abundant (LEA) proteins and dehydrins, which help preserve the cellular structures and nutrients within the seed during desiccation. More significantly, the power of RNA-Seq to detect genes expressed at low levels revealed hundreds of transcription factors with notable expression in at least one stage of seed development. Results from a second biological replicate demonstrate high reproducibility of these data revealing a comprehensive view of the transciptome of seed development in the cultivar Williams, the reference cultivar for the first soybean genome sequence.

References

  1. Plant Cell. 2008 Aug;20(8):2238-51 [PMID: 18682547]
  2. Acta Crystallogr D Biol Crystallogr. 2004 Mar;60(Pt 3):613-5 [PMID: 14993710]
  3. Plant Physiol. 2007 Jun;144(2):562-74 [PMID: 17556519]
  4. J Integr Plant Biol. 2011 Jul;53(7):570-85 [PMID: 21676172]
  5. BMC Genomics. 2011 Jun 02;12:286 [PMID: 21635767]
  6. Protoplasma. 2010 Sep;245(1-4):3-14 [PMID: 20411284]
  7. Genome Biol. 2004;5(10):R80 [PMID: 15461798]
  8. Plant Cell. 2009 Oct;21(10):3063-77 [PMID: 19820189]
  9. Nat Methods. 2008 Jul;5(7):621-8 [PMID: 18516045]
  10. Plant Cell. 1989 Sep;1(9):937-43 [PMID: 2535533]
  11. Plant Cell. 1991 Jun;3(6):561-71 [PMID: 1841717]
  12. Plant Cell. 2001 Sep;13(9):2099-114 [PMID: 11549766]
  13. J Biol Chem. 2000 Feb 25;275(8):5668-74 [PMID: 10681550]
  14. J Biol Chem. 1980 Sep 25;255(18):88907-13 [PMID: 6157689]
  15. BMC Genomics. 2010 Feb 24;11:136 [PMID: 20181280]
  16. Annu Rev Plant Biol. 2007;58:115-36 [PMID: 17177638]
  17. Genome Biol. 2009;10(3):R25 [PMID: 19261174]
  18. Plant Physiol. 1993 Dec;103(4):1035-1040 [PMID: 12231998]
  19. BMC Plant Biol. 2011 Oct 26;11:145 [PMID: 22029708]
  20. Nature. 2010 Jan 14;463(7278):178-83 [PMID: 20075913]
  21. DNA Res. 2011 Aug;18(4):263-76 [PMID: 21685489]
  22. Plant Physiol Biochem. 2012 Mar;52:147-53 [PMID: 22245912]
  23. BMC Plant Biol. 2012 Oct 02;12:177 [PMID: 23031057]
  24. Plant Mol Biol. 1999 May;40(2):237-48 [PMID: 10412903]
  25. Development. 2005 Aug;132(16):3577-85 [PMID: 16077088]
  26. Planta. 1981 Oct;153(2):130-9 [PMID: 24276763]
  27. Biotechniques. 2003 Feb;34(2):374-8 [PMID: 12613259]
  28. Trends Plant Sci. 2005 Feb;10(2):79-87 [PMID: 15708345]
  29. BMC Genomics. 2007 Dec 19;8:468 [PMID: 18093333]
  30. Mol Biol Rep. 2012 Mar;39(3):2867-74 [PMID: 21667246]
  31. Genome Biol. 2010;11(10):R106 [PMID: 20979621]
  32. Plant Cell. 1989 Mar;1(3):313-28 [PMID: 2485233]
  33. Plant Physiol. 1998 Mar;116(3):923-33 [PMID: 9501125]
  34. Biochim Biophys Acta. 2002 May 10;1571(1):64-70 [PMID: 12031291]
  35. BMC Genomics. 2012 Jul 16;13:310 [PMID: 22799740]
  36. Plant J. 2011 Aug;67(3):406-19 [PMID: 21477081]
  37. Nucleic Acids Res. 2012 Jan;40(Database issue):D1178-86 [PMID: 22110026]
  38. Plant Physiol. 2003 May;132(1):118-36 [PMID: 12746518]

MeSH Term

Antigens, Plant
Gene Expression Profiling
Gene Expression Regulation, Developmental
Gene Expression Regulation, Plant
Globulins
High-Throughput Nucleotide Sequencing
Models, Genetic
Plant Proteins
Pollination
Seed Storage Proteins
Seeds
Sequence Analysis, RNA
Soybean Proteins
Glycine max
Transcriptome

Chemicals

Antigens, Plant
Globulins
Plant Proteins
Seed Storage Proteins
Soybean Proteins
beta-conglycinin protein, Glycine max
late embryogenesis abundant protein, plant
dehydrin proteins, plant
glycinin

Word Cloud

Similar Articles

Cited By