Draft genome sequence of an elite Dura palm and whole-genome patterns of DNA variation in oil palm.

Jingjing Jin, May Lee, Bin Bai, Yanwei Sun, Jing Qu, Rahmadsyah, Yuzer Alfiko, Chin Huat Lim, Antonius Suwanto, Maria Sugiharti, Limsoon Wong, Jian Ye, Nam-Hai Chua, Gen Hua Yue
Author Information
  1. Jingjing Jin: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore.
  2. May Lee: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore.
  3. Bin Bai: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore.
  4. Yanwei Sun: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore.
  5. Jing Qu: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore.
  6. Rahmadsyah: R & D Department, Wilmar International Plantation, Palembang, Indonesia.
  7. Yuzer Alfiko: Biotech Lab, Wilmar International, Jakarta, Indonesia.
  8. Chin Huat Lim: R & D Department, Wilmar International Plantation, Palembang, Indonesia.
  9. Antonius Suwanto: Biotech Lab, Wilmar International, Jakarta, Indonesia.
  10. Maria Sugiharti: Biotech Lab, Wilmar International, Jakarta, Indonesia.
  11. Limsoon Wong: School of Computing, National University of Singapore, Singapore.
  12. Jian Ye: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore. jianye@im.ac.cn
  13. Nam-Hai Chua: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore. chua@mail.rockefeller.edu
  14. Gen Hua Yue: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore. genhua@tll.org.sg

Abstract

Oil palm is the world's leading source of vegetable oil and fat. Dura, Pisifera and Tenera are the three forms of oil palm. The genome sequence of Pisifera is available, whereas the Dura form has not yet been sequenced. We sequenced the genome of one elite Dura palm and re-sequenced 17 palm genomes. The assembled genome sequence of the elite Dura tree contained 10,971 scaffolds and was 1.701 Gb in length, covering 94.49% of the oil palm genome. A total of 36,105 genes were predicted. Re-sequencing of the 17 additional palm trees identified 18.1 million SNPs. We found high genetic variation among palms from different geographical regions, but lower variation among Southeast Asian Dura and Pisifera palms. We mapped 10,000 SNPs onto the linkage map of oil palm. In addition, high linkage disequilibrium (LD) was detected in the oil palms used in breeding populations of Southeast Asia, suggesting that LD mapping is likely to be practical in this important oil crop. Our data provide a valuable resource for accelerating genetic improvement and for studying the mechanisms underlying phenotypic variation in important oil palm traits.
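
For readers unfamiliar with the LD statistic mentioned above, the following is a minimal sketch of how pairwise LD between two biallelic SNPs is commonly summarised as r^2, assuming phased haplotypes coded as 0/1 alleles. It is not the authors' pipeline; the function name `ld_r_squared` and the toy haplotypes are hypothetical and chosen only for illustration.

```python
from typing import Sequence, Tuple


def ld_r_squared(haplotypes: Sequence[Tuple[int, int]]) -> float:
    """Return r^2 between two biallelic SNPs from phased haplotypes.

    Each haplotype is a pair (a, b) of 0/1 alleles at SNP A and SNP B.
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("need at least one haplotype")

    # Frequencies of allele 1 at each SNP, and of the 1-1 haplotype.
    p_a = sum(a for a, _ in haplotypes) / n
    p_b = sum(b for _, b in haplotypes) / n
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n

    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both SNPs must be polymorphic")

    # D is the deviation of the observed haplotype frequency
    # from the frequency expected if the SNPs were independent.
    d = p_ab - p_a * p_b
    return d * d / denom


# Toy data, made up for illustration (not from the study).
perfect = [(1, 1), (1, 1), (0, 0), (0, 0)]      # alleles always co-occur
independent = [(1, 1), (1, 0), (0, 1), (0, 0)]  # no association

print(ld_r_squared(perfect))
print(ld_r_squared(independent))
```

Values of r^2 near 1 mean that the allele at one SNP largely predicts the allele at the other, which is the property that makes LD mapping with a limited marker set feasible.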

MeSH Term

Arecaceae
Genome, Plant
Linkage Disequilibrium
Polymorphism, Genetic
