Draft genome sequence of an elite Dura palm and whole-genome patterns of DNA variation in oil palm.

Jingjing Jin, May Lee, Bin Bai, Yanwei Sun, Jing Qu, Rahmadsyah, Yuzer Alfiko, Chin Huat Lim, Antonius Suwanto, Maria Sugiharti, Limsoon Wong, Jian Ye, Nam-Hai Chua, Gen Hua Yue
Author Information
  1. Jingjing Jin: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore.
  2. May Lee: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore.
  3. Bin Bai: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore.
  4. Yanwei Sun: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore.
  5. Jing Qu: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore.
  6. Rahmadsyah: R & D Department, Wilmar International Plantation, Palembang, Indonesia.
  7. Yuzer Alfiko: Biotech Lab, Wilmar International, Jakarta, Indonesia.
  8. Chin Huat Lim: R & D Department, Wilmar International Plantation, Palembang, Indonesia.
  9. Antonius Suwanto: Biotech Lab, Wilmar International, Jakarta, Indonesia.
  10. Maria Sugiharti: Biotech Lab, Wilmar International, Jakarta, Indonesia.
  11. Limsoon Wong: School of Computing, National University of Singapore, Singapore.
  12. Jian Ye: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore. jianye@im.ac.cn
  13. Nam-Hai Chua: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore. chua@mail.rockefeller.edu
  14. Gen Hua Yue: Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore. genhua@tll.org.sg

Abstract

Oil palm is the world's leading source of vegetable oil and fat. Dura, Pisifera and Tenera are the three forms of oil palm. A genome sequence of Pisifera is available, whereas the Dura form has not yet been sequenced. We sequenced the genome of one elite Dura palm and re-sequenced 17 additional palm genomes. The assembled genome sequence of the elite Dura tree contained 10,971 scaffolds and was 1.701 Gb in length, covering 94.49% of the oil palm genome. A total of 36,105 genes were predicted. Re-sequencing of the 17 additional palm trees identified 18.1 million SNPs. We found high genetic variation among palms from different geographical regions, but lower variation among Southeast Asian Dura and Pisifera palms. We mapped 10,000 SNPs on the linkage map of oil palm. In addition, high linkage disequilibrium (LD) was detected in the oil palms used in breeding populations of Southeast Asia, suggesting that LD mapping is likely to be practical in this important oil crop. Our data provide a valuable resource for accelerating genetic improvement and for studying the mechanisms underlying phenotypic variation in important oil palm traits.

MeSH Term

Arecaceae
Genome, Plant
Linkage Disequilibrium
Polymorphism, Genetic
