Pan-Genome of Wild and Cultivated Soybeans.

Yucheng Liu, Huilong Du, Pengcheng Li, Yanting Shen, Hua Peng, Shulin Liu, Guo-An Zhou, Haikuan Zhang, Zhi Liu, Miao Shi, Xuehui Huang, Yan Li, Min Zhang, Zheng Wang, Baoge Zhu, Bin Han, Chengzhi Liang, Zhixi Tian
Author Information
  1. Yucheng Liu: State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.
  2. Huilong Du: State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.
  3. Pengcheng Li: Berry Genomics Corporation, Beijing 100015, China.
  4. Yanting Shen: School of Pharmaceutical Sciences, Guangzhou University of Chinese Medicine, Guangzhou 510006, China.
  5. Hua Peng: State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.
  6. Shulin Liu: State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China.
  7. Guo-An Zhou: State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China.
  8. Haikuan Zhang: Berry Genomics Corporation, Beijing 100015, China.
  9. Zhi Liu: State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.
  10. Miao Shi: Berry Genomics Corporation, Beijing 100015, China.
  11. Xuehui Huang: College of Life Sciences, Shanghai Normal University, Shanghai 200234, China.
  12. Yan Li: National Center for Gene Research, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China.
  13. Min Zhang: State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China.
  14. Zheng Wang: State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China.
  15. Baoge Zhu: State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China.
  16. Bin Han: National Center for Gene Research, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China.
  17. Chengzhi Liang: State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China. Electronic address: cliang@genetics.ac.cn.
  18. Zhixi Tian: State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China. Electronic address: zxtian@genetics.ac.cn.

Abstract

Soybean is one of the most important vegetable oil and protein feed crops. To capture the entire genomic diversity, it is needed to construct a complete high-quality pan-genome from diverse soybean accessions. In this study, we performed individual de novo genome assemblies for 26 representative soybeans that were selected from 2,898 deeply sequenced accessions. Using these assembled genomes together with three previously reported genomes, we constructed a graph-based genome and performed pan-genome analysis, which identified numerous genetic variations that cannot be detected by direct mapping of short sequence reads onto a single reference genome. The structural variations from the 2,898 accessions that were genotyped based on the graph-based genome and the RNA sequencing (RNA-seq) data from the representative 26 accessions helped to link genetic variations to candidate genes that are responsible for important traits. This pan-genome resource will promote evolutionary and functional genomics studies in soybean.

Keywords

MeSH Term

Base Sequence
Chromosomes, Plant
Domestication
Ecotype
Gene Duplication
Gene Expression Regulation, Plant
Gene Fusion
Genome, Plant
Geography
Molecular Sequence Annotation
Phylogeny
Polymorphism, Single Nucleotide
Polyploidy
Glycine max