Two divergent haplotypes from a highly heterozygous lychee genome suggest independent domestication events for early and late-maturing cultivars.
Guibing Hu, Junting Feng, Xu Xiang, Jiabao Wang, Jarkko Salojärvi, Chengming Liu, Zhenxian Wu, Jisen Zhang, Xinming Liang, Zide Jiang, Wei Liu, Liangxi Ou, Jiawei Li, Guangyi Fan, Yingxiao Mai, Chengjie Chen, Xingtan Zhang, Jiakun Zheng, Yanqing Zhang, Hongxiang Peng, Lixian Yao, Ching Man Wai, Xinping Luo, Jiaxin Fu, Haibao Tang, Tianying Lan, Biao Lai, Jinhua Sun, Yongzan Wei, Huanling Li, Jiezhen Chen, Xuming Huang, Qian Yan, Xin Liu, Leah K McHale, William Rolling, Romain Guyot, David Sankoff, Chunfang Zheng, Victor A Albert, Ray Ming, Houbin Chen, Rui Xia, Jianguo Li
Author Information
Guibing Hu: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Junting Feng: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Xu Xiang: Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Tropical and Subtropical Fruit Tree Research, Guangzhou, China.
Jiabao Wang: Danzhou Scientific Observing and Experimental Station of Agro-Environment, Ministry of Agriculture and Rural Affairs, Environment and Plant Protection Institute, Chinese Academy of Tropical Agriculture Sciences, Haikou, China.
Jarkko Salojärvi: School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.
Chengming Liu: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Zhenxian Wu: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Jisen Zhang: Center for Genomics and Biotechnology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, China.
Zide Jiang: Guangdong Key Laboratory of Microbial Signals and Disease Control, College of Plant Protection, South China Agricultural University, Guangzhou, China.
Wei Liu: Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Tropical and Subtropical Fruit Tree Research, Guangzhou, China.
Liangxi Ou: Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Tropical and Subtropical Fruit Tree Research, Guangzhou, China.
Jiawei Li: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Yingxiao Mai: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Chengjie Chen: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Xingtan Zhang: Center for Genomics and Biotechnology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, China.
Jiakun Zheng: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Yanqing Zhang: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Hongxiang Peng: Horticultural Research Institute, Guangxi Academy of Agricultural Sciences, Nanning, China.
Lixian Yao: College of Natural Resources and Environment, South China Agricultural University, Guangzhou, China.
Ching Man Wai: Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
Xinping Luo: Institute of Tropical and Subtropical Cash Crops, Yunnan Academy of Agricultural Sciences, Baoshan, China.
Jiaxin Fu: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Haibao Tang: Center for Genomics and Biotechnology, Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou, China.
Tianying Lan: Department of Biological Sciences, University at Buffalo, Buffalo, NY, USA.
Biao Lai: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Jinhua Sun: Danzhou Scientific Observing and Experimental Station of Agro-Environment, Ministry of Agriculture and Rural Affairs, Environment and Plant Protection Institute, Chinese Academy of Tropical Agriculture Sciences, Haikou, China.
Yongzan Wei: Key Laboratory for Tropical Fruit Biology of Ministry of Agriculture and Rural Affairs, South Subtropical Crops Research Institute, Chinese Academy of Tropical Agriculture Sciences, Zhanjiang, China.
Huanling Li: Danzhou Scientific Observing and Experimental Station of Agro-Environment, Ministry of Agriculture and Rural Affairs, Environment and Plant Protection Institute, Chinese Academy of Tropical Agriculture Sciences, Haikou, China.
Jiezhen Chen: Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Tropical and Subtropical Fruit Tree Research, Guangzhou, China.
Xuming Huang: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China.
Qian Yan: Key Laboratory of South Subtropical Fruit Biology and Genetic Resource Utilization, Institute of Fruit Tree Research, Guangdong Academy of Agricultural Sciences, Ministry of Agriculture and Rural Affairs, Guangdong Provincial Key Laboratory of Tropical and Subtropical Fruit Tree Research, Guangzhou, China.
Leah K McHale: Department of Horticulture and Crop Sciences and Center for Applied Plant Sciences, The Ohio State University, Columbus, OH, USA.
William Rolling: Center for Applied Plant Sciences, The Ohio State University, Columbus, OH, USA.
Romain Guyot: IRD, UMR DIADE, EVODYN, Montpellier, France.
David Sankoff: Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario, Canada.
Chunfang Zheng: Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario, Canada.
Victor A Albert: School of Biological Sciences, Nanyang Technological University, Singapore, Singapore. vaalbert@buffalo.edu.
Ray Ming: Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA. rayming@illinois.edu.
Houbin Chen: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China. hbchen@scau.edu.cn.
Rui Xia: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China. rxia@scau.edu.cn.
Jianguo Li: State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, Key Laboratory of Biology and Germplasm Enhancement of Horticultural Crops, Ministry of Agriculture and Rural Affairs, Guangdong Litchi Engineering Research Center, College of Horticulture, South China Agricultural University, Guangzhou, China. jianli@scau.edu.cn.
Lychee is an exotic tropical fruit with a distinct flavor. The genome of cultivar 'Feizixiao' was assembled into 15 pseudochromosomes, totaling ~470 Mb. High heterozygosity (2.27%) resulted in two complete haplotypic assemblies. A total of 13,517 allelic genes (42.4%) were differentially expressed in diverse tissues. Analyses of 72 resequenced lychee accessions revealed two independent domestication events. The extremely early maturing cultivars, which aligned preferentially to one haplotype, were domesticated from a wild population in Yunnan, whereas the late-maturing cultivars, which mapped mostly to the second haplotype, were domesticated independently from a wild population in Hainan. Early maturing cultivars were probably developed in Guangdong via hybridization between extremely early maturing and late-maturing cultivars. Variable deletions of a 3.7 kb region encompassed by a pair of CONSTANS-like genes probably regulate the differences in fruit maturation among lychee cultivars. These genomic resources provide insights into the natural history of lychee domestication and will accelerate the improvement of lychee and related crops.