Exclusion and Genomic Relatedness Methods for Assignment of Parentage Using Genotyping-by-Sequencing Data.

Ken G Dodds, John C McEwan, Rudiger Brauning, Tracey C van Stijn, Suzanne J Rowe, K Mary McEwan, Shannon M Clarke
Author Information
  1. Ken G Dodds: AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel 9053, New Zealand ken.dodds@agresearch.co.nz. ORCID
  2. John C McEwan: AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel 9053, New Zealand. ORCID
  3. Rudiger Brauning: AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel 9053, New Zealand. ORCID
  4. Tracey C van Stijn: AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel 9053, New Zealand. ORCID
  5. Suzanne J Rowe: AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel 9053, New Zealand. ORCID
  6. K Mary McEwan: AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel 9053, New Zealand.
  7. Shannon M Clarke: AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel 9053, New Zealand. ORCID

Abstract

Genotypes are often used to assign parentage in agricultural and ecological settings. Sequencing can be used to obtain genotypes but does not provide unambiguous genotype calls, especially when sequencing depth is low in order to reduce costs. In that case, standard parentage analysis methods no longer apply. A strategy for using low-depth sequencing data for parentage assignment is developed here. It entails the use of relatedness estimates along with a metric termed excess mismatch rate which, for parent-offspring pairs or trios, is the difference between the observed mismatch rate and the rate expected under a model of inheritance and allele reads without error. When more than one putative parent has similar statistics, bootstrapping can provide a measure of the relatedness similarity. Putative parent-offspring trios can be further checked for consistency by comparing the offspring's estimated inbreeding to half the parent relatedness. Suitable thresholds are required for each metric. These methods were applied to a deer breeding operation consisting of two herds of different breeds. Relatedness estimates were more in line with expectation when the herds were analyzed separately than when combined, although this did not alter which parents were the best matches with each offspring. Parentage results were largely consistent with those based on a microsatellite parentage panel with three discordant parent assignments out of 1561. Two models are investigated to allow the parentage metrics to be calculated with non-random selection of alleles. The tools and strategies given here allow parentage to be assigned from low-depth sequencing data.

Keywords

References

  1. Genetics. 2018 Jun;209(2):389-400 [PMID: 29588288]
  2. Nucleic Acids Res. 2017 Dec 1;45(21):e178 [PMID: 29036322]
  3. Gigascience. 2015 Feb 25;4:7 [PMID: 25722852]
  4. PLoS One. 2011 May 04;6(5):e19379 [PMID: 21573248]
  5. Gigascience. 2019 May 1;8(5): [PMID: 31042285]
  6. Anim Genet. 2019 Jun;50(3):307-310 [PMID: 30957265]
  7. Genetics. 2018 May;209(1):65-76 [PMID: 29487138]
  8. J Dairy Sci. 2010 Feb;93(2):743-52 [PMID: 20105546]
  9. BMC Genomics. 2015 Dec 09;16:1047 [PMID: 26654230]
  10. Nat Rev Genet. 2010 Nov;11(11):800-5 [PMID: 20877324]
  11. J Anim Sci. 2019 Jan 1;97(1):35-42 [PMID: 30329120]
  12. Mol Ecol Resour. 2018 Feb 17;:null [PMID: 29455472]
  13. Genetics. 2017 May;206(1):105-118 [PMID: 28341647]
  14. Nat Rev Genet. 2016 Feb;17(2):81-92 [PMID: 26729255]
  15. Genetics. 2017 Aug;206(4):2085-2103 [PMID: 28550018]
  16. Ecol Evol. 2016 Jul 29;6(17):6107-20 [PMID: 27648229]
  17. Plant Sci. 2016 Jan;242:14-22 [PMID: 26566821]
  18. Mol Phylogenet Evol. 2004 Dec;33(3):880-95 [PMID: 15522810]
  19. PLoS One. 2008;3(10):e3376 [PMID: 18852878]
  20. BMC Bioinformatics. 2014 Nov 25;15:356 [PMID: 25420514]
  21. Nat Genet. 2010 Jan;42(1):30-5 [PMID: 19915526]
  22. J Anim Breed Genet. 2019 Mar;136(2):102-112 [PMID: 30548685]
  23. Genet Sel Evol. 2018 May 18;50(1):26 [PMID: 29776335]
  24. Mol Ecol. 1998 May;7(5):639-55 [PMID: 9633105]

MeSH Term

Algorithms
Alleles
Breeding
Databases, Genetic
Family
Gene Frequency
Genomics
Genotype
Genotyping Techniques
Microsatellite Repeats
Models, Genetic
Pedigree
Sequence Analysis, DNA

Word Cloud

Created with Highcharts 10.0.0parentageratecansequencingrelatednessmismatchparentusedprovidemethodslow-depthdataestimatesmetricparent-offspringtriosherdsRelatednessParentageallowGenotypesoftenassignagriculturalecologicalsettingsSequencingobtaingenotypesunambiguousgenotypecallsespeciallydepthloworderreducecostscasestandardanalysislongerapplystrategyusingassignmentdevelopedentailsusealongtermedexcesspairsdifferenceobservedexpectedmodelinheritanceallelereadswithouterroroneputativesimilarstatisticsbootstrappingmeasuresimilarityPutativecheckedconsistencycomparingoffspring'sestimatedinbreedinghalfSuitablethresholdsrequiredapplieddeerbreedingoperationconsistingtwodifferentbreedslineexpectationanalyzedseparatelycombinedalthoughalterparentsbestmatchesoffspringresultslargelyconsistentbasedmicrosatellitepanelthreediscordantassignments1561Twomodelsinvestigatedmetricscalculatednon-randomselectionallelestoolsstrategiesgivenassignedExclusionGenomicMethodsAssignmentUsingGenotyping-by-SequencingDataexclusiongenomicrelationshipmatrixgenotyping-by-sequencing

Similar Articles

Cited By (7)