Genes mirror geography within Europe.

John Novembre, Toby Johnson, Katarzyna Bryc, Zoltán Kutalik, Adam R Boyko, Adam Auton, Amit Indap, Karen S King, Sven Bergmann, Matthew R Nelson, Matthew Stephens, Carlos D Bustamante
Author Information
  1. John Novembre: Department of Ecology and Evolutionary Biology, Interdepartmental Program in Bioinformatics, University of California-Los Angeles, Los Angeles, California 90095, USA. jnovembre@ucla.edu

Abstract

Understanding the genetic structure of human populations is of fundamental interest to medical, forensic and anthropological sciences. Advances in high-throughput genotyping technology have markedly improved our understanding of global patterns of human genetic variation and suggest the potential to use large samples to uncover variation among closely spaced populations. Here we characterize genetic variation in a sample of 3,000 European individuals genotyped at over half a million variable DNA sites in the human genome. Despite low average levels of genetic differentiation among Europeans, we find a close correspondence between genetic and geographic distances; indeed, a geographical map of Europe arises naturally as an efficient two-dimensional summary of genetic variation in Europeans. The results emphasize that when mapping the genetic basis of a disease phenotype, spurious associations can arise if genetic structure is not properly accounted for. In addition, the results are relevant to the prospects of genetic ancestry testing; an individual's DNA can be used to infer their geographic origin with surprising accuracy-often to within a few hundred kilometres.

References

  1. Science. 1978 Sep 1;201(4358):786-92 [PMID: 356262]
  2. Nat Genet. 2005 Aug;37(8):868-72 [PMID: 16041375]
  3. Am J Hum Genet. 2006 Jul;79(1):1-12 [PMID: 16773560]
  4. Nat Genet. 2008 May;40(5):584-91 [PMID: 18391950]
  5. Nat Genet. 2008 May;40(5):646-9 [PMID: 18425127]
  6. Nature. 2007 Jun 7;447(7145):661-78 [PMID: 17554300]
  7. Nat Genet. 2006 Aug;38(8):904-9 [PMID: 16862161]
  8. Genet Epidemiol. 2002 Aug;23(2):181-96 [PMID: 12214310]
  9. Genome Res. 2005 Nov;15(11):1496-502 [PMID: 16251459]
  10. Nat Genet. 2008 Feb;40(2):149-51 [PMID: 18193046]
  11. Nat Genet. 2008 May;40(5):575-83 [PMID: 18391952]
  12. Science. 2008 Feb 22;319(5866):1100-4 [PMID: 18292342]
  13. Evolution. 1985 Jan;39(1):53-65 [PMID: 28563643]
  14. PLoS Genet. 2008 May 23;4(5):e1000078 [PMID: 18497854]
  15. PLoS Genet. 2008 Jan;4(1):e236 [PMID: 18208327]
  16. Genet Epidemiol. 2000;19 Suppl 1:S29-35 [PMID: 11055367]
  17. Am J Hum Genet. 2008 Sep;83(3):347-58 [PMID: 18760391]
  18. PLoS Genet. 2006 Dec;2(12):e190 [PMID: 17194218]
  19. Am J Hum Genet. 2007 Sep;81(3):559-75 [PMID: 17701901]
  20. Nat Rev Genet. 2008 May;9(5):356-69 [PMID: 18398418]
  21. Evolution. 1984 Nov;38(6):1358-1370 [PMID: 28563791]
  22. Nature. 2008 Feb 21;451(7181):998-1003 [PMID: 18288195]
  23. BMC Cardiovasc Disord. 2008 Mar 17;8:6 [PMID: 18366642]
  24. Nat Rev Genet. 2004 Aug;5(8):611-8 [PMID: 15266343]
  25. PLoS Genet. 2008 Jan;4(1):e4 [PMID: 18208329]
  26. Am J Hum Genet. 2007 May;80(5):948-56 [PMID: 17436249]
  27. Genetics. 2003 Aug;164(4):1567-87 [PMID: 12930761]

Grants

  1. R01 GM083606/NIGMS NIH HHS
  2. R01 GM083606-01/NIGMS NIH HHS
  3. R01 GM083606-02/NIGMS NIH HHS

MeSH Term

Emigration and Immigration
Europe
Genetic Variation
Genetics, Population
Genome, Human
Genome-Wide Association Study
Genotype
Geography
Humans
Phylogeny
Polymorphism, Single Nucleotide
Principal Component Analysis
Quantitative Trait, Heritable
Sample Size

Word Cloud

Created with Highcharts 10.0.0geneticvariationhumanstructurepopulationsamongDNAEuropeansgeographicEuroperesultscanwithinUnderstandingfundamentalinterestmedicalforensicanthropologicalsciencesAdvanceshigh-throughputgenotypingtechnologymarkedlyimprovedunderstandingglobalpatternssuggestpotentialuselargesamplesuncovercloselyspacedcharacterizesample3000EuropeanindividualsgenotypedhalfmillionvariablesitesgenomeDespitelowaveragelevelsdifferentiationfindclosecorrespondencedistancesindeedgeographicalmaparisesnaturallyefficienttwo-dimensionalsummaryemphasizemappingbasisdiseasephenotypespuriousassociationsariseproperlyaccountedadditionrelevantprospectsancestrytestingindividual'susedinferoriginsurprisingaccuracy-oftenhundredkilometresGenesmirrorgeography

Similar Articles

Cited By