Introduction

Inference of population structure and individual ancestry is important both for population genetics and for association studies. With next generation sequencing technologies it is possible to obtain genetic data for all accessible genetic variations in the genome. Existing methods for admixture analysis rely on known genotypes. However, individual genotypes cannot be inferred from low-depth sequencing data without introducing errors. This article presents a new method for inferring an individual's ancestry that takes the uncertainty introduced in next generation sequencing data into account. This is achieved by working directly with genotype likelihoods that contain all relevant information of the unobserved genotypes. Using simulations as well as publicly available sequencing data, we demonstrate that the presented method has great accuracy even for very low-depth data. At the same time, we demonstrate that applying existing methods to genotypes called from the same data can introduce severe biases. The presented method is implemented in the NGSadmix software available at http://www.popgen.dk/software.

Publications

  1. Estimating individual admixture proportions from next generation sequencing data.
    Cite this
    Skotte L, Korneliussen TS, Albrechtsen A, 2013-11-01 - Genetics

Credits

  1. Line Skotte
    Developer

    The Bioinformatics Centre, Department of Biology

  2. Thorfinn Sand Korneliussen
    Developer

  3. Anders Albrechtsen
    Investigator

Community Ratings

UsabilityEfficiencyReliabilityRated By
0 user
Sign in to rate
Summary
AccessionBT005127
Tool TypeApplication
Category
PlatformsLinux/Unix
TechnologiesC, C++
User InterfaceTerminal Command Line
Download Count0
Submitted ByAnders Albrechtsen