Introduction

We consider the correction of errors in nucleotide sequences produced by next-generation targeted amplicon sequencing. Next-generation sequencing (NGS) platforms provide large volumes of sequencing data thanks to their high throughput, but their error rates are often high. Denoising has thus become a crucial step for improving the reliability of downstream analyses. Our methodology, named DUDE-Seq, is derived from a general setting of reconstructing finite-valued source data corrupted by a discrete memoryless channel. It corrects substitution and homopolymer indel errors, the two major types of sequencing errors in most high-throughput targeted amplicon sequencing platforms. Our experimental studies with real and simulated datasets suggest that DUDE-Seq not only outperforms existing alternatives in error-correction capability and time efficiency, but also improves the reliability of downstream analyses. Further, DUDE-Seq can be applied to different sequencing platforms and analysis pipelines by simply updating the noise model. DUDE-Seq is available at http://data.snu.ac.kr/pub/dude-seq.
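
To make the "discrete memoryless channel" setting concrete, the following is a minimal Python sketch of a generic two-pass discrete universal denoiser for substitution errors only. It is not the DUDE-Seq implementation (which is listed as C/C++ and also handles homopolymer indels). The function name `dude_denoise`, the context length `k`, and the substitution rate `delta` are illustrative assumptions, as are the symmetric channel (each wrong base equally likely), the Hamming loss, and the restriction to the alphabet A, C, G, T.

```python
from collections import defaultdict

ALPHABET = "ACGT"  # assumption: reads contain only these four symbols


def dude_denoise(noisy, k=2, delta=0.05):
    """Two-pass denoiser sketch for a symmetric substitution channel.

    Assumed channel: P(z|x) = 1 - delta if z == x, else delta / 3.
    Assumed loss: Hamming. Requires delta < 0.75 so the channel is invertible.
    """
    n = len(noisy)
    idx = {s: j for j, s in enumerate(ALPHABET)}

    # The channel matrix is a*I + b*J (J = all-ones), so multiplying the
    # count vector m by its inverse has the closed form (m[x] - b*sum(m)) / a.
    b = delta / 3.0
    a = 1.0 - 4.0 * delta / 3.0

    # Pass 1: count each observed symbol within its two-sided context.
    counts = defaultdict(lambda: [0] * len(ALPHABET))
    for i in range(k, n - k):
        ctx = (noisy[i - k:i], noisy[i + 1:i + k + 1])
        counts[ctx][idx[noisy[i]]] += 1

    # Pass 2: for each position, pick the symbol with the lowest estimated
    # Hamming loss, i.e. the largest (estimated clean count) * P(z | x).
    out = list(noisy)
    for i in range(k, n - k):
        ctx = (noisy[i - k:i], noisy[i + 1:i + k + 1])
        m = counts[ctx]
        total = sum(m)
        z = noisy[i]

        def score(x):
            q = (m[idx[x]] - b * total) / a   # estimated clean count of x
            p = (1.0 - delta) if x == z else b  # P(observe z | clean x)
            return q * p

        out[i] = max(ALPHABET, key=score)

    # The first and last k symbols lack a full context and are left unchanged.
    return "".join(out)
```

As a usage example under the same assumptions, a periodic read such as `"ACGT" * 50` with one substituted base in the middle can be passed to `dude_denoise(read, k=2, delta=0.05)`. The isolated symbol is outvoted by the other occurrences of its context and restored. Changing the noise model amounts to replacing the assumed channel probabilities.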

Publications

  1. DUDE-Seq: Fast, flexible, and robust denoising for targeted amplicon sequencing.
    Lee B, Moon T, Yoon S, Weissman T, 2017-01-01 - PLoS ONE

Credits

  1. Byunghan Lee
    Developer

    Electrical and Computer Engineering, Seoul National University

  2. Taesup Moon
    Developer

    College of Information and Communication Engineering, Sungkyunkwan University

  3. Sungroh Yoon
    Developer

    Neurology and Neurological Sciences, Stanford University, United States of America

  4. Tsachy Weissman
    Investigator

    Electrical Engineering, Stanford University, United States of America

Summary

  Accession: BT000032
  Tool Type: Application
  Category: (none listed)
  Platforms: Windows
  Technologies: C, C++
  User Interface: Terminal Command Line
  Download Count: 0
  Country/Region: United States of America
  Submitted By: Tsachy Weissman