StriDe

Introduction

String and de Bruijn graphs are two graph models used by most genome assemblers. At present, none of the existing assemblers clearly outperforms the others across all datasets. We found that although a string graph can make use of entire reads for resolving repeats, de Bruijn graphs can naturally assemble through regions that are error-prone due to sequencing bias.

We developed a novel assembler called StriDe that has advantages of both string and de Bruijn graphs. First, the reads are decomposed adaptively only in error-prone regions. Second, each paired-end read is extended into a long read directly using an FM-index. The decomposed and extended reads are used to build an assembly graph. In addition, several essential components of an assembler were designed or improved. The resulting assembler was fully parallelized, tested and compared with state-of-the-art assemblers using benchmark datasets. The results indicate that contiguity of StriDe is comparable with top assemblers on both short-read and long-read datasets, and the assembly accuracy is high in comparison with the others.

Availability: https://github.com/ythuang0522/StriDe
Contact: ythuang@cs.ccu.edu.tw
Supplementary data are available at Bioinformatics online.
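
To make the de Bruijn side of the description above concrete, here is a minimal Python sketch of decomposing reads into k-mers and linking them into a de Bruijn graph. It is an illustration only and not StriDe's implementation: the function names `kmers` and `build_de_bruijn`, the toy reads, and the choice of k=4 are all assumptions, and it decomposes every read in full, whereas StriDe decomposes reads adaptively only in error-prone regions.

```python
from collections import defaultdict


def kmers(read, k):
    """Yield every overlapping k-mer of a read, left to right."""
    for i in range(len(read) - k + 1):
        yield read[i:i + k]


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer node to the set of (k-1)-mer nodes that follow it.

    Hypothetical helper: each k-mer contributes one edge from its
    (k-1)-length prefix to its (k-1)-length suffix.
    """
    graph = defaultdict(set)
    for read in reads:
        for kmer in kmers(read, k):
            graph[kmer[:-1]].add(kmer[1:])
    return graph


# Toy reads (made up for illustration) that overlap on "GTAC".
reads = ["ACGTAC", "GTACGA"]
graph = build_de_bruijn(reads, k=4)

for node in sorted(graph):
    print(node, "->", sorted(graph[node]))
```

In this toy graph the node `ACG` has two successors, a branch created by the repeated `ACG`. A string graph, by contrast, keeps whole reads as nodes, which is why the abstract notes that it can use entire reads to resolve repeats.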

Publications

Integration of string and de Bruijn graphs for genome assembly.
Huang YT, Liao CF, 2016-05-01 - Bioinformatics (Oxford, England)

Credits

Yao-Ting Huang
Developer
Department of Computer Science and Information Engineering, National Chung Cheng University, Taiwan, Province of China
Chen-Fu Liao
Investigator
Department of Computer Science and Information Engineering, National Chung Cheng University, Taiwan, Province of China

Summary

Accession	BT006405
Tool Type	Application
Category
Platforms	Linux/Unix
Technologies
User Interface	Terminal Command Line
Download Count	0
Country/Region	Taiwan, Province of China
Submitted By	Chen-Fu Liao
