SparseAssembler

Introduction

BACKGROUND: The very large memory requirements for the construction of assembly graphs for de novo genome assembly limit current algorithms to super-computing environments. METHODS: In this paper, we demonstrate that constructing a sparse assembly graph which stores only a small fraction of the observed k-mers as nodes and the links between these nodes allows the de novo assembly of even moderately-sized genomes (~500 M) on a typical laptop computer. RESULTS: We implement this sparse graph concept in a proof-of-principle software package, SparseAssembler, utilizing a new sparse k-mer graph structure evolved from the de Bruijn graph. We test our SparseAssembler with both simulated and real data, achieving ~90% memory savings and retaining high assembly accuracy, without sacrificing speed in comparison to existing de novo assemblers.

Publications

Exploiting sparseness in de novo genome assembly.
Cite this
Ye C, Ma ZS, Cannon CH, Pop M, Yu DW, 2012-01-01 - BMC bioinformatics

Credits

Chengxi Ye
Developer
Zhanshan Sam Ma
Developer
Charles H Cannon
Developer
Mihai Pop
Developer
Douglas W Yu
Investigator

Community Ratings

Usability	Efficiency	Reliability	Rated By
			0 user
Sign in to rate

Summary

Accession	BT000346
Tool Type	Application
Category
Platforms	Linux/Unix
Technologies
User Interface	Terminal Command Line
Download Count	0
Submitted By	Douglas W Yu

SparseAssembler

Introduction

Publications

Exploiting sparseness in de novo genome assembly. Cite this

Credits

Community Ratings

Exploiting sparseness in de novo genome assembly.
Cite this