PairMotifChIP

Introduction

Identifying conserved patterns in DNA sequences, known as motif discovery, is an important and challenging computational task. High-throughput sequencing data sets, which contain hundreds of sequences or more, can improve the accuracy of motif discovery but demand greater computing performance. To identify motifs efficiently in large DNA data sets, a new algorithm called PairMotifChIP is proposed; it extracts and combines pairs of l-mers in the input that have a relatively small Hamming distance. In particular, a method for rapidly extracting such pairs of l-mers is designed, which can be used not only in PairMotifChIP but also in other DNA data mining tasks with the same requirement. Experimental results on simulated data show that the proposed algorithm finds motifs successfully and runs faster than state-of-the-art motif discovery algorithms. Furthermore, the validity of the proposed algorithm has been verified on real data.
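
To make the idea of "pairs of l-mers with relatively small Hamming distance" concrete, the following is a minimal brute-force sketch in Python. It is not the rapid extraction method of PairMotifChIP, which this page does not describe, and it is unrelated to the tool's C/C++ source. The function names, the max_dist threshold, and the choice to pair l-mers only across different sequences are assumptions made for illustration.

```python
from itertools import combinations


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("l-mers must have equal length")
    return sum(x != y for x, y in zip(a, b))


def lmers(seq: str, l: int):
    """Yield (offset, l-mer) for every length-l window of seq."""
    for i in range(len(seq) - l + 1):
        yield i, seq[i:i + l]


def close_pairs(seqs, l, max_dist):
    """Brute force (illustrative only): collect pairs of l-mers taken from
    two different sequences whose Hamming distance is at most max_dist.
    Each l-mer is reported as (sequence index, offset, l-mer)."""
    pairs = []
    for (i, s), (j, t) in combinations(enumerate(seqs), 2):
        for p, x in lmers(s, l):
            for q, y in lmers(t, l):
                if hamming(x, y) <= max_dist:
                    pairs.append(((i, p, x), (j, q, y)))
    return pairs


if __name__ == "__main__":
    # Toy input; real ChIP-seq data sets hold hundreds of sequences or more.
    for pair in close_pairs(["ACGTAC", "TTACGA"], l=4, max_dist=1):
        print(pair)
```

This exhaustive comparison examines every pair of windows, so its cost grows quadratically with the total input length. That cost is the reason a faster pair-extraction method matters for large data sets.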

Publications

PairMotifChIP: A Fast Algorithm for Discovery of Patterns Conserved in Large ChIP-seq Data Sets.
Yu Q, Huo H, Feng D, 2016-01-01 - BioMed Research International

Credits

Qiang Yu	Developer	School of Computer Science and Technology, Xidian University, China
Hongwei Huo	Developer	School of Computer Science and Technology, Xidian University, China
Dazheng Feng	Investigator	School of Electronic Engineering, Xidian University, China

Summary

Accession	BT005269
Tool Type	Application
Category
Platforms	Linux/Unix
Technologies	C, C++
User Interface	Terminal Command Line
Download Count	0
Country/Region	China
Submitted By	Dazheng Feng