MCAT: Motif Combining and Association Tool.

Yanshen Yang, Jeffrey A Robertson, Zhen Guo, Jake Martinez, Christy Coghlan, Lenwood S Heath
Author Information
  1. Yanshen Yang: Department of Computer Science, Virginia Tech, Blacksburg, Virginia.
  2. Jeffrey A Robertson: Department of Computer Science, Virginia Tech, Blacksburg, Virginia.
  3. Zhen Guo: Department of Computer Science, Virginia Tech, Blacksburg, Virginia.
  4. Jake Martinez: Department of Computer Science, Virginia Tech, Blacksburg, Virginia.
  5. Christy Coghlan: Department of Computer Science, Virginia Tech, Blacksburg, Virginia.
  6. Lenwood S Heath: Department of Computer Science, Virginia Tech, Blacksburg, Virginia.

Abstract

De novo motif discovery in biological sequences is an important and computationally challenging problem. A myriad of algorithms have been developed to solve it, with varying success, but even a small number of these tools can fail to reach a consensus. Because individual tools can be better suited to specific scenarios, an ensemble tool that combines the results of many algorithms can yield a more confident and complete result. We present MCAT (Motif Combining and Association Tool), a novel and fast ensemble tool for de novo motif discovery that combines six state-of-the-art motif discovery tools (MEME, BioProspector, DECOD, XXmotif, Weeder, and CMF). We apply MCAT to data sets of DNA sequences from various species and compare our results with those of two well-established ensemble motif-finding tools, EMD and DynaMIT. The experimental results show that MCAT identifies exact-match motifs in DNA sequences efficiently and performs significantly better in practice.

MeSH Terms

Algorithms
Animals
Computational Biology
Humans
Sequence Analysis, DNA
Software
