ParaPindel: A Parallel Detection Framework for Structural Variation

Introduction

ParaPindel is an optimized version of serial Pindel that uses distributed multi-processing and shared-memory multi-threading for efficient, large-scale detection of structural variation in human whole-genome sequencing data. ParaPindel divides each chromosome into multiple small windows of fixed length so that detection can run in parallel across windows and across chromosomes. A shorter cross-window is introduced to handle structural variations that may lie at the edge of a window. The experimental results show that ParaPindel shortens the time to detect an individual's genome-wide structural variation from 186 hours to 33 minutes while keeping the detection results basically consistent. Using 256 processes on 128 nodes of the TH-1HN supercomputer, the speedup reached 163 times and the parallel efficiency reached 69.74%.
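
The windowing scheme described above can be illustrated with a minimal C++ sketch. It is not taken from ParaPindel's source code: the `Window` struct, the `make_windows` function, and all parameter names are hypothetical, and the sketch assumes that each cross-window is a short interval centered on the boundary between two adjacent fixed-length windows.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Half-open interval [start, end) on one chromosome, in base-pair coordinates.
struct Window {
    std::int64_t start;
    std::int64_t end;
    bool is_cross;  // true for a short window that straddles a boundary
};

// Hypothetical helper (not ParaPindel's API): split a chromosome of length
// chrom_len into fixed-length windows, and place one shorter cross-window
// of length cross_len over every internal boundary.
std::vector<Window> make_windows(std::int64_t chrom_len,
                                 std::int64_t window_len,
                                 std::int64_t cross_len) {
    assert(chrom_len >= 0 && window_len > 0);
    assert(cross_len > 0 && cross_len < window_len);

    std::vector<Window> windows;
    for (std::int64_t start = 0; start < chrom_len; start += window_len) {
        // The last window is truncated at the end of the chromosome.
        const std::int64_t end = std::min(start + window_len, chrom_len);
        windows.push_back({start, end, false});

        // Assumption: the cross-window is centered on the boundary `end`
        // and clamped to the chromosome; no cross-window after the last one.
        if (end < chrom_len) {
            const std::int64_t half = cross_len / 2;
            windows.push_back({std::max<std::int64_t>(end - half, 0),
                               std::min(end - half + cross_len, chrom_len),
                               true});
        }
    }
    return windows;
}
```

Under these assumptions, every entry in the returned list is an independent unit of work that could be handed to a separate process or thread. Calls found by both a regular window and its neighboring cross-window would presumably need to be merged afterwards; how ParaPindel does this is not stated here.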

Publications

No Publication Information

Credits

  1. Yaning Yang yangyn@hnu.edu.cn
    Investigator, Developer

    College of Computer Science and Electronic Engineering, Hunan University, China

  2. Shaoliang Peng slpeng@hnu.edu.cn
    Investigator, Developer

    College of Computer Science and Electronic Engineering, Hunan University, China

Summary

Accession: BT007209
Tool Type: Framework
Category: Structural variant detection
Platforms: Linux/Unix
Technologies: C++
User Interface: Terminal Command Line
Input Data: BAM, FASTA
Download Count: 0
Country/Region: China
Submitted By: Shaoliang Peng

Funding

2018YFC0910400