Introduction

Many forms of variations exist in the human genome including single nucleotide polymorphism, small insert/deletion (DEL) (indel) and structural variation (SV). Somatically acquired SV may regulate the expression of tumor-related genes and result in cell proliferation and uncontrolled growth, eventually inducing tumor formation. Virus integration with host genome sequence is a type of SV that causes the related gene instability and normal cells to transform into tumor cells. Cancer SVs and viral integration sites must be discovered in a genome-wide scale for clarifying the mechanism of tumor occurrence and development.In this paper, we propose a new tool called seeksv to detect somatic SVs and viral integration events. Seeksv simultaneously uses split read signal, discordant paired-end read signal, read depth signal and the fragment with two ends unmapped. Seeksv can detect DEL, insertion, inversion and inter-chromosome transfer at single-nucleotide resolution. Different types of sequencing data, such as single-end sequencing data or paired-end sequencing data can accommodate to detect SV. Seeksv develops a rescue model for SV with breakpoints located in sequence homology regions. Results on simulated and real data from the 1000 Genomes Project and esophageal squamous cell carcinoma samples show that seeksv has higher efficiency and precision compared with other similar software in detecting SVs. For the discovery of hepatitis B virus integration sites from probe capture data, the verified experiments show that more than 90% viral integration sequences detected by seeksv are true.seeksv is implemented in C ++ and can be downloaded from https://github.com/qkl871118/seeksv CONTACT: : dragonbw@163.comSupplementary information: Supplementary data are available at Bioinformatics online.

Publications

  1. Seeksv: an accurate tool for somatic structural variation and virus integration detection.
    Cite this
    Liang Y, Qiu K, Liao B, Zhu W, Huang X, Li L, Chen X, Li K, 2017-01-01 - Bioinformatics (Oxford, England)

Credits

  1. Ying Liang
    Developer

    BGI, Shenzhen, China

  2. Kunlong Qiu
    Developer

    BGI, Shenzhen, China

  3. Bo Liao
    Developer

    College of Information Science and Engineering, Hunan University, China

  4. Wen Zhu
    Developer

    College of Information Science and Engineering, Hunan University, China

  5. Xuanlin Huang
    Developer

    BGI, Shenzhen, China

  6. Lin Li
    Developer

    BGI, Shenzhen, China

  7. Xiangtao Chen
    Developer

    College of Information Science and Engineering, Hunan University, China

  8. Keqin Li
    Investigator

    Department of Computer Science State, University of New York, United States of America

Community Ratings

UsabilityEfficiencyReliabilityRated By
0 user
Sign in to rate
Summary
AccessionBT002208
Tool TypeApplication
Category
PlatformsLinux/Unix
TechnologiesC++
User InterfaceTerminal Command Line
Download Count0
Country/RegionUnited States of America
Submitted ByKeqin Li