Introduction

Protein sequence analysis is dominated by tools rooted in substitution matrices and alignments. Methods of quantitative characterization provide a complementary approach. Their major advantage is that quantitative properties define a multidimensional solution space in which sequences can be related to each other and differences can be meaningfully interpreted.

Quantiprot is a Python software package that provides a simple and consistent interface to multiple methods for quantitative characterization of protein sequences. The package can be used to calculate dozens of characteristics, either directly from sequences or from physico-chemical properties of amino acids. Besides basic measures, Quantiprot performs quantitative analysis of recurrence and determinism in the sequence, calculates the distribution of n-grams, and computes the Zipf's law coefficient.

We propose three main fields of application for the Quantiprot package. First, quantitative characteristics can be used in alignment-free similarity searches and in clustering of large and/or divergent sequence sets. Second, a feature space defined by quantitative properties can be used in comparative studies of protein families and organisms. Third, the feature space can be used for evaluating generative models, where a large number of sequences generated by the model can be compared to actually observed sequences.
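
Two of the measures named above, the n-gram distribution and the Zipf's law coefficient, can be illustrated with a minimal standard-library sketch. This is not Quantiprot's API: the function names `ngram_counts` and `zipf_slope`, the toy sequence, and the choice to estimate the coefficient as the least-squares slope of log(frequency) against log(rank) are all assumptions made for illustration, and the package may define these quantities differently.

```python
import math
from collections import Counter


def ngram_counts(sequence, n=2):
    """Count overlapping n-grams in a sequence (hypothetical helper)."""
    return Counter(sequence[i:i + n] for i in range(len(sequence) - n + 1))


def zipf_slope(counts):
    """Least-squares slope of log(frequency) against log(rank).

    Assumption: the Zipf's law coefficient is taken to be this slope;
    other definitions and fitting procedures exist.
    """
    freqs = sorted(counts.values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct n-grams")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(freq) for freq in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


# Hypothetical toy sequence, for illustration only.
seq = "MKVLAAGIVGLLLAAGKVLMKV"
bigrams = ngram_counts(seq, n=2)
slope = zipf_slope(bigrams)
```

Each sequence then maps to a small set of numbers such as `slope`, which is the kind of feature that can be used to place sequences in a shared quantitative space.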

Publications

  1. Quantiprot - a Python package for quantitative analysis of protein sequences.
    Konopka BM, Marciniak M, Dyrka W, 2017-07-01 - BMC Bioinformatics

Credits

  1. Bogumił M Konopka
    Developer

    Katedra Inżynierii Biomedycznej, Wydział Podstawowych Problemów Techniki, Poland

  2. Marta Marciniak
    Developer

    Katedra Inżynierii Biomedycznej, Wydział Podstawowych Problemów Techniki, Poland

  3. Witold Dyrka
    Investigator

    Katedra Inżynierii Biomedycznej, Wydział Podstawowych Problemów Techniki, Poland

Summary

  Accession: BT006834
  Tool Type: Application
  Category:
  Platforms: Linux/Unix
  Technologies:
  User Interface: Terminal Command Line
  Download Count: 0
  Country/Region: Poland
  Submitted By: Witold Dyrka