Introduction

In recent years, many studies focused on the description and comparison of large sets of related bacteriophage genomes. Due to the peculiar mosaic structure of these genomes, few informative approaches for comparing whole genomes exist: dot plots diagrams give a mostly qualitative assessment of the similarity/dissimilarity between two or more genomes, and clustering techniques are used to classify genomes. Multiple alignments are conspicuously absent from this scene. Indeed, whole genome aligners interpret lack of similarity between sequences as an indication of rearrangements, insertions, or losses. This behavior makes them ill-prepared to align bacteriophage genomes, where even closely related strains can accomplish the same biological function with highly dissimilar sequences.In this paper, we propose a multiple alignment strategy that exploits functional collinearity shared by related strains of bacteriophages, and uses partial orders to capture mosaicism of sets of genomes. As classical alignments do, the computed alignments can be used to predict that genes have the same biological function, even in the absence of detectable similarity. The Alpha aligner implements these ideas in visual interactive displays, and is used to compute several examples of alignments of Staphylococcus aureus and Mycobacterium bacteriophages, involving up to 29 genomes. Using these datasets, we prove that Alpha alignments are at least as good as those computed by standard aligners. Comparison with the progressive Mauve aligner - which implements a partial order strategy, but whose alignments are linearized - shows a greatly improved interactive graphic display, while avoiding misalignments.Multiple alignments of whole bacteriophage genomes work, and will become an important conceptual and visual tool in comparative genomics of sets of related strains. A python implementation of Alpha, along with installation instructions for Ubuntu and OSX, is available on bitbucket (https://bitbucket.org/thekswenson/alpha).

Publications

  1. Aligning the unalignable: bacteriophage whole genome alignments.
    Cite this
    Bérard S, Chateau A, Pompidor N, Guertin P, Bergeron A, Swenson KM, 2016-01-01 - BMC bioinformatics

Credits

  1. Sèverine Bérard
    Developer

    LIRMM, CNRS - Univ. Montpellier

  2. Annie Chateau
    Developer

    IBC Institut de Biologie Computationnelle, Montpellier

  3. Nicolas Pompidor
    Developer

    LIRMM, CNRS - Univ. Montpellier

  4. Paul Guertin
    Developer

    Département de mathématiques, Collège André-Grasset

  5. Anne Bergeron
    Developer

    LaCIM, Université du Québec à Montréal

  6. Krister M Swenson
    Investigator

    IBC Institut de Biologie Computationnelle, Montpellier

Community Ratings

UsabilityEfficiencyReliabilityRated By
0 user
Sign in to rate
Summary
AccessionBT006988
Tool TypeApplication
Category
PlatformsLinux/Unix
Technologies
User InterfaceTerminal Command Line
Download Count0
Submitted ByKrister M Swenson