Introduction

For flow cytometry data, there are two common approaches to the unsupervised clustering problem: one is based on the finite mixture model and the other on spatial exploration of the histograms. The former is computationally slow and has difficulty identifying clusters of irregular shape. The latter cannot be applied directly to high-dimensional data, because the computational time and memory become unmanageable and the estimated histogram is unreliable. An algorithm without these two problems would be very useful.

In this article, we combine ideas from the finite mixture model and histogram spatial exploration. The new algorithm, which we call flowPeaks, can be applied directly to high-dimensional data and can identify irregularly shaped clusters. The algorithm first uses the K-means algorithm with a large K to partition the cell population into many small clusters. These partitioned data allow the generation of a smoothed density function using the finite mixture model. All local peaks are exhaustively searched by exploring the density function, and the cells are clustered by their associated local peak. flowPeaks is automatic, fast, reliable and robust to cluster shape and outliers. The algorithm has been applied to flow cytometry data and compared with state-of-the-art algorithms, including Misty Mountain, FLOCK, flowMeans, flowMerge and FLAME.

The R package flowPeaks is available at https://github.com/yongchao/flowPeaks.

Contact: yongchao.ge@mssm.edu

Supplementary data are available at Bioinformatics online.
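The three steps described above (K-means with a large K, a smoothed mixture density, and peak search) can be illustrated with a small Python sketch. This is not the flowPeaks implementation and does not use the R package's API. It assumes spherical Gaussian components, a fixed additive variance term (`smooth`) in place of the paper's covariance smoothing, a mean-shift style fixed-point update in place of the paper's local peak search, and a simple distance threshold (`merge_tol`) for deciding that two searches reached the same peak. All function and parameter names are hypothetical.

```python
import math
import random

def dist2(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean(pts):
    return tuple(sum(c) / len(pts) for c in zip(*pts))

def assign(points, centers):
    """Index of the nearest center for every point."""
    return [min(range(len(centers)), key=lambda j: dist2(p, centers[j]))
            for p in points]

def kmeans(points, k, iters=25):
    """Plain Lloyd iterations; centers start at evenly spaced data points."""
    step = len(points) // k
    centers = [points[i * step] for i in range(k)]
    for _ in range(iters):
        labels = assign(points, centers)
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = mean(members)
    return centers, assign(points, centers)

def fit_mixture(points, centers, labels, smooth):
    """One spherical Gaussian per K-means cluster: (weight, mean, variance)."""
    d = len(points[0])
    comps = []
    for j, mu in enumerate(centers):
        members = [p for p, l in zip(points, labels) if l == j]
        n = len(members)
        var = (sum(dist2(p, mu) for p in members) / (n * d) if n else 0.0) + smooth
        comps.append((n / len(points), mu, var))
    return comps

def climb(x, comps, iters=200):
    """Fixed-point ascent on the mixture density, started at x."""
    d = len(x)
    for _ in range(iters):
        num, den = [0.0] * d, 0.0
        for w, mu, var in comps:
            # Weight of component j in the stationarity condition of the density.
            r = w * math.exp(-dist2(x, mu) / (2 * var)) / var ** (d / 2 + 1)
            den += r
            for i in range(d):
                num[i] += r * mu[i]
        x = tuple(v / den for v in num)
    return x

def flowpeaks_sketch(points, k=8, smooth=1.0, merge_tol=0.5):
    centers, labels = kmeans(points, k)
    comps = fit_mixture(points, centers, labels, smooth)
    peaks, peak_of = [], {}
    for j in sorted(set(labels)):
        mode = climb(centers[j], comps)
        for idx, p in enumerate(peaks):
            if dist2(mode, p) < merge_tol ** 2:  # same peak reached again
                peak_of[j] = idx
                break
        else:
            peaks.append(mode)
            peak_of[j] = len(peaks) - 1
    # Each cell inherits the peak of its K-means cluster.
    return peaks, [peak_of[l] for l in labels]

# Synthetic data: two well-separated 2D blobs of 100 "cells" each.
rng = random.Random(0)
blob_a = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(100)]
blob_b = [(rng.gauss(50, 1), rng.gauss(50, 1)) for _ in range(100)]
points = blob_a + blob_b
peaks, cell_labels = flowpeaks_sketch(points)
print(len(peaks), "peaks found")
```

In this sketch the number of final clusters is not chosen in advance: it is the number of distinct peaks that the searches reach, which is the property that lets a peak-based method follow irregular cluster shapes. The values of `k`, `smooth` and `merge_tol` are illustrative only and would need tuning on real data.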

Publications

  1. flowPeaks: a fast unsupervised clustering for flow cytometry data via K-means and density peak finding.
    Ge Y, Sealfon SC, 2012-08-01 - Bioinformatics (Oxford, England)

Credits

  1. Yongchao Ge
    Developer

    Department of Neurology and Center of Translational System Biology, Mount Sinai School of Medicine

  2. Stuart C Sealfon
    Investigator

Summary

Accession: BT006814
Tool Type: Application
Category:
Platforms: Linux/Unix
Technologies: R
User Interface: Terminal Command Line
Download Count: 0
Submitted By: Stuart C Sealfon