Clustering with t-SNE, provably.

George C Linderman, Stefan Steinerberger
Author Information
  1. George C Linderman: Program in Applied Mathematics, Yale University, New Haven, CT 06511, USA.
  2. Stefan Steinerberger: Department of Mathematics, Yale University, New Haven, CT 06511, USA.

Abstract

t-distributed Stochastic Neighborhood Embedding (t-SNE), a clustering and visualization method proposed by van der Maaten & Hinton in 2008, has rapidly become a standard tool in a number of natural sciences. Despite its overwhelming success, there is a distinct lack of mathematical foundations and the inner workings of the algorithm are not well understood. The purpose of this paper is to prove that t-SNE is able to recover well-separated clusters; more precisely, we prove that t-SNE in the 'early exaggeration' phase, an optimization technique proposed by van der Maaten & Hinton (2008) and van der Maaten (2014), can be rigorously analyzed. As a byproduct, the proof suggests novel ways for setting the exaggeration parameter and step size . Numerical examples illustrate the effectiveness of these rules: in particular, the quality of embedding of topological structures (e.g. the swiss roll) improves. We also discuss a connection to spectral clustering methods.

Keywords

References

  1. Cell. 2015 May 21;161(5):1202-1214 [PMID: 26000488]

Grants

  1. R01 HG008383/NHGRI NIH HHS
  2. T32 GM007205/NIGMS NIH HHS

Word Cloud

Created with Highcharts 10.0.0t-SNEclusteringvanderMaatenproposed&Hinton2008provespectralt-distributedStochasticNeighborhoodEmbeddingvisualizationmethodrapidlybecomestandardtoolnumbernaturalsciencesDespiteoverwhelmingsuccessdistinctlackmathematicalfoundationsinnerworkingsalgorithmwellunderstoodpurposepaperablerecoverwell-separatedclustersprecisely'earlyexaggeration'phaseoptimizationtechnique2014canrigorouslyanalyzedbyproductproofsuggestsnovelwayssettingexaggerationparameterstepsizeNumericalexamplesillustrateeffectivenessrules:particularqualityembeddingtopologicalstructuresegswissrollimprovesalsodiscussconnectionmethodsClusteringprovablyconvergenceratesdimensionalityreductiontheoreticalguarantees

Similar Articles

Cited By (52)