A Bi-Criteria Approximation Algorithm for k-Means

Makarychev, Konstantin; Makarychev, Yury; Sviridenko, Maxim; Ward, Justin

doi:10.4230/LIPIcs.APPROX-RANDOM.2016.14

conference paper

A Bi-Criteria Approximation Algorithm for k-Means

Makarychev, Konstantin

•

Makarychev, Yury

•

Sviridenko, Maxim

more

2016

Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2016)

We consider the classical k-means clustering problem in the setting of bi-criteria approximation, in which an algorithm is allowed to output betak > k clusters, and must produce a clustering with cost at most alpha times the to the cost of the optimal set of k clusters. We argue that this approach is natural in many settings, for which the exact number of clusters is a priori unknown, or unimportant up to a constant factor. We give new bi-criteria approximation algorithms, based on linear programming and local search, respectively, which attain a guarantee alpha(beta) depending on the number betak of clusters that may be opened. Our guarantee alpha(beta) is always at most 9 + epsilon and improves rapidly with beta (for example: alpha(2) < 2.59, and alpha(3) < 1.4). Moreover, our algorithms have only polynomial dependence on the dimension of the input data, and so are applicable in high-dimensional settings.

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/139537

Type

conference paper

DOI

10.4230/LIPIcs.APPROX-RANDOM.2016.14

Authors

Makarychev, Konstantin

•

Makarychev, Yury

•

Sviridenko, Maxim

•

Ward, Justin

Editors

Jansen, Klaus

•

Mathieu, Claire

•

Rolim, José D.P.

•

Umans, Chris

Publication date

2016

Publisher

Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik

Published in

Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2016)

ISBN of the book

978-3-95977-018-7

Publisher place

Dagstuhl, Germany

Series title/Series vol.

Leibniz International Proceedings in Informatics (LIPIcs)

Start page

14:1

Subjects

k-means clustering

bicriteria approximat...

linear programming

local search

Peer reviewed

REVIEWED

EPFL units

THL2

Available on Infoscience

August 3, 2017