Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Nearly-Tight and Oblivious Algorithms for Explainable Clustering
 
conference paper

Nearly-Tight and Oblivious Algorithms for Explainable Clustering

Gamlath, Buddhima  
•
Jia, Xinrui  
•
Polak, Adam Teodor  
Show more
2021
Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021
35th Conference on Neural Information Processing Systems, NeurIPS 2021

We study the problem of explainable clustering in the setting first formalized by Dasgupta, Frost, Moshkovitz, and Rashtchian (ICML 2020). A k-clustering is said to be explainable if it is given by a decision tree where each internal node splits data points with a threshold cut in a single dimension (feature), and each of the k leaves corresponds to a cluster. We give an algorithm that outputs an explainable clustering that loses at most a factor of O(log2 k) compared to an optimal (not necessarily explainable) clustering for the k-medians objective, and a factor of O(k log2 k) for the k-means objective. This improves over the previous best upper bounds of O(k) and O(k2), respectively, and nearly matches the previous Ω(log k) lower bound for k-medians and our new Ω(k) lower bound for k-means. The algorithm is remarkably simple. In particular, given an initial not necessarily explainable clustering in Rd, it is oblivious to the data points and runs in time O(dk log2 k), independent of the number of data points n. Our upper and lower bounds also generalize to objectives given by higher ℓp-norms. © 2021 Neural information processing systems foundation.

  • Details
  • Metrics
Type
conference paper
Scopus ID

2-s2.0-85131865084

Author(s)
Gamlath, Buddhima  
Jia, Xinrui  
Polak, Adam Teodor  
Svensson, Ola Nils Anders  
Date Issued

2021

Published in
Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021
Series title/Series vol.

Advances in Neural Information Processing Systems; 34

Volume

34

Start page

28929

End page

28939

URL

Link to conference paper

https://proceedings.neurips.cc/paper/2021/hash/f24ad6f72d6cc4cb51464f2b29ab69d3-Abstract.html
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
DISOPT  
Event nameEvent placeEvent date
35th Conference on Neural Information Processing Systems, NeurIPS 2021

virtual

December 6-14, 2021

Available on Infoscience
November 17, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/192309
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés