Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Size Matters: Cardinality-Constrained Clustering and Outlier Detection via Conic Optimization
 
research article

Size Matters: Cardinality-Constrained Clustering and Outlier Detection via Conic Optimization

Rujeerapaiboon, Napat  
•
Schindler, Kilian  
•
Kuhn, Daniel  
Show more
2019
SIAM Journal on Optimization

Plain vanilla K-means clustering is prone to produce unbalanced clusters and suffers from outlier sensitivity. To mitigate both shortcomings, we formulate a joint outlier-detection and clustering problem, which assigns a prescribed number of datapoints to an auxiliary outlier cluster and performs cardinality-constrained K-means clustering on the residual dataset. We cast this problem as a mixed-integer linear program (MILP) that admits tractable semidefinite and linear programming relaxations. We propose deterministic rounding schemes that transform the relaxed solutions to high quality solutions for the MILP. We prove that these solutions are optimal in the MILP if a cluster separation condition holds. To our best knowledge, we propose the first tractable solution scheme for the joint outlier-detection and clustering problem with optimality guarantees.

  • Details
  • Metrics
Type
research article
DOI
10.1137/17M1150670
Author(s)
Rujeerapaiboon, Napat  
Schindler, Kilian  
Kuhn, Daniel  
Wiesemann, Wolfram
Date Issued

2019

Published in
SIAM Journal on Optimization
Volume

29

Issue

2

Start page

1211

End page

1239

Subjects

Semidefinite programming

•

K-means clustering

•

Outlier detection

•

Optimality guarantee

Note

Available from Optimization Online

URL

URL

http://www.optimization-online.org/DB_HTML/2017/05/6027.html
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
RAO  
Available on Infoscience
May 23, 2017
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/137495
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés