Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Solving the Correlation Cluster LP in Sublinear Time
 
conference paper

Solving the Correlation Cluster LP in Sublinear Time

Cao, Nairen
•
Cohen-Addad, Vincent
•
Lee, Euiwoong
Show more
June 15, 2025
STOC'25. Proceedings of the 57th Annual ACM Symposium on Theory of Computing
57th Annual ACM Symposium on Theory of Computing (STOC 2025)

Correlation Clustering is a fundamental and widely-studied problem in unsupervised learning and data mining. The input is a graph and the goal is to construct a clustering minimizing the number of inter-cluster edges plus the number of missing intra-cluster edges. Cao, Cohen-Addad, Lee, Li, Newman, and Vogl [STOC 2024] introduced the cluster LP for Correlation Clustering, which they argued captures the problem much more succinctly than previous linear programming formulations. However, the cluster LP has exponential size, with a variable for every possible set of vertices in the input graph. Nevertheless, they showed how to find a feasible solution for the cluster LP in time O(npoly(1/ε)) with objective value at most (1+ε) times the value of an optimal solution for the respective Correlation Clustering instance. Furthermore, they showed how to round a solution to the cluster LP, yielding a (1.437+ε)-approximation algorithm for the Correlation Clustering problem. The main technical result of this paper is a new approach to find a feasible solution for the cluster LP with objective value at most (1+ε) of the optimum in time O(2poly(1/ε) n), where n is the number of vertices in the graph. We also show how to implement the rounding within the same time bounds, thus achieving a fast (1.437+ε)-approximation algorithm for the Correlation Clustering problem. This bridges the gap between state-of-the-art methods for approximating Correlation Clustering and the recent focus on fast algorithms.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1145/3717823.3718181
Author(s)
Cao, Nairen
Cohen-Addad, Vincent
Lee, Euiwoong
Li, Shi
Lolck, David Rasmussen
Newman, Alantha
Thorup, Mikkel
Vogl, Lukas  

École Polytechnique Fédérale de Lausanne

Yan, Shuyi
Zhang, Hanwen
Date Issued

2025-06-15

Publisher

ACM

Publisher place

New York, NY, USA

Published in
STOC'25. Proceedings of the 57th Annual ACM Symposium on Theory of Computing
ISBN of the book

979-8-4007-1510-5

Start page

1154

End page

1165

Subjects

• Theory of computation → Facility location and clustering

•

Streaming, sublinear and near linear time algorithms

•

Massively parallel algorithms Clustering, Approximation Algorithms, Exponential-Size Linear Programming, Sublinear-Time Algorithm, MPC Algorithm

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
THL2  
Event nameEvent acronymEvent placeEvent date
57th Annual ACM Symposium on Theory of Computing (STOC 2025)

STOC '25

Prague, Czechia

2025-06-23 - 2025-06-27

FunderFunding(s)Grant NumberGrant URL

NSF

2236669

Villum Fonden

54451

Basic Algorithms Research Copenhagen (BARC)

Show more
Available on Infoscience
June 17, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/251417
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés