Solving the Correlation Cluster LP in Sublinear Time
Correlation Clustering is a fundamental and widely-studied problem in unsupervised learning and data mining. The input is a graph and the goal is to construct a clustering minimizing the number of inter-cluster edges plus the number of missing intra-cluster edges. Cao, Cohen-Addad, Lee, Li, Newman, and Vogl [STOC 2024] introduced the cluster LP for Correlation Clustering, which they argued captures the problem much more succinctly than previous linear programming formulations. However, the cluster LP has exponential size, with a variable for every possible set of vertices in the input graph. Nevertheless, they showed how to find a feasible solution for the cluster LP in time O(npoly(1/ε)) with objective value at most (1+ε) times the value of an optimal solution for the respective Correlation Clustering instance. Furthermore, they showed how to round a solution to the cluster LP, yielding a (1.437+ε)-approximation algorithm for the Correlation Clustering problem. The main technical result of this paper is a new approach to find a feasible solution for the cluster LP with objective value at most (1+ε) of the optimum in time O(2poly(1/ε) n), where n is the number of vertices in the graph. We also show how to implement the rounding within the same time bounds, thus achieving a fast (1.437+ε)-approximation algorithm for the Correlation Clustering problem. This bridges the gap between state-of-the-art methods for approximating Correlation Clustering and the recent focus on fast algorithms.
École Polytechnique Fédérale de Lausanne
2025-06-15
New York, NY, USA
979-8-4007-1510-5
1154
1165
REVIEWED
EPFL
| Event name | Event acronym | Event place | Event date |
STOC '25 | Prague, Czechia | 2025-06-23 - 2025-06-27 | |
| Funder | Funding(s) | Grant Number | Grant URL |
NSF | 2236669 | ||
Villum Fonden | 54451 | ||
Basic Algorithms Research Copenhagen (BARC) | |||
| Show more | |||