Abstract
Current exact single-linkage clustering algorithms have asymptotically quadratic complexity. We present algorithms for approximate single-linkage clustering with empirically near-linear scalability. We explore both graph index-based incremental nearest neighbor search and an iterative exploration scheme on the graph index approximating the MST of the reachability graph similar to Kruskal. As graph index, we use both the bottom layer and a combination of all layers of an HNSW as a stand-in for connected search graphs. We provide experiments comparing the clusterings to baselines such as exact single linkage implementation and an algorithm using metric tree-based searchers. We explore the impact of the HNSW hyperparameters on the performance in terms of running time and clustering quality and evaluate the empirical asymptotic complexity.
| Original language | English |
|---|---|
| Title of host publication | SISAP |
| Number of pages | 15 |
| Publication date | 7 Oct 2025 |
| Pages | 233-247 |
| ISBN (Print) | 978-3-032-06068-6 |
| ISBN (Electronic) | 978-3-032-06069-3 |
| Publication status | Published - 7 Oct 2025 |
| Event | International Conference on Similarity Search and Applications - Reykjavik, Iceland Duration: 1 Oct 2025 → 3 Oct 2025 Conference number: 18 https://www.sisap.org/2025 |
Conference
| Conference | International Conference on Similarity Search and Applications |
|---|---|
| Number | 18 |
| Country/Territory | Iceland |
| City | Reykjavik |
| Period | 01/10/2025 → 03/10/2025 |
| Internet address |
Keywords
- Clustering
- Hierarchical Clustering
- Approximate Clustering
Fingerprint
Dive into the research topics of 'Approximate Single-Linkage Clustering Using Graph-Based Indexes: MST-Based Approaches and Incremental Searchers.'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver