Approximate Single-Linkage Clustering Using Graph-Based Indexes: MST-Based Approaches and Incremental Searchers.

Publikation: Konference artikel i Proceeding eller bog/rapport kapitelKonferencebidrag i proceedingsForskningpeer review

Abstract

Current exact single-linkage clustering algorithms have asymptotically quadratic complexity. We present algorithms for approximate single-linkage clustering with empirically near-linear scalability. We explore both graph index-based incremental nearest neighbor search and an iterative exploration scheme on the graph index approximating the MST of the reachability graph similar to Kruskal. As graph index, we use both the bottom layer and a combination of all layers of an HNSW as a stand-in for connected search graphs. We provide experiments comparing the clusterings to baselines such as exact single linkage implementation and an algorithm using metric tree-based searchers. We explore the impact of the HNSW hyperparameters on the performance in terms of running time and clustering quality and evaluate the empirical asymptotic complexity.
OriginalsprogEngelsk
TitelSISAP
Antal sider15
Publikationsdato7 okt. 2025
Sider233-247
ISBN (Trykt)978-3-032-06068-6
ISBN (Elektronisk)978-3-032-06069-3
StatusUdgivet - 7 okt. 2025

Fingeraftryk

Dyk ned i forskningsemnerne om 'Approximate Single-Linkage Clustering Using Graph-Based Indexes: MST-Based Approaches and Incremental Searchers.'. Sammen danner de et unikt fingeraftryk.

Citationsformater