Skip to main navigation Skip to search Skip to main content

The Curious Case of High-Dimensional Indexing as a File Structure: A Case Study of eCP-FS

Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

Abstract

While approximate nearest-neighbor (ANN) search is an integral part of the modern multimedia analytics pipeline, the ever-hungry AI-models may frequently starve the ANN structures of resources, particularly memory. We present a novel white-box implementation of the disk-based hierarchical eCP index, called eCP-FS, that extends the disk-based strategy beyond merely storing data on disk, and instead implements the entire structure as an overlay file system using Zarr. This maps the (normally complex) index structure intuitively to a familiar hierarchical folder structure, which in turn makes the index much easier to visualize and analyse than the typical in-memory black-box structures of other algorithms. We furthermore implement incremental retrieval over eCP-FS, which benefits even more from file-system caching. Using an experimental benchmark inspired by live retrieval competitions, we show that despite trading raw speed for reduced memory footprint, eCP-FS is still a competitive option in the modern day analytics pipeline.
Original languageEnglish
Title of host publicationSimilarity Search and Applications
EditorsGiuseppe Amato, Vladimir Mic, Agma Traina, Nicola Messina, Laurent Amsaleg, Gylfi Þór Guðmundsson, Björn Þór Jónsson, Lucia Vadicamo
Number of pages9
Place of PublicationCham
PublisherSpringer Nature Switzerland
Publication date8 Oct 2025
Pages303-311
ISBN (Print)978-3-032-06068-6
ISBN (Electronic)978-3-032-06069-3
DOIs
Publication statusPublished - 8 Oct 2025
Externally publishedYes
EventSimilarity Search and Applications - Bologna, Italy
Duration: 5 Oct 20227 Oct 2022
Conference number: 15th

Conference

ConferenceSimilarity Search and Applications
Number15th
Country/TerritoryItaly
CityBologna
Period05/10/202207/10/2022
SeriesLNCS
Volume16134

Keywords

  • High-dimensional indexing
  • Resource Constrained Search
  • Incremental Retrieval
  • Disk-based ANN

Fingerprint

Dive into the research topics of 'The Curious Case of High-Dimensional Indexing as a File Structure: A Case Study of eCP-FS'. Together they form a unique fingerprint.

Cite this