TY - GEN
T1 - Approximate Furthest Neighbor in High Dimensions
AU - Pagh, Rasmus
AU - Silvestri, Francesco
AU - Sivertsen, Johan von Tangen
AU - Skala, Matthew
PY - 2015
Y1 - 2015
N2 - Much recent work has been devoted to approximate nearest neighbor queries. Motivated by applications in recommender systems, we consider approximate furthest neighbor (AFN) queries. We present a simple, fast, and highly practical data structure for answering AFN queries in high-dimensional Euclidean space. We build on the technique of Indyk (SODA 2003), storing random projections to provide sublinear query time for AFN. However, we introduce a different query algorithm, improving on Indyk’s approximation factor and reducing the running time by a logarithmic factor. We also present a variation based on a query-independent ordering of the database points; while this does not have the provable approximation factor of the query-dependent data structure, it offers significant improvement in time and space complexity. We give a theoretical analysis, and experimental results.
AB - Much recent work has been devoted to approximate nearest neighbor queries. Motivated by applications in recommender systems, we consider approximate furthest neighbor (AFN) queries. We present a simple, fast, and highly practical data structure for answering AFN queries in high-dimensional Euclidean space. We build on the technique of Indyk (SODA 2003), storing random projections to provide sublinear query time for AFN. However, we introduce a different query algorithm, improving on Indyk’s approximation factor and reducing the running time by a logarithmic factor. We also present a variation based on a query-independent ordering of the database points; while this does not have the provable approximation factor of the query-dependent data structure, it offers significant improvement in time and space complexity. We give a theoretical analysis, and experimental results.
KW - Approximate furthest neighbor
KW - High-dimensional data structures
KW - Random projections
KW - Query optimization
KW - Recommender systems
KW - Approximate furthest neighbor
KW - High-dimensional data structures
KW - Random projections
KW - Query optimization
KW - Recommender systems
U2 - 10.1007/978-3-319-25087-8_1
DO - 10.1007/978-3-319-25087-8_1
M3 - Article in proceedings
SN - 978-3-319-25086-1
T3 - Lecture Notes in Computer Science
SP - 3
EP - 14
BT - Similarity Search and Applications
PB - Springer
ER -