TY - GEN
T1 - Interactive Learning for Multimedia at Large
AU - Khan, Omar Shahbaz
AU - Jónsson, Björn Thór
AU - Rudinac, Stevan
AU - Zahálka, Jan
AU - Ragnarsdóttir, Hanna
AU - Þorleiksdóttir, Þórhildur
AU - Guðmundsson, Gylfi Þór
AU - Amsaleg, Laurent
AU - Worring, Marcel
PY - 2020/4
Y1 - 2020/4
N2 - Interactive learning has been suggested as a key method for addressing analytic multimedia tasks arising in several domains. Until recently, however, methods to maintain interactive performance at the scale of today’s media collections have not been addressed. We propose an interactive learning approach that builds on and extends the state of the art in user relevance feedback systems and high-dimensional indexing for multimedia. We report on a detailed experimental study using the ImageNet and YFCC100M collections, containing 14 million and 100 million images respectively. The proposed approach outperforms the relevant state-of-the-art approaches in terms of interactive performance, while improving suggestion relevance in some cases. In particular, even on YFCC100M, our approach requires less than 0.3 s per interaction round to generate suggestions, using a single computing core and less than 7 GB of main memory.
AB - Interactive learning has been suggested as a key method for addressing analytic multimedia tasks arising in several domains. Until recently, however, methods to maintain interactive performance at the scale of today’s media collections have not been addressed. We propose an interactive learning approach that builds on and extends the state of the art in user relevance feedback systems and high-dimensional indexing for multimedia. We report on a detailed experimental study using the ImageNet and YFCC100M collections, containing 14 million and 100 million images respectively. The proposed approach outperforms the relevant state-of-the-art approaches in terms of interactive performance, while improving suggestion relevance in some cases. In particular, even on YFCC100M, our approach requires less than 0.3 s per interaction round to generate suggestions, using a single computing core and less than 7 GB of main memory.
KW - Large multimedia collections
KW - Interactive multimodal learning
KW - YFCC100M
KW - High-dimensional indexing
KW - Large multimedia collections
KW - Interactive multimodal learning
KW - YFCC100M
KW - High-dimensional indexing
U2 - 10.1007/978-3-030-45439-5_33
DO - 10.1007/978-3-030-45439-5_33
M3 - Article in proceedings
SN - 978-3-030-45438-8
T3 - Lecture Notes in Computer Science
SP - 495
EP - 510
BT - Advances in Information Retrieval
PB - Springer
CY - Lisbon, Portugal
ER -