Broad expertise retrieval in sparse data environments

Krisztian Balog, Toine Bogers, Leif Azzopardi, Maarten De Rijke, Antal Van Den Bosch

Publikation: Konference artikel i Proceeding eller bog/rapport kapitelKonferencebidrag i proceedingsForskningpeer review

Abstract

Expertise retrieval has been largely unexplored on data other than the W3C collection. At the same time, many intranets of universities and other knowledge-intensive organisations offer examples of relatively small but clean multilingual expertise data, covering broad ranges of expertise areas. We first present two main expertise retrieval tasks, along with a set of baseline approaches based on generative language modeling, aimed at finding expertise relations between topics and people. For our experimental evaluation, we introduce (and release) a new test set based on a crawl of a university site. Using this test set, we conduct two series of experiments. The first is aimed at determining the effectiveness of baseline expertise retrieval methods applied to the new test set. The second is aimed at assessing refined models that exploit characteristic features of the new test set, such as the organizational structure of the university, and the hierarchical structure of the topics in the test set. Expertise retrieval models are shown to be robust with respect to environments smaller than the W3C collection, and current techniques appear to be generalizable to other settings.

OriginalsprogEngelsk
TitelProceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
Antal sider8
Publikationsdato30 nov. 2007
Sider551-558
ISBN (Trykt)1595935975, 9781595935977
DOI
StatusUdgivet - 30 nov. 2007
Udgivet eksterntJa
Begivenhed30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07 - Amsterdam, Holland
Varighed: 23 jul. 200727 jul. 2007

Konference

Konference30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
Land/OmrådeHolland
ByAmsterdam
Periode23/07/200727/07/2007
Sponsor

Emneord

  • Expertise retrieval
  • Multilingual expertise data
  • Generative language modeling
  • Baseline approaches
  • Hierarchical topic structure

Fingeraftryk

Dyk ned i forskningsemnerne om 'Broad expertise retrieval in sparse data environments'. Sammen danner de et unikt fingeraftryk.

Citationsformater