Evaluating different methods of estimating retrieval quality for resource selection

Nottelmann, Henrik; Fuhr, Norbert

doi:10.1145/860435.860489

Veröffentlicht

Evaluating different methods of estimating retrieval quality for resource selection

In a federated digital library system, it is too expensive to query every accessible library. Resource selection is the task to decide to which libraries a query should be routed. Most existing resource selection algorithms compute a library ranking in a heuristic way. In contrast, the decision-theoretic framework (DTF) follows a different approach on a better theoretic foundation: It computes a selection which minimises the overall costs (e.g. retrieval quality, time, money) of the distributed retrieval. For estimating retrieval quality the recall-precision function is proposed. In this paper, we introduce two new methods: The first one computes the empirical distribution of the probabilities of relevance from a small library sample, and assumes it to be representative for the whole library. The second method assumes that the indexing weights follow a normal distribution, leading to a normal distribution for the document scores. Furthermore, we present the first evaluation of DTF by comparing this theoretical approach with the heuristical stateof- the-art system CORI; here we find that DTF outperforms CORI in most cases.

Vorschau

Einordnung

Konferenz:

SIGIR03: The 26th ACM/SIGIR International Symposium on Information Retrieval; 28 July 2003- 1 August 2003; Toronto Canada

Datum der Erstellung:

01.07.2003

Datum der Veröffentlichung:

10.08.2012

URN:

urn:nbn:de:hbz:464-20120810-132150-7

PURL:

http://purl.oclc.org/NET/duett-07062004-110231

DOI:

10.1145/860435.860489

Sprache:

Englisch

Ressourcentyp:

Text

Schlagwörter:

normal distribution; desicion-theoretic framework; formal models; Resource selection

Kollektion:

E-Publikationen

Dewey Dezimal-Klassifikation:

004 Datenverarbeitung; Informatik

Sachgruppen der Deutschen Nationalbibliographie:

004 Informatik

Einrichtung:

Fakultät für Ingenieurwissenschaften, Informatik und Angewandte Kognitionswissenschaft, Informatik, Interaktive Systeme / Interaktionsdesign

Informationen zur Erstveröffentlichung:

Callan, Jamie (ed.): SIGIR '03: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval. New York: ACM, 2003, 290–297. - ISBN: 978-1-58113-646-3

Online also available at: https://doi.org/10.1145/860435.860489

auf die Merkliste