From Retrieval Status Values to Probabilities of Relevance for Advanced IR Applications

Nottelmann, Henrik; Fuhr, Norbert

doi:10.1023/A:1026080230789

Published

Information Retrieval systems typically sort the results by document retrieval status value (RSV). According to the Probability Ranking Principle, this ranking ensures optimum retrieval quality if the RSVs are monotonically increasing with the probabilities of relevance (as is the case, e.g., for probabilistic IR models). However, advanced applications like filtering or distributed retrieval require estimates of the actual probability of relevance. The relationship between the RSV of a document and its probability of relevance can be described by a 'normalisation' function, also called a 'mapping function', which maps the retrieval status value onto the probability of relevance. In this paper, we explore the use of linear and logistic mapping functions for different retrieval methods. In a series of upper-bound experiments, we compare the approximation quality of the different mapping functions. We also investigate the effect on the resulting retrieval quality in distributed retrieval (only merging, without resource selection). These experiments show that good estimates of the actual probability of relevance can be achieved, and that the logistic model outperforms the linear one. Retrieval quality for distributed retrieval is only slightly improved by using the logistic function.
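To make the idea of a mapping function concrete, the following is a minimal sketch of a linear and a logistic mapping from RSV to an estimated probability of relevance, each fitted to binary relevance judgements. It is an illustration under stated assumptions, not the authors' implementation: the functional forms are the textbook linear and logistic ones, the parameter names (`c0`, `c1`, `b0`, `b1`), the fitting procedures, and the toy data are all hypothetical, and the paper itself may parameterise or fit the functions differently.

```python
import math

def linear_mapping(rsv, c0, c1):
    """Linear map c0 + c1 * rsv, clipped to [0, 1] so the result is a valid probability."""
    return min(1.0, max(0.0, c0 + c1 * rsv))

def logistic_mapping(rsv, b0, b1):
    """Logistic map 1 / (1 + exp(-(b0 + b1 * rsv))); always strictly inside (0, 1)."""
    return 1.0 / (1.0 + math.exp(-(b0 + b1 * rsv)))

def fit_linear(rsvs, labels):
    """Ordinary least-squares fit of 0/1 relevance labels against RSVs (assumed procedure)."""
    n = len(rsvs)
    mean_x = sum(rsvs) / n
    mean_y = sum(labels) / n
    sxx = sum((x - mean_x) ** 2 for x in rsvs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(rsvs, labels))
    c1 = sxy / sxx
    return mean_y - c1 * mean_x, c1

def fit_logistic(rsvs, labels, steps=5000, lr=0.1):
    """Maximum-likelihood fit by plain gradient ascent (illustrative, not tuned)."""
    b0 = b1 = 0.0
    n = len(rsvs)
    for _ in range(steps):
        g0 = g1 = 0.0
        for x, y in zip(rsvs, labels):
            err = y - logistic_mapping(x, b0, b1)  # gradient of the log-likelihood
            g0 += err
            g1 += err * x
        b0 += lr * g0 / n
        b1 += lr * g1 / n
    return b0, b1

# Made-up toy data: RSVs with binary relevance judgements (1 = relevant).
rsvs = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
labels = [0, 0, 0, 1, 0, 1, 1, 1, 1]

c0, c1 = fit_linear(rsvs, labels)
b0, b1 = fit_logistic(rsvs, labels)

for rsv in (0.1, 0.5, 0.9):
    print(rsv, linear_mapping(rsv, c0, c1), logistic_mapping(rsv, b0, b1))
```

One design point the sketch makes visible: the unclipped linear fit can leave the interval [0, 1] at the extremes of the RSV range and therefore has to be clipped, whereas the logistic function stays inside (0, 1) by construction.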

In:

Information Retrieval 6 (2003), 4

Classification

Date of creation:

2004

URN:

urn:nbn:de:hbz:464-duett-07022004-1005304

PURL:

http://purl.oclc.org/NET/duett-07022004-100530

DOI:

10.1023/A:1026080230789

Language:

English

Resource type:

Text

Keywords:

none

Collection:

E-publications

Dewey Decimal Classification:

000 Computer science, knowledge, systems

Subject groups of the Deutsche Nationalbibliografie:

000 General, science

Institution:

Fakultät für Ingenieurwissenschaften, Informatik und Angewandte Kognitionswissenschaft

Information on first publication:

This is the Author Manuscript of an article published in Information Retrieval 6, pages 363–388 (2003).

The final authenticated version is available online at: http://dx.doi.org/10.1023/A:1026080230789

Version:

Author Manuscript

Cite

Citation form:

urn:nbn:de:hbz:464-duett-07022004-1005304

DuEPublico 2

Duisburg-Essen Publications online
