Exploiting Hyperlinks to Learn a Retrieval Model

Grangier, David; Bengio, Samy

2005

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Information Retrieval (IR) aims at solving a ranking problem: given a query $q$ and a corpus $C$, the documents of $C$ should be ranked such that the documents relevant to $q$ appear above the others. This task is generally performed by ranking the documents $d \in C$ according to their similarity with respect to $q$, $sim (q,d)$. The identification of an effective function $a,b \to sim(a,b)$ could be performed using a large set of queries with their corresponding relevance assessments. However, such data are especially expensive to label, thus, as an alternative, we propose to rely on hyperlink data which convey analogous semantic relationships. We then empirically show that a measure $sim$ inferred from hyperlinked documents can actually outperform the state-of-the-art {\em Okapi} approach, when applied over a non-hyperlinked retrieval corpus.

Details

Title Exploiting Hyperlinks to Learn a Retrieval Model

Author(s) Grangier, David ; Bengio, Samy

Published in NIPS Workshop on Learning to Rank

Conference NIPS Workshop on Learning to Rank

Date 2005

Publisher Whistler, Canada

Keywords

Speech

Additional link URL; Related documents

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL
Published

Record creation date 2006-03-10

Actions

Preview

Select file: