A Bayesian Framework for XML Information Retrieval: Searching and Learning with the INEX Collection

Benjamin Piwowarski,Patrick Gallinari

doi:10.1007/s10791-005-0751-6

Abstract

Most recent document standards like XML rely on structured representations. On the other hand, current information retrieval systems have been developed for flat document representations and cannot be easily extended to cope with more complex document types. The design of such systems is still an open problem. We present a new model for structured document retrieval which allows computing scores of document parts. This model is based on Bayesian networks whose conditional probabilities are learnt from a labelled collection of structured documents--which is composed of documents, queries and their associated assessments. Training these models is a complex machine learning task and is not standard. This is the focus of the paper: we propose here to train the structured Bayesian Network model using a cross-entropy training criterion. Results are presented on the INEX corpus of XML documents.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Bayesian Framework for XML Information Retrieval: Searching and Learning with the INEX Collection

Abstract

Talk to us

Similar Papers

More From: Information Retrieval

Lead the way for us

Journal: Information Retrieval	Publication Date: Dec 1, 2005
Citations: 35

Similar Papers

A Machine Learning Model for Information Retrieval with Structured Documents
Benjamin Piwowarski ... Patrick Gallinari
-
Benjamin Piwowarski, et. al.Benjamin Piwowarski ... Patrick Gallinari
05 Jul 2003
05 Jul 2003

Shifts of interactive intentions and information-seeking strategies in interactive information retrieval
Hong (Iris) Xie
Journal of the American Society for Information Science | VOL. 51
Hong (Iris) XieHong (Iris) Xie
01 Jan 1999
Journal of the American Society for Information Science | VOL. 51

Image retrieval: Benchmarking visual information indexing and retrieval systems
Abebe Rorissa
Bulletin of the American Society for Information Science and Technology | VOL. 33
Abebe RorissaAbebe Rorissa
01 Feb 2007
Bulletin of the American Society for Information Science and Technology | VOL. 33

A library’s information retrieval system (In)effectiveness: case study
Robert Marijan ... Robert Leskovar
Library Hi Tech | VOL. 33
Robert Marijan, et. al.Robert Marijan ... Robert Leskovar
21 Sep 2015
Library Hi Tech | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Bayesian Framework for XML Information Retrieval: Searching and Learning with the INEX Collection

Abstract

Talk to us

Similar Papers

More From: Information Retrieval