Retrieval Status Value Research Articles

Annotations are a means to make critical remarks, to explain and comment things, to add notes and give opinions, and to relate objects. Nowadays, they can be found in digital libraries and collaboratories, for example as a building block for scientific discussion on the one hand or as private notes on the other. We further find them in product reviews, scientific databases and many Web 2.0 applications; even well-established concepts like emails can be regarded as annotations in a certain sense. Digital annotations can be (textual) comments, markings (i.e. highlighted parts) and references to other documents or document parts. Since annotations convey information which is potentially important to satisfy a user's information need, this thesis tries to answer the question of how to exploit annotations for information retrieval. It gives a first answer to the question if retrieval effectiveness can be improved with annotations. A survey of the universe reveals some facets of annotations; for example, they can be content level annotations (extending the content of the annotation object) or meta level ones (saying something about the annotated object). Besides the annotations themselves, other objects created during the process of annotation can be interesting for retrieval, these being the annotated fragments. These objects are integrated into an object-oriented model comprising digital objects such as structured documents and annotations as well as fragments. In this model, the different relationships among the various objects are reflected. From this model, the basic data structure for annotation-based retrieval, the structured annotation hypertext, is derived. In order to thoroughly exploit the information contained in structured annotation hypertexts, a probabilistic, object-oriented logical framework called POLAR is introduced. In POLAR, structured annotation hypertexts can be modelled by means of probabilistic propositions and four-valued logics. POLAR allows for specifying several relationships among annotations and annotated (sub)parts or fragments. Queries can be posed to extract the knowledge contained in structured annotation hypertexts. POLAR supports annotation-based retrieval, i.e. document and discussion search, by applying an augmentation strategy (knowledge augmentation, propagating propositions from subcontexts like annotations, or relevance augmentation, where retrieval status values are propagated) in conjunction with probabilistic inference, where P(d -> q), the probability that a document d implies a query q, is estimated. POLAR's semantics is based on possible worlds and accessibility relations. It is implemented on top of four-valued probabilistic Datalog. POLAR's core retrieval functionality, knowledge augmentation with probabilistic inference, is evaluated for discussion and document search. The experiments show that all relevant POLAR objects, merged annotation targets, fragments and content annotations, are able to increase retrieval effectiveness when used as a context for discussion or document search. Additional experiments reveal that we can determine the polarity of annotations with an accuracy of around 80%.

Read full abstract

BackgroundIn the context of the BioCreative competition, where training data were very sparse, we investigated two complementary tasks: 1) given a Swiss-Prot triplet, containing a protein, a GO (Gene Ontology) term and a relevant article, extraction of a short passage that justifies the GO category assignement; 2) given a Swiss-Prot pair, containing a protein and a relevant article, automatic assignement of a set of categories.MethodsSentence is the basic retrieval unit. Our classifier computes a distance between each sentence and the GO category provided with the Swiss-Prot entry. The Text Categorizer computes a distance between each GO term and the text of the article. Evaluations are reported both based on annotator judgements as established by the competition and based on mean average precision measures computed using a curated sample of Swiss-Prot.ResultsOur system achieved the best recall and precision combination both for passage retrieval and text categorization as evaluated by official evaluators. However, text categorization results were far below those in other data-poor text categorization experiments The top proposed term is relevant in less that 20% of cases, while categorization with other biomedical controlled vocabulary, such as the Medical Subject Headings, we achieved more than 90% precision. We also observe that the scoring methods used in our experiments, based on the retrieval status value of our engines, exhibits effective confidence estimation capabilities.ConclusionFrom a comparative perspective, the combination of retrieval and natural language processing methods we designed, achieved very competitive performances. Largely data-independent, our systems were no less effective that data-intensive approaches. These results suggests that the overall strategy could benefit a large class of information extraction tasks, especially when training data are missing. However, from a user perspective, results were disappointing. Further investigations are needed to design applicable end-user text mining tools for biologists.

Read full abstract

Retrieval Status Value Research Articles

Related Topics

Articles published on Retrieval Status Value

A Relative Information Gain-based Query Performance Prediction Framework with Generated Query Variants

Burst-aware data fusion for microblog search

A probabilistic framework for information modelling and retrieval based on user annotations on digital objects

A merging strategy proposal: The 2-step retrieval status value method

Data-poor categorization and passage retrieval for gene ontology annotation in Swiss-Prot.

An entropy‐based interpretation of retrieval status value‐based retrieval, and its application to the computation of term and query discrimination value

A model of fuzzy linguistic IRS based on multi-granular linguistic information

From Retrieval Status Values to Probabilities of Relevance for Advanced IR Applications

Information retrieval method from uncertain requests and its optical implementation

Combining the evidence of multiple query representations for information retrieval

The use of semantic links in hypertext information retrieval

The use of semantic links in hypertext information retrieval

Fuzzy information retrieval

A critical investigation of recall and precision as measures of retrieval system performance

Estimating effective display size in online retrieval systems

ABSTRACTS 25 (Chosen by Vo Raghavan from recent issues of journals in the retrieval area)

Requirements for query evaluation in weighted information retrieval

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Retrieval Status Value Research Articles

Related Topics

Articles published on Retrieval Status Value

A Relative Information Gain-based Query Performance Prediction Framework with Generated Query Variants

Burst-aware data fusion for microblog search

A probabilistic framework for information modelling and retrieval based on user annotations on digital objects

A merging strategy proposal: The 2-step retrieval status value method

Data-poor categorization and passage retrieval for gene ontology annotation in Swiss-Prot.

An entropy‐based interpretation of retrieval status value‐based retrieval, and its application to the computation of term and query discrimination value

A model of fuzzy linguistic IRS based on multi-granular linguistic information

From Retrieval Status Values to Probabilities of Relevance for Advanced IR Applications

Information retrieval method from uncertain requests and its optical implementation

Combining the evidence of multiple query representations for information retrieval

The use of semantic links in hypertext information retrieval

The use of semantic links in hypertext information retrieval

Fuzzy information retrieval

A critical investigation of recall and precision as measures of retrieval system performance

Estimating effective display size in online retrieval systems

ABSTRACTS 25 (Chosen by Vo Raghavan from recent issues of journals in the retrieval area)

Requirements for query evaluation in weighted information retrieval