Probabilistic Information Retrieval Research Articles

The maximum entropy principle may be applied to the design of probabilistic retrieval systems. When there are inconsistent expert judgments, the resulting optimization problem cannot be solved. The inconsistency of the expert judgments can be revealed by solving a linear programming formulation. In the case of inconsistent judgment, four plausible schemes are proposed in order to find revised judgments which are consistent with the true data structure but still reflect the original expert judgment. These schemes are the Interactive, Minimum Distance, Minimum Cross-Entropy, and Path methods. Background and Purpose of the Study The maximum entropy principle (MEP) based on Shannon’s measure (Shannon, 1928) has been used with great success in many areas. Cooper and Huizinga (1982) and Cooper (1983) have applied the MEP to the design of probabilistic information retrieval systems. Specifically, we consider a collection of documents which are categorized into Boolean components by attributes. The MEP estimates the probability of “relevance” of each Boolean component by integrating expert judgments about the “relevance” of attributes with the observed distribution of the Boolean components. The MEP retrieval system, in response to a user’s request, provides an ordering of the Boolean components using this estimated probability of “relevance.” Cooper (1983) has noted the potential of the MEP retrieval system as follows

Read full abstract

A component theory of information retrieval using single content terms as component for queries and documents was reviewed and experimented with. The theory has the advantages of being able to (1) bootstrap itself, that is, define initial term weights naturally based on the fact that items are self relevent; (2) make use of within-item term frequencies; (3) account for query-focused and document-focused indexing and retrieval strategies cooperatively; and (4) allow for component-specific feedback if such information is available. Retrieval results with four collections support the effectiveness of all the first three aspects, except for predictive retrieval. At the initial indexing stage, the retrieval theory performed much more consistantly across collections than croft's model and provided results comparable to Salton's tf*idf approach. An inverse collection term frequency (ICTF) formula was also tested that performed much better than the inverse document frequency (IDF). With full feedback retrospective retrieval, the component theory performed substantially better than Croft's, because of the highly specific nature of document-focused feedback. Repetitive retireval results with partial relevance feedback mirrored those for the retrospective. However, for the important case of predictive retrieval using residual ranking, results were not unequivocal.

Read full abstract

Probabilistic Information Retrieval Research Articles

Related Topics

Articles published on Probabilistic Information Retrieval

A risk minimization framework for information retrieval

Probabilistic information retrieval model for a dependency structured indexing system

An information retrieval model based on simple Bayesian networks

New informetric aspects of the Internet: some reflections - many problems

New informetric aspects of the Internet: some reflections - many problems

Indexing and retrieval of broadcast news

Probabilistic datalog: Implementing logical information retrieval for advanced applications

Testing the maximum entropy principle for information retrieval

Comparing Boolean and probabilistic information retrieval systems across queries and disciplines

Bayesian Belief Networks: Odds and Ends

A network approach to probabilistic information retrieval

TREC and TIPSTER experiments with inquery

Some inconsistencies and misidentified modeling assumptions in probabilistic information retrieval

Term dependence: Truncating the Bahadur Lazarsfeld expansion

Probabilistic information retrieval as a combination of abstraction, inductive learning, and probabilistic assumptions

Probabilistic Models in Information Retrieval

A study of probabilistic information retrieval systems in the case of inconsistent expert judgments

SAPHIRE—An information retrieval system featuring concept matching, automatic indexing, probabilistic retrieval, and hierarchical relationships

Experiments with a component theory of probabilistic information retrieval based on single terms as document components

A sensitivity analysis of a probabilistic information retrieval system

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Probabilistic Information Retrieval Research Articles

Related Topics

Articles published on Probabilistic Information Retrieval

A risk minimization framework for information retrieval

Probabilistic information retrieval model for a dependency structured indexing system

An information retrieval model based on simple Bayesian networks

New informetric aspects of the Internet: some reflections - many problems

New informetric aspects of the Internet: some reflections - many problems

Indexing and retrieval of broadcast news

Probabilistic datalog: Implementing logical information retrieval for advanced applications

Testing the maximum entropy principle for information retrieval

Comparing Boolean and probabilistic information retrieval systems across queries and disciplines

Bayesian Belief Networks: Odds and Ends

A network approach to probabilistic information retrieval

TREC and TIPSTER experiments with inquery

Some inconsistencies and misidentified modeling assumptions in probabilistic information retrieval

Term dependence: Truncating the Bahadur Lazarsfeld expansion

Probabilistic information retrieval as a combination of abstraction, inductive learning, and probabilistic assumptions

Probabilistic Models in Information Retrieval

A study of probabilistic information retrieval systems in the case of inconsistent expert judgments

SAPHIRE—An information retrieval system featuring concept matching, automatic indexing, probabilistic retrieval, and hierarchical relationships

Experiments with a component theory of probabilistic information retrieval based on single terms as document components

A sensitivity analysis of a probabilistic information retrieval system