Document Representation Research Articles

Overview

84 Articles

Published in last 50 years

Articles published on Document Representation

82 Search results

Document representation in probabilistic models of information retrieval

AbstractProbabilistic models of retrieval have provided insights into the document retrieval process and contain the basis for very effective search strategies. A major limitation of these models is that they assume that documents are represented by binary index terms. In many cases the index terms will be assigned weights, such as within‐document frequency weights, which are derived from the content of the documents by the indexing process. These weights, which are referred to here as term significance weights, indicate the relative importance of the terms in individual documents. This article describes how retrieval models which use either independence or dependence assumptions can be extended to include document representatives containing term significance weights. Comparison with other research indicates that search strategies based on models modified in this way can further improve the effectiveness of document retrieval systems.

Read full abstract

Journal of the American Society for Information Science
Nov 1, 1981
W Bruce Croft

Cite

Save

Document representation models for retrieval systems

Document retrieval system models are presented. Measures to rank the closeness of documents to a query are given. Algorithms to calculate the measures for graph and partition models are provided.

Read full abstract