Abstract
The Information Retrieval (IR) problem can be described as the quest to find the set of relevant information objects corresponding to a given information need, which is represented by a request. IR begins with a user who wishes to satisfy an information need. The information need is typically formulated in the form of a request, denoted by q. The intention is that the request q be as good as possible description of the information need N. The information being queried is modelled as a set of information carriers, or documents. An often used premise in IR is the following: if a given document d is about the request q, then there is a high likelihood that d is relevant with respect to the associated information need. Thus, the IR problem is reduced to deciding the aboutness relation between documents and requests. Many IR models have been developed, and there is a wide variation in how they determine aboutness. A common approach is to equate aboutness with overlap, meaning if there is sufficient overlap between the document and the query, the document is deemed to be about the query. The degree of overlap determined by the retrieval model is usually expressed as a real number between zero (no overlap) and unity (complete overlap). In most models, “sufficient” overlap is not prescribed by the model but by experimentation within a given IR setting. Such experimentation may lead to the conclusion, say, that sufficient overlap is apparent when the overlap measure is greater than α, 0 < α ≤ 1. The value α varies from setting to setting.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.