Abstract

SYNONYMY In most collections, the same concept may be referred to using different words. This issue, known as synonymy , has an impact on the recall of most information retrieval (IR) systems. For example, you would want a search for aircraft to match plane (but only for references to an airplane , not a woodworking plane), and for a search on thermodynamics to match references to heat in appropriate discussions. Users often attempt to address this problem themselves by manually refining a query, as was discussed in Section 1.4; in this chapter, we discuss ways in which a system can help with query refinement, either fully automatically or with the user in the loop. The methods for tackling this problem split into two major classes: global methods and local methods. Global methods are techniques for expanding or reformulating query terms independent of the query and results returned from it, so that changes in the query wording will cause the new query to match other semantically similar terms. Global methods include: Query expansion/reformulation with a thesaurus or WordNet (Section 9.2.2) Query expansion via automatic thesaurus generation (Section 9.2.3) Techniques like spelling correction (discussed in Chapter 3) Local methods adjust a query relative to the documents that initially appear to match the query. The basic methods here are: Relevance feedback (Section 9.1) Pseudorelevance feedback, also known as blind relevance feedback (Section 9.1.6) (Global) Indirect relevance feedback (Section 9.1.7)

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.