Abstract
One of the difficulties in using the current Boolean-based information retrieval systems is that it is hard for a user, especially a novice, to formulate an effective Boolean query. Query reformulation can be even more difficult and complex than formulation since the user can have difficulty in incorporating the new information gained from the previous search into his/her next query. In this research, query reformulation is viewed as a classification problem (i.e., classifying documents as either relevant or nonrelevant), and a new reformulation algorithm is proposed which builds a treestructured classifier (named the query tree) at each reformulation from a set of feedback documents retrieved from the previous search. The query tree can be easily transformed into a Boolean query. The query tree and two of the most important current query reformulation algorithms were compared on benchmark test sets (CACM. CISI, and MedJars). The query tree showed significant improvements over the current algorithms in most experiments. We attribute this improved performance to the ability of the query tree algorithm to select good search terms and to represent the relationships among search terms into a tree structure.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.