Query Expansion Terms Research Articles

The rapid growth of contents on the Web in different languages increases the demand of Cross-Lingual Information Retrieval (CLIR). The accuracy of result suffers due to many problems such as ambiguity and drift issue in query. Query Expansion (QE) offers reliable solution for obtaining suitable documents for user queries. In this paper, we proposed an architecture for Hindi–English CLIR system using QE for improving the relevancy of retrieved results. In this architecture, for the addition of term(s) at appropriate position(s), we proposed a location-based algorithm to resolve the drift query issue in QE. User queries in Hindi language have been translated into document language (i.e. English) and the accuracy of translation is improved using Back-Translation. Google search has been performed and the retrieved documents are ranked using Okapi BM25 to arrange the documents in the order of decreasing relevancy to select the most suitable terms for QE. We used term selection value (TSV) for QE and for retrieving the terms, we created three test collections namely the (i) description and narration of the Forum for Information Retrieval Evaluation (FIRE) dataset, (ii) Snippets of retrieved documents against each query and (iii) Nearest-Neighborhood (NN) words against each query word among the ranked documents. To evaluate the system, 50 queries of Hindi language are selected from the FIRE-2012 dataset. In this paper, we performed two experiments: (i) impact of the proposed location-based algorithm on the proposed architecture of CLIR; and (ii) analysis of QE using three datasets, i.e. FIRE, NN and Snippets. In the first case, result shows that the relevancy of Hindi–English CLIR is improved by performing QE using the location-based algorithm and a 12% of improvement is achieved as compared to the results of QE obtained without applying the location-based algorithm. In the second case, the location-based algorithm is applied on three datasets. The Mean Average Precision (MAP) values of retrieved documents after QE are 0.5379 (NN), 0.6018 (FIRE) and 0.6406 (Snippets) for the three test collections, whereas the MAP before QE is 0.37102. This clearly shows the significant improvement of retrieved results for all three test collections. Among the three test collections, QE has been found most effective along with Snippets as indicated by the results with the improvements of 6.48% and 19.12% over FIRE and NN test collections, respectively.

Objective:With the onset of the Coronavirus Disease 2019 (COVID-19) pandemic, there has been a surge in the number of publicly available biomedical information sources, which makes it an increasingly challenging research goal to retrieve a relevant text to a topic of interest. In this paper, we propose a Contextual Query Expansion framework based on the clinical Domain knowledge (CQED) for formalizing an effective search over PubMed to retrieve relevant COVID-19 scholarly articles to a given information need. Materials and Methods:For the sake of training and evaluation, we use the widely adopted TREC-COVID benchmark. Given a query, the proposed framework utilizes a contextual and a domain-specific neural language model to generate a set of candidate query expansion terms that enrich the original query. Moreover, the framework includes a multi-head attention mechanism that is trained alongside a learning-to-rank model for re-ranking the list of generated expansion candidate terms. The original query and the top-ranked expansion terms are posed to the PubMed search engine for retrieving relevant scholarly articles to an information need. The framework, CQED, can have four different variations, depending upon the learning path adopted for training and re-ranking the candidate expansion terms. Results:The model drastically improves the search performance, when compared to the original query. The performance improvement in comparison to the original query, in terms of RECALL@1000 is 190.85% and in terms of NDCG@1000 is 343.55%. Additionally, the model outperforms all existing state-of-the-art baselines. In terms of P@10, the model that has been optimized based on Precision outperforms all baselines (0.7987). On the other hand, in terms of NDCG@10 (0.7986), MAP (0.3450) and bpref (0.4900), the CQED model that has been optimized based on an average of all retrieval measures outperforms all the baselines. Conclusion:The proposed model successfully expands queries posed to PubMed, and improves search performance, as compared to all existing baselines. A success/failure analysis shows that the model improved the search performance of each of the evaluated queries. Moreover, an ablation study depicted that if ranking of generated candidate terms is not conducted, the overall performance decreases. For future work, we would like to explore the application of the presented query expansion framework in conducting technology-assisted Systematic Literature Reviews (SLR).

Query Expansion Terms Research Articles

Related Topics

Articles published on Query Expansion Terms

A Study of Word Bigrams for Pseudo-relevance Feedback in Information Retrieval

Query Expansion Using Proposed Location-Based Algorithm for Hindi–English CLIR: Analyzing Three Test Collections

Query Context Expansion for Open-Domain Question Answering

SPRF: A semantic Pseudo-relevance Feedback enhancement for information retrieval via ConceptNet

Learning to rank query expansion terms for COVID-19 scholarly search

Optimal Query Expansion Based on Hybrid Group Mean Enhanced Chimp Optimization Using Iterative Deep Learning

Document Representation and Query Expansion Models for Blog Recommendation

Health Query Expansion based on Graph Matching between DBpedia and UMLS

Research on Query Term Expansion based on RankSVM and LDA Model

A contemporary combined approach for query expansion

A hybrid semantic query expansion approach for Arabic information retrieval

Document-based and term-based linear methods for pseudo-relevance feedback

QER: a new feature selection method for sentiment analysis

Rank fusion and semantic genetic notion based automatic query expansion model

Lexical Co-Occurrence and Contextual Window-Based Approach with Semantic Similarity for Query Expansion

An Optimal Ranking Approach for Cluster based of Clicked URLsusing Firefly Algorithm for Efficient Personalized Web Search

Leveraging semantic resources in diversified query expansion

Ranks Aggregation and Semantic Genetic Approach based Hybrid Model for Query Expansion

Term co-occurrence and context window-based combined approach for query expansion with the semantic notion of terms

Relevance Feedback-based Query Expansion Model using Ranks Combining and Word2Vec Approach

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Query Expansion Terms Research Articles

Related Topics

Articles published on Query Expansion Terms

A Study of Word Bigrams for Pseudo-relevance Feedback in Information Retrieval

Query Expansion Using Proposed Location-Based Algorithm for Hindi–English CLIR: Analyzing Three Test Collections

Query Context Expansion for Open-Domain Question Answering

SPRF: A semantic Pseudo-relevance Feedback enhancement for information retrieval via ConceptNet

Learning to rank query expansion terms for COVID-19 scholarly search

Optimal Query Expansion Based on Hybrid Group Mean Enhanced Chimp Optimization Using Iterative Deep Learning

Document Representation and Query Expansion Models for Blog Recommendation

Health Query Expansion based on Graph Matching between DBpedia and UMLS

Research on Query Term Expansion based on RankSVM and LDA Model

A contemporary combined approach for query expansion

A hybrid semantic query expansion approach for Arabic information retrieval

Document-based and term-based linear methods for pseudo-relevance feedback

QER: a new feature selection method for sentiment analysis

Rank fusion and semantic genetic notion based automatic query expansion model

Lexical Co-Occurrence and Contextual Window-Based Approach with Semantic Similarity for Query Expansion

An Optimal Ranking Approach for Cluster based of Clicked URLsusing Firefly Algorithm for Efficient Personalized Web Search

Leveraging semantic resources in diversified query expansion

Ranks Aggregation and Semantic Genetic Approach based Hybrid Model for Query Expansion

Term co-occurrence and context window-based combined approach for query expansion with the semantic notion of terms

Relevance Feedback-based Query Expansion Model using Ranks Combining and Word2Vec Approach