Information Retrieval Model Research Articles

Recent developments of machine learning models, and in particular deep neural networks, have yielded significant improvements on several computer vision, natural language processing, and speech recognition tasks. Progress with information retrieval (IR) tasks has been slower, however, due to the lack of large-scale training data as well as neural network models specifically designed for effective information retrieval [9]. In this dissertation, we address these two issues by introducing task-specific neural network architectures for a set of IR tasks and proposing novel unsupervised or weakly supervised solutions for training the models. The proposed learning solutions do not require labeled training data. Instead, in our weak supervision approach, neural models are trained on a large set of noisy and biased training data obtained from external resources, existing models, or heuristics. We first introduce relevance-based embedding models [3] that learn distributed representations for words and queries. We show that the learned representations can be effectively employed for a set of IR tasks, including query expansion, pseudo-relevance feedback, and query classification [1, 2]. We further propose a standalone learning to rank model based on deep neural networks [5, 8]. Our model learns a sparse representation for queries and documents. This enables us to perform efficient retrieval by constructing an inverted index in the learned semantic space. Our model outperforms state-of-the-art retrieval models, while performing as efficiently as term matching retrieval models. We additionally propose a neural network framework for predicting the performance of a retrieval model for a given query [7]. Inspired by existing query performance prediction models, our framework integrates several information sources, such as retrieval score distribution and term distribution in the top retrieved documents. This leads to state-of-the-art results for the performance prediction task on various standard collections. We finally bridge the gap between retrieval and recommendation models, as the two key components in most information systems. Search and recommendation often share the same goal: helping people get the information they need at the right time. Therefore, joint modeling and optimization of search engines and recommender systems could potentially benefit both systems [4]. In more detail, we introduce a retrieval model that is trained using user-item interaction (e.g., recommendation data), with no need to query-document relevance information for training [6]. Our solutions and findings in this dissertation smooth the path towards learning efficient and effective models for various information retrieval and related tasks, especially when large-scale training data is not available.

Read full abstract

Information Retrieval (IR) concerns about the structure, analysis, organization, storage, and retrieval of information. Among different retrieval models proposed in the past decades, generative retrieval models, especially those under the statistical probabilistic framework, are one of the most popular techniques that have been widely applied to Information Retrieval problems. While they are famous for their well-grounded theory and good empirical performance in text retrieval, their applications in IR are often limited by their complexity and low extendability in the modeling of high-dimensional information. Recently, advances in deep learning techniques provide new opportunities for representation learning and generative models for information retrieval. In contrast to statistical models, neural models have much more flexibility because they model information and data correlation in latent spaces without explicitly relying on any prior knowledge. Previous studies on pattern recognition and natural language processing have shown that semantically meaningful representations of text, images, and many types of information can be acquired with neural models through supervised or unsupervised training. Nonetheless, the effectiveness of neural models for information retrieval is mostly unexplored. In this thesis, we study how to develop new generative models and representation learning frameworks with neural models for information retrieval. Specifically, our contributions include three main components: (1) Theoretical Analysis : We present the first theoretical analysis and adaptation of existing neural embedding models for ad-hoc retrieval tasks; (2) Design Practice : Based on our experience and knowledge, we show how to design an embedding-based neural generative model for practical information retrieval tasks such as personalized product search; And (3) Generic Framework : We further generalize our proposed neural generative framework for complicated heterogeneous information retrieval scenarios that concern text, images, knowledge entities, and their relationships. Empirical results show that the proposed neural generative framework can effectively learn information representations and construct retrieval models that outperform the state-of-the-art systems in a variety of IR tasks.

Read full abstract

Information Retrieval Model Research Articles

Related Topics

Articles published on Information Retrieval Model

An end-to-end pseudo relevance feedback framework for neural document retrieval

Neural models for information retrieval without labeled data

Neural generative models and representation learning for information retrieval

A Question-Answering System for Applicant Support Using Modern Messaging Apps

A topic‐based term frequency normalization framework to enhance probabilistic information retrieval

Adaptive and Optimization of Personalized Information Retrieval Model in Semantic Web

Trick Me If You Can: Human-in-the-Loop Generation of Adversarial Examples for Question Answering

A question-entailment approach to question answering

Smart System for the Retrieval of Digital Educational Content

Multilingual Information Access (MLIA) Tools on Google and WorldCat: Bi/Multilingual University Students’ Experience and Perceptions

Focal elements of neural information retrieval models. An outlook through a reproducibility study

An Efficient Information System for Providing Location Based Services in Network Environments

Bi lingual Information Retrieval System for English and Italian Tweets using Python

Design and Evaluation of a Contextual Model for Information Retrieval From Web-Scale Discovery Services to Improve Evidence-Based Practice by Health Care Practitioners: Mixed Methods Study.

Automatic recall of software lessons learned for software project managers

Passage-Based Text Summarization for Legal Information Retrieval

Data Augmentation Based on Adversarial Autoencoder Handling Imbalance for Learning to Rank

DeepTileBars: Visualizing Term Distribution for Neural Information Retrieval

Convergence of Learning Dynamics in Information Retrieval Games

A Deep Look into neural ranking models for information retrieval

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Information Retrieval Model Research Articles

Related Topics

Articles published on Information Retrieval Model

An end-to-end pseudo relevance feedback framework for neural document retrieval

Neural models for information retrieval without labeled data

Neural generative models and representation learning for information retrieval

A Question-Answering System for Applicant Support Using Modern Messaging Apps

A topic‐based term frequency normalization framework to enhance probabilistic information retrieval

Adaptive and Optimization of Personalized Information Retrieval Model in Semantic Web

Trick Me If You Can: Human-in-the-Loop Generation of Adversarial Examples for Question Answering

A question-entailment approach to question answering

Smart System for the Retrieval of Digital Educational Content

Multilingual Information Access (MLIA) Tools on Google and WorldCat: Bi/Multilingual University Students’ Experience and Perceptions

Focal elements of neural information retrieval models. An outlook through a reproducibility study

An Efficient Information System for Providing Location Based Services in Network Environments

Bi lingual Information Retrieval System for English and Italian Tweets using Python

Design and Evaluation of a Contextual Model for Information Retrieval From Web-Scale Discovery Services to Improve Evidence-Based Practice by Health Care Practitioners: Mixed Methods Study.

Automatic recall of software lessons learned for software project managers

Passage-Based Text Summarization for Legal Information Retrieval

Data Augmentation Based on Adversarial Autoencoder Handling Imbalance for Learning to Rank

DeepTileBars: Visualizing Term Distribution for Neural Information Retrieval

Convergence of Learning Dynamics in Information Retrieval Games

A Deep Look into neural ranking models for information retrieval