Difficult Queries Research Articles

Recent developments have shown that entity-based models that rely on information from the knowledge graph can improve document retrieval performance. However, given the non-transitive nature of relatedness between entities on the knowledge graph, the use of semantic relatedness measures can lead to topic drift. To address this issue, we propose a relevance-based model for entity selection based on pseudo-relevance feedback, which is then used to systematically expand the input query leading to improved retrieval performance. We perform our experiments on the widely used TREC Web corpora and empirically show that our proposed approach to entity selection significantly improves ad hoc document retrieval compared to strong baselines. More concretely, the contributions of this work are as follows: (1) We introduce a graphical probability model that captures dependencies between entities within the query and documents. (2) We propose an unsupervised entity selection method based on the graphical model for query entity expansion and then for ad hoc retrieval. (3) We thoroughly evaluate our method and compare it with the state-of-the-art keyword and entity based retrieval methods. We demonstrate that the proposed retrieval model shows improved performance over all the other baselines on ClueWeb09B and ClueWeb12B, two widely used Web corpora, on the NDCG@20, and ERR@20 metrics. We also show that the proposed method is most effective on the difficult queries. In addition, We compare our proposed entity selection with a state-of-the-art entity selection technique within the context of ad hoc retrieval using a basic query expansion method and illustrate that it provides more effective retrieval for all expansion weights and different number of expansion entities.

PurposeThis paper aims to investigate how readers assess relevance of retrieved documents in a foreign language they know well compared with their native language, and whether work‐task scenario descriptions have effect on the assessment process.Design/methodology/approachQueries, test collections, and relevance assessments were used from the 2002 Interactive CLEF. Swedish first‐language speakers, fluent in English, were given simulated information‐seeking scenarios and presented with retrieval results in both languages. Twenty‐eight subjects in four groups were asked to rate the retrieved text documents by relevance. A two‐level work‐task scenario description framework was developed and applied to facilitate the study of context effects on the assessment process.FindingsRelevance assessment takes longer in a foreign language than in the user first language. The quality of assessments by comparison with pre‐assessed results is inferior to those made in the users' first language. Work‐task scenario descriptions had an effect on the assessment process, both by measured access time and by self‐report by subjects. However, effects on results by traditional relevance ranking were detectable. This may be an argument for extending the traditional IR experimental topical relevance measures to cater for context effects.Originality/valueAn extended two‐level work‐task scenario description framework was developed and applied. Contextual aspects had an effect on the relevance assessment process. English texts took longer to assess than Swedish and were assessed less well, especially for the most difficult queries. The IR research field needs to close this gap and to design information access systems with users' language competence in mind.

Difficult Queries Research Articles

Related Topics

Articles published on Difficult Queries

SeeSaw: Interactive Ad-hoc Search Over Image Databases

GuP: Fast Subgraph Matching by Guard-based Pruning

An Insight to Automation & Digitalization of Retail Stores

Genomic and Proteomic Semantic Annotations Integrating Cross Ontology

Relevance-based entity selection for ad hoc retrieval

Overcoming low-utility facets for complex answer retrieval

Indiscriminabilité dans les espaces de représentation des termes et des documents

Using knowledge-based relatedness for information retrieval

A multiple relevance feedback strategy with positive and negative models.

Latent word context model for information retrieval

Nauki społeczne w programie badań Dalekiej Północy

Opinionated document retrieval using subjective triggers

A Discriminative Kernel-Based Approach to Rank Images from Text Queries

SIGIR workshop report

Effects of foreign language and task scenario on relevance assessment

Toward a unified retrieval outcome analysis framework for cross‐language information retrieval

Enhancing a new design for subject access to online catalogs

An improved division operator for relational algebra

Human factors comparison of a procedural and a nonprocedural query language

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Difficult Queries Research Articles

Related Topics

Articles published on Difficult Queries

SeeSaw: Interactive Ad-hoc Search Over Image Databases

GuP: Fast Subgraph Matching by Guard-based Pruning

An Insight to Automation &amp; Digitalization of Retail Stores

Genomic and Proteomic Semantic Annotations Integrating Cross Ontology

Relevance-based entity selection for ad hoc retrieval

Overcoming low-utility facets for complex answer retrieval

Indiscriminabilité dans les espaces de représentation des termes et des documents

Using knowledge-based relatedness for information retrieval

A multiple relevance feedback strategy with positive and negative models.

Latent word context model for information retrieval

Nauki społeczne w programie badań Dalekiej Północy

Opinionated document retrieval using subjective triggers

A Discriminative Kernel-Based Approach to Rank Images from Text Queries

SIGIR workshop report

Effects of foreign language and task scenario on relevance assessment

Toward a unified retrieval outcome analysis framework for cross‐language information retrieval

Enhancing a new design for subject access to online catalogs

An improved division operator for relational algebra

Human factors comparison of a procedural and a nonprocedural query language

An Insight to Automation & Digitalization of Retail Stores