Evaluation Of Information Retrieval Systems Research Articles

PurposeThe subject of this paper is the idea of Brain–Computer Interface (BCI). The main goal is to assess the potential impact of BCI on the design, use and evaluation of information retrieval systems operating in libraries.Design/methodology/approachThe method of literature review was used to establish the state of research. The search according to accepted queries was carried out in the Scopus database and complementary in Google Scholar. To determine the state of research on BCI on the basis of library and information science, a specialist LISTA abstract database was also searched. The most current papers published in the years 2015–2019 in the English language or having at least an abstract in this language were taken into account.FindingsThe analysis showed that BCI issues are extremely popular in subject literature from various fields, mainly computer science, but practically does not occur in the context of using this technology in information retrieval systems.Research limitations/implicationsDue to the fact that BCI solutions are not yet implemented in libraries and are rarely the subject of scientific considerations in the field of library and information science, this article is mainly based on literature from other disciplines. The goal was to consider how much BCI solutions can affect library information retrieval systems. The considerations presented in this article are theoretical in nature due to the lack of empirical materials on which to base. The author's assumption was to initiate a discussion about BCI on the basis of library and information science, not to propose final solutions.Practical implicationsThe results can be widely used in practice as a framework for the implementation of BCI in libraries.Social implicationsThe article can help to facilitate the debate on the role of implementing new technologies in libraries.Originality/valueThe problem of BCI is very rarely addressed in the subject literature in the field of library and information science.

Read full abstract

While test collections provide the cornerstone for Cranfield-based evaluation of information retrieval (IR) systems, it has become practically infeasible to rely on traditional pooling techniques to construct test collections at the scale of today’s massive document collections (e.g., ClueWeb12’s 700M+ Webpages). This has motivated a flurry of studies proposing more cost-effective yet reliable IR evaluation methods. In this paper, we propose a new intelligent topic selection method which reduces the number of search topics (and thereby costly human relevance judgments) needed for reliable IR evaluation. To rigorously assess our method, we integrate previously disparate lines of research on intelligent topic selection and deep vs. shallow judging (i.e., whether it is more cost-effective to collect many relevance judgments for a few topics or a few judgments for many topics). While prior work on intelligent topic selection has never been evaluated against shallow judging baselines, prior work on deep vs. shallow judging has largely argued for shallowed judging, but assuming random topic selection. We argue that for evaluating any topic selection method, ultimately one must ask whether it is actually useful to select topics, or should one simply perform shallow judging over many topics? In seeking a rigorous answer to this over-arching question, we conduct a comprehensive investigation over a set of relevant factors never previously studied together: 1) method of topic selection; 2) the effect of topic familiarity on human judging speed; and 3) how different topic generation processes (requiring varying human effort) impact (i) budget utilization and (ii) the resultant quality of judgments. Experiments on NIST TREC Robust 2003 and Robust 2004 test collections show that not only can we reliably evaluate IR systems with fewer topics, but also that: 1) when topics are intelligently selected, deep judging is often more cost-effective than shallow judging in evaluation reliability; and 2) topic familiarity and topic generation costs greatly impact the evaluation cost vs. reliability trade-off. Our findings challenge conventional wisdom in showing that deep judging is often preferable to shallow judging when topics are selected intelligently.

Read full abstract

Evaluation Of Information Retrieval Systems Research Articles

Related Topics

Articles published on Evaluation Of Information Retrieval Systems

Relevance feedback for building pooled test collections

Big Data Oriented Soft-Technologies and ICT Management

Report on the 6th International and Interdisciplinary Perspectives on Children & Recommender and Information Retrieval Systems (KidRec 2022) Workshop at ACM IDC 2022

Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning

On the effect of relevance scales in crowdsourcing relevance assessments for Information Retrieval evaluation

Is the Reign of Interactive Search Eternal? Findings from the Video Browser Showdown 2020

Brain–computer interface in the context of information retrieval systems in a library

A Dialectical Approach to Search Engine Evaluation

Three approaches to measuring recall on the Web: a systematic review

Exploring Topic Difficulty in Information Retrieval Systems Evaluation

Professor Pia Borlund

The Impact of Task Abandonment in Crowdsourcing

QUALITY OF CROWDSOURCED RELEVANCE JUDGMENTS IN ASSOCIATION WITH LOGICAL REASONING ABILITY

Reproduce and Improve

Reproduce. Generalize. Extend. On Information Retrieval Evaluation without Relevance Judgments

Ranking Retrieval Systems with Partial Relevance Judgements

Intelligent topic selection for low-cost information retrieval evaluation: A New perspective on deep vs. shallow judging

Using Replicates in Information Retrieval Evaluation.

情報検索システムの評価 : テストコレクションを中心に( 図書館・情報活動と )

Multi-armed bandits for adjudicating documents in pooling-based evaluation of information retrieval systems

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Evaluation Of Information Retrieval Systems Research Articles

Related Topics

Articles published on Evaluation Of Information Retrieval Systems

Relevance feedback for building pooled test collections

Big Data Oriented Soft-Technologies and ICT Management

Report on the 6th International and Interdisciplinary Perspectives on Children &amp; Recommender and Information Retrieval Systems (KidRec 2022) Workshop at ACM IDC 2022

Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning

On the effect of relevance scales in crowdsourcing relevance assessments for Information Retrieval evaluation

Is the Reign of Interactive Search Eternal? Findings from the Video Browser Showdown 2020

Brain–computer interface in the context of information retrieval systems in a library

A Dialectical Approach to Search Engine Evaluation

Three approaches to measuring recall on the Web: a systematic review

Exploring Topic Difficulty in Information Retrieval Systems Evaluation

Professor Pia Borlund

The Impact of Task Abandonment in Crowdsourcing

QUALITY OF CROWDSOURCED RELEVANCE JUDGMENTS IN ASSOCIATION WITH LOGICAL REASONING ABILITY

Reproduce and Improve

Reproduce. Generalize. Extend. On Information Retrieval Evaluation without Relevance Judgments

Ranking Retrieval Systems with Partial Relevance Judgements

Intelligent topic selection for low-cost information retrieval evaluation: A New perspective on deep vs. shallow judging

Using Replicates in Information Retrieval Evaluation.

情報検索システムの評価 : テストコレクションを中心に( 図書館・情報活動と )

Multi-armed bandits for adjudicating documents in pooling-based evaluation of information retrieval systems

Report on the 6th International and Interdisciplinary Perspectives on Children & Recommender and Information Retrieval Systems (KidRec 2022) Workshop at ACM IDC 2022