Purpose – The purpose of this paper is to evaluate the effectiveness of the information retrieval component of a daily newspaper publisher's integrated library system (ILS) in comparison with open source alternatives, and to observe the impact of the scale of metadata, generated daily by library administrators, on retrieved result sets.

Design/methodology/approach – In Experiment 1, the authors compared the result sets of the information retrieval system (IRS) component of the publisher's current ILS and the result sets of the proposed ones against a human-assessed relevance judgment set. In Experiment 2, the authors compared the performance of the proposed IRS components with the publisher's current production IRS, using result sets of the current IRS classified as relevant. Both experiments were conducted using standard information retrieval (IR) evaluation methods: precision, recall, precision at k, F-measure, mean average precision and 11-point interpolated average precision.

Findings – The results showed that: first, in Experiment 1, the publisher's current production ILS ranked last among all participating IRSs when compared with a relevant document set classified by the senior library administrator; and second, in Experiment 2, the tested IR components' request handlers that used only automatically generated metadata performed slightly better than request handlers that used all of the metadata fields. Therefore, regarding the effectiveness of IR, the daily human effort of generating the publisher's current set of metadata attributes is unjustified.

Research limitations/implications – The experiments' collections contained documents in Slovene, a language with a large number of variations in the forms of nouns, verbs and adjectives. The results could differ for collections in languages with different grammatical properties.

Practical implications – The authors have confirmed, using standard IR methods, that the IR component used in the publisher's current ILS could be adequately replaced with an open source component. Based on the research, the publisher could incorporate the suggested open source IR components in practice. The authors have also described methods that libraries can use to evaluate the effectiveness of the IR of their ILSs.

Originality/value – The paper provides a framework for libraries to evaluate an ILS's IR effectiveness. Based on the evaluation results, libraries could replace their IR components if their current information system setup allows it.
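For reference, a sketch of the standard definitions of the evaluation measures named in the methodology (these are the conventional IR formulas; the notation below is illustrative and not quoted from the paper):

```latex
% Conventional IR evaluation measures (illustrative notation, not taken
% from the paper). Rel = relevant documents, Ret = retrieved documents,
% Ret_{1..k} = top-k retrieved documents, Q = set of test queries,
% rel_q(k) = 1 if the document at rank k is relevant for query q, else 0.
\[
\mathrm{Precision} = \frac{|\mathrm{Rel} \cap \mathrm{Ret}|}{|\mathrm{Ret}|},
\qquad
\mathrm{Recall} = \frac{|\mathrm{Rel} \cap \mathrm{Ret}|}{|\mathrm{Rel}|},
\qquad
F = \frac{2 \cdot \mathrm{Precision} \cdot \mathrm{Recall}}
         {\mathrm{Precision} + \mathrm{Recall}}
\]
\[
P@k = \frac{|\mathrm{Rel} \cap \mathrm{Ret}_{1..k}|}{k},
\qquad
\mathrm{MAP} = \frac{1}{|Q|} \sum_{q \in Q}
\frac{1}{|\mathrm{Rel}_q|} \sum_{k=1}^{n_q} P@k \cdot \mathrm{rel}_q(k)
\]
% 11-point interpolated average precision: interpolated precision is
% averaged over the recall levels 0.0, 0.1, ..., 1.0.
\[
\mathrm{AP}_{11} = \frac{1}{11} \sum_{r \in \{0,\, 0.1,\, \ldots,\, 1.0\}}
p_{\mathrm{interp}}(r),
\qquad
p_{\mathrm{interp}}(r) = \max_{r' \ge r} p(r')
\]
```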