Concept-based Query Research Articles

Background: Personalised medicine provides patients with treatments that are specific to their genetic profiles. It requires efficient sharing of disparate data types across a variety of scientific disciplines, such as molecular biology, pathology, radiology and clinical practice. Personalised medicine aims to offer the safest and most effective therapeutic strategy based on the gene variations of each subject. This is particularly relevant in oncology, where knowledge about genetic mutations has already led to new therapies. Current molecular biology techniques (microarrays, proteomics, epigenetic technology and improved DNA sequencing technology) enable better characterisation of cancer tumours. However, the vast amounts of data, coupled with the use of different terms in each discipline (semantic heterogeneity), make the retrieval and integration of information difficult.

Results: Existing software infrastructures for data sharing in the cancer domain, such as caGrid, support access to distributed information. caGrid follows a service-oriented, model-driven architecture. Each data source in caGrid is associated with metadata at increasing levels of abstraction, including syntactic, structural, reference and domain metadata. The domain metadata consists of ontology-based annotations associated with the structural information of each data source. However, caGrid's current querying functionality is provided at the structural metadata level, without capitalising on the ontology-based annotations. This paper presents the design of and theoretical foundations for distributed ontology-based queries over cancer research data. Concept-based queries are reformulated to the target query language, where join conditions between multiple data sources are found by exploiting the semantic annotations. The system has been implemented, as a proof of concept, over the caGrid infrastructure. The approach is applicable to other model-driven architectures. A graphical user interface has been developed, supporting ontology-based queries over caGrid data sources. An extensive evaluation of the query reformulation technique is included.

Conclusions: To support personalised medicine in oncology, it is crucial to retrieve and integrate molecular, pathology, radiology and clinical data in an efficient manner. The semantic heterogeneity of the data makes this a challenging task. Ontologies provide a formal framework to support querying and integration. This paper provides an ontology-based solution for querying distributed databases over service-oriented, model-driven infrastructures.
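The idea of finding join conditions from semantic annotations can be illustrated with a minimal sketch. It assumes that each attribute of each data source carries one ontology concept, and that two attributes from different sources are join candidates when they share a concept. The service names, attribute names and concept codes below are hypothetical, and the code does not use the caGrid API or reproduce the paper's reformulation algorithm.

```python
from itertools import combinations

# Hypothetical annotations: (data source, attribute) -> ontology concept code.
# None of these names come from caGrid or from a real ontology.
ANNOTATIONS = {
    ("PathologyService", "Specimen.patientId"): "C:PatientIdentifier",
    ("PathologyService", "Specimen.tumourGrade"): "C:TumourGrade",
    ("ImagingService", "Study.subjectId"): "C:PatientIdentifier",
    ("ImagingService", "Study.modality"): "C:ImagingModality",
}


def find_join_conditions(annotations):
    """Return (concept, left, right) triples for attributes of different
    sources that are annotated with the same concept."""
    # Group attributes by the concept they are annotated with.
    by_concept = {}
    for (source, attribute), concept in annotations.items():
        by_concept.setdefault(concept, []).append((source, attribute))

    joins = []
    for concept, attrs in sorted(by_concept.items()):
        # Every pair of attributes sharing a concept is a join candidate,
        # provided the two attributes live in different data sources.
        for (s1, a1), (s2, a2) in combinations(sorted(attrs), 2):
            if s1 != s2:
                joins.append((concept, f"{s1}.{a1}", f"{s2}.{a2}"))
    return joins


if __name__ == "__main__":
    for concept, left, right in find_join_conditions(ANNOTATIONS):
        print(f"{left} = {right}  (shared concept: {concept})")
```

In this toy setting only the two patient identifier attributes share a concept, so they form the single candidate join between the two sources. A real system would also have to reason over the ontology hierarchy rather than rely on exact concept equality.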

In this thesis we investigate the possibility of integrating domain-specific knowledge into biomedical information retrieval (IR). Recent decades have shown a fast-growing interest in biomedical research, reflected by an exponential growth in scientific literature. An important problem for biomedical IR is dealing with the complex and inconsistent terminology encountered in biomedical publications. Dealing with the terminology problem requires domain knowledge stored in terminological resources: controlled indexing vocabularies and thesauri. The integration of this knowledge is, however, far from trivial.

The first research theme investigates heuristics for obtaining word-based representations from biomedical text for robust retrieval. We investigated the effect of choices in document preprocessing heuristics on retrieval effectiveness. Document preprocessing heuristics such as stop word removal, stemming, and breakpoint identification and normalization were shown to strongly affect retrieval performance. An effective combination of heuristics was identified to obtain a word-based representation from text for the remainder of this thesis.

The second research theme deals with concept-based retrieval. We compared a word-based to a concept-based representation and determined to what extent a manual concept-based representation can be automatically obtained from text. Retrieval based on concepts alone was demonstrated to be significantly less effective than word-based retrieval. This deteriorated performance could be explained by errors in the classification process, limitations of the concept vocabularies and limited exhaustiveness of the concept-based document representations. Retrieval based on a combination of word-based and automatically obtained concept-based query representations did significantly improve on word-only retrieval.

In the third and last research theme we propose a cross-lingual framework for monolingual biomedical IR. In this framework, the integration of a concept-based representation is viewed as a cross-lingual matching problem involving a word-based and a concept-based representation language. This framework gives us the opportunity to adopt a large set of established cross-lingual information retrieval (CLIR) methods and techniques for this domain. Experiments with basic term-to-term translation models demonstrate that this approach can significantly improve word-based retrieval. Directions for future work are using these concepts for communication between user and retrieval system, extending the translation models and extending CLIR-enhanced concept-based retrieval outside the biomedical domain. Available online from http://purl.utwente.nl/publications/72481.
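Treating concepts as a second "language" can be illustrated with a toy term-to-term translation sketch. It assumes a table of translation probabilities from query words to concepts and combines word matching with translated concept matching by simple linear interpolation. The table, the concept identifiers, the `mix` parameter and the scoring function are all hypothetical simplifications, not the models evaluated in the thesis.

```python
# Hypothetical translation table: P(concept | word). Values are made up.
TRANSLATION = {
    "tumour": {"C_NEOPLASM": 0.8, "C_SWELLING": 0.2},
    "breast": {"C_BREAST": 1.0},
}


def translate_query(words, table):
    """Translate query words into a weighted concept representation."""
    weights = {}
    for word in words:
        # Words without an entry in the table contribute no concepts.
        for concept, prob in table.get(word, {}).items():
            weights[concept] = weights.get(concept, 0.0) + prob
    return weights


def score(doc_words, doc_concepts, query_words, table, mix=0.5):
    """Interpolate a word-match score with a translated concept-match score.

    mix = 0 gives word-only retrieval, mix = 1 gives concept-only retrieval.
    """
    word_score = sum(1.0 for w in query_words if w in doc_words)
    concept_query = translate_query(query_words, table)
    concept_score = sum(
        weight for concept, weight in concept_query.items()
        if concept in doc_concepts
    )
    return (1 - mix) * word_score + mix * concept_score


if __name__ == "__main__":
    doc_words = {"breast", "cancer"}
    doc_concepts = {"C_NEOPLASM", "C_BREAST"}
    print(score(doc_words, doc_concepts, ["tumour", "breast"], TRANSLATION))
```

Here the document never mentions the word "tumour", yet it still receives credit for that query term through the concept it was indexed with, which is the kind of vocabulary mismatch that the combined representation is meant to bridge.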

Articles published on Concept-based Query

Multimodal query-level fusion for efficient multimedia information retrieval

An intelligent multimedia information system for multimodal content extraction and querying

A framework of query expansion for image retrieval based on knowledge base and concept similarity

A Framework for Ontology Development of Information and Communication Technology Experts Using Thesaurus, Association for Computing Machinery Taxonomy and Domain Experts Approaches

Concept Tree Based Information Retrieval Model

Time-Aware Latent Concept Expansion for Microblog Search

Adaptive diversification for tag-based social image retrieval

Web Search using Improved Concept Based Query Refinement

Improved concept-based query expansion using Wikipedia

Biotea: RDFizing PubMed Central in support for the paper as an interface to the Web of Data.

Leveraging visual concepts and query performance prediction for semantic-theme-based video retrieval

Balancing the Trade-Offs Between Diversity and Precision for Web Image Search Using Concept-Based Query Expansion

Federated ontology-based queries over cancer data

Concept-based query language approach to enterprise information systems

Proof of concept

Concept-based query expansion for retrieving gene related publications from MEDLINE

Issues in the Design of a Pilot Concept-Based Query Interface for the Neuroinformatics Information Framework

The Neuroscience Information Framework: A Data and Knowledge Environment for Neuroscience

An effective indexing model to manage versioned objects in a digital library

An intelligent approach to handling imperfect information in concept-based natural language queries
