Retrieval Platform Research Articles

Accurate data collection at the ground level is vital to the integrity of neuroimaging research. Similarly important is the ability to connect and curate data in order to make it meaningful and sharable with other investigators. Collecting data, especially with several different modalities, can be time consuming and expensive. These issues have driven the development of automated collection of neuroimaging and clinical assessment data within COINS (Collaborative Informatics and Neuroimaging Suite). COINS is an end-to-end data management system. It provides a comprehensive platform for data collection, management, secure storage, and flexible data retrieval (Bockholt et al., 2010; Scott et al., 2011). It was initially developed for the investigators at the Mind Research Network (MRN), but is now available to neuroimaging institutions worldwide. Self Assessment (SA) is an application embedded in the Assessment Manager (ASMT) tool in COINS. It is an innovative tool that allows participants to fill out assessments via the web-based Participant Portal. It eliminates the need for paper collection and data entry by allowing participants to submit their assessments directly to COINS. Instruments (surveys) are created through ASMT and include many unique question types and associated SA features that can be implemented to help the flow of assessment administration. SA provides an instrument queuing system with an easy-to-use drag and drop interface for research staff to set up participants' queues. After a queue has been created for the participant, they can access the Participant Portal via the internet to fill out their assessments. This allows them the flexibility to participate from home, a library, on site, etc. The collected data is stored in a PostgresSQL database at MRN. This data is only accessible by users that have explicit permission to access the data through their COINS user accounts and access to MRN network. This allows for high volume data collection and with minimal user access to PHI (protected health information). An added benefit to using COINS is the ability to collect, store and share imaging data and assessment data with no interaction with outside tools or programs. All study data collected (imaging and assessment) is stored and exported with a participant's unique subject identifier so there is no need to keep extra spreadsheets or databases to link and keep track of the data. Data is easily exported from COINS via the Query Builder and study portal tools, which allow fine grained selection of data to be exported into comma separated value file format for easy import into statistical programs. There is a great need for data collection tools that limit human intervention and error while at the same time providing users with intuitive design. COINS aims to be a leader in database solutions for research studies collecting data from several different modalities.

Read full abstract

Motivation and Objectives Biomedical professionals have at their disposal a huge amount of literature. But when they have a precise question, they often have to deal with too many documents to efficiently find the appropriate answers in a reasonable time. Faced to this literature overload, the need for automatic assistance has been largely pointed out, and PubMed is argued to be only the beginning on how scientists use the biomedical literature (Hunter and Cohen, 2006). Ontology-based search engines began to introduce semantics in search results. These systems still display documents, but the user visualizes clusters of PubMed results according to concepts which were extracted from the abstracts. GoPubMed (Doms and Schroeder, 2005) and EBIMed (Rebholz-Schuhmann et al, 2007) are popular examples of such ontology-based search engines in the biomedical domain. Question Answering (QA) systems are argued to be the next generation of semantic search engines (Wren, 2011). QA systems no more display documents but directly concepts which were extracted from the search results; these concepts are supposed to answer the user’s question formulated in natural language. EAGLi (Gobeill et al, 2009), our locally developed system, is an example of such QA search engines. Thus, both ontology-based and QA search engines, share the crucial task of efficiently extracting concepts from the result set, i.e. a set of documents. This task is sometimes called macro reading, in contrast with micro reading – or classification, categorization – which is a traditional Natural Language Processing task that aims at extracting concepts from a single document (Mitchell et al, 2009). This paper focuses on macro reading of MEDLINE abstracts. Several experiments have been reported to find the best way to extract ontology terms out of a single MEDLINE abstract, i.e. micro reading. In particular, (Trieschnigg et al, 2009) compared the performances of six classification systems for reproducing the manual Medical Subject Headings (MeSH) annotation of a MEDLINE abstract. The evaluated systems included two morphosyntactic classifiers (sometimes also called thesaurus-based), which aim at literally finding ontology terms in the abstract by alignment of words, and a machine learning (or supervised) classifier, which aims at inferring the annotation from a knowledge base containing already annotated abstracts. The authors concluded that the machine learning approach outperformed the morphosyntactic ones. But the macro reading task is fundamentally different, as we look for the best way to extract then combine ontology terms from a set of MEDLINE abstracts. The issue investigated in this paper is: to what extent the differences observed between two classifiers for a micro reading task are observed for a macro reading one? In particular, the redundancy hypothesis claims that the redundancy in large textual collections such as the Web or MEDLINE tends to smoothe performance differences across classifiers (Lin, 2007). To address this question, we compared a morphosyntactic and a machine learning classifiers for both tasks, focusing on the extraction of Gene Ontology (GO) terms, a controlled vocabulary for the characterization of proteins functions. The micro reading task consisted in extracting GO terms from a single MEDLINE abstract, as in the Trieschnigg et al’s work; the macro reading task consisted in extracting GO terms from a set of MEDLINE abstracts in order to answer to proteomics questions asked to the EAGLi QA system.

Read full abstract

Retrieval Platform Research Articles

Related Topics

Articles published on Retrieval Platform

A Compressed-Domain Image Filtering and Re-Ranking Approach for Multi-Agent Image Retrieval

Retrieval of functional TCRs from single antigen-specific T cells: Toward individualized TCR-engineered therapies

DNA methylation and affective disorder

An Intelligent Web Digital Image Metadata Service Platform for Social Curation Commerce Environment

An Online, Batch, and Real-time Retrieval Platform for Genomic Sequences and Annotation of Horticultural Plants

Automated collection of imaging and phenotypic data to centralized and distributed data repositories.

A Design of a Sci-Tech Information Retrieval Platform Based on Apache Solr and Web Mining

A Database for Mycobacterium Secretome Analysis: ‘MycoSec’ to Accelerate Global Health Research

Swarm Intelligence Based Optimization for Web Usage Mining in Recommender System

ATLAS Analysis Papers and Conference Notes

Answering Gene Ontology terms to proteomics questions by supervised macro reading in Medline

Text mining in livestock animal science: Introducing the potential of text mining to animal sciences 1

The age-phenome database

A dynamically semantic platform for efficient information retrieval in P2P networks

Kv2.1 Cell Surface Clusters are Insertion and Retrieval Platforms For Kv Channel Trafficking at the Plasma Membrane

Research on PostgreSQL-based TMX Storage and Implementation of Corpus Retrieval Platform

A Driving Behavior Retrieval Application for Vehicle Surveillance System

The Construction of English-Chinese Parallel Corpus of Medical Works Based on Self-Coded Python Programs

The eDAL Suite: Tools and Concepts for Primary Data Citation

The eDAL Suite: Tools and Concepts for Primary Data Citation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Retrieval Platform Research Articles

Related Topics

Articles published on Retrieval Platform

A Compressed-Domain Image Filtering and Re-Ranking Approach for Multi-Agent Image Retrieval

Retrieval of functional TCRs from single antigen-specific T cells: Toward individualized TCR-engineered therapies

DNA methylation and affective disorder

An Intelligent Web Digital Image Metadata Service Platform for Social Curation Commerce Environment

An Online, Batch, and Real-time Retrieval Platform for Genomic Sequences and Annotation of Horticultural Plants

Automated collection of imaging and phenotypic data to centralized and distributed data repositories.

A Design of a Sci-Tech Information Retrieval Platform Based on Apache Solr and Web Mining

A Database for Mycobacterium Secretome Analysis: ‘MycoSec’ to Accelerate Global Health Research

Swarm Intelligence Based Optimization for Web Usage Mining in Recommender System

ATLAS Analysis Papers and Conference Notes

Answering Gene Ontology terms to proteomics questions by supervised macro reading in Medline

Text mining in livestock animal science: Introducing the potential of text mining to animal sciences 1

The age-phenome database

A dynamically semantic platform for efficient information retrieval in P2P networks

Kv2.1 Cell Surface Clusters are Insertion and Retrieval Platforms For Kv Channel Trafficking at the Plasma Membrane

Research on PostgreSQL-based TMX Storage and Implementation of Corpus Retrieval Platform

A Driving Behavior Retrieval Application for Vehicle Surveillance System

The Construction of English-Chinese Parallel Corpus of Medical Works Based on Self-Coded Python Programs

The eDAL Suite: Tools and Concepts for Primary Data Citation

The eDAL Suite: Tools and Concepts for Primary Data Citation