Answering Task Research Articles

Introduction: Fine-grained, descriptive information on the habitats and reproductive conditions of plant species is crucial in forest restoration and rehabilitation efforts. Precise timing of fruit collection and knowledge of species' habitat preferences and reproductive status are necessary, especially for tropical plant species that have short-lived recalcitrant seeds and those that exhibit complex reproductive patterns, e.g., species with supra-annual mass flowering events that may occur at irregular intervals. Understanding plant regeneration when planning for effective reforestation can be aided by access to structured information, e.g., in knowledge bases, that spans years if not decades and covers a wide range of geographic locations. The content of such a resource can be enriched with literature-derived information on species' time-sensitive reproductive conditions and location-specific habitats.

Methods: We sought to develop unsupervised approaches to extract relationships pertaining to habitats and their locations, and to the reproductive conditions of plant species and corresponding temporal information. First, we handcrafted rules for a traditional rule-based pattern matching approach. We then developed a relation extraction approach building upon transformer models, i.e., the Text-to-Text Transfer Transformer (T5), casting the relation extraction problem as a question answering and natural language inference task. Finally, we proposed a novel unsupervised hybrid approach that combines our rule-based and transformer-based approaches.

Results: Evaluation of our hybrid approach on an annotated corpus of biodiversity-focused documents demonstrated an improvement of up to 15 percentage points in recall and the best performance over solely rule-based and transformer-based methods, with F1-scores ranging from 89.61% to 96.75% for reproductive condition - temporal expression relations and from 85.39% to 89.90% for habitat - geographic location relations. Our work shows that, even without training models on any domain-specific labeled dataset, we are able to extract relationships between biodiversity concepts from literature with satisfactory performance.

Extractive methods for machine reading comprehension (MRC) tasks have achieved comparable or better accuracy than human performance on benchmark data sets. However, such models are not as successful when adapted to complex domains such as health care. One of the main reasons is that the context that the MRC model needs to process when operating in a complex domain can be much larger compared with an average open-domain context. This causes the MRC model to make less accurate and slower predictions. A potential solution to this problem is to reduce the input context of the MRC model by extracting only the necessary parts from the original context.

This study aims to develop a method for extracting useful contexts from long articles as an additional component to the question answering task, enabling the MRC model to work more efficiently and accurately.

Existing approaches to context extraction in MRC are based on sentence selection strategies, in which the models are trained to find the sentences containing the answer. We found that using only the sentences containing the answer was insufficient for the MRC model to predict correctly. We conducted a series of empirical studies and observed a strong relationship between the usefulness of the context and the confidence score output of the MRC model. Our investigation showed that a precise input context can boost the prediction correctness of the MRC model and greatly reduce inference time. We proposed a method to estimate the utility of each sentence in a context in answering the question and then extract a new, shorter context according to these estimations. We generated a data set to train 2 models for estimating sentence utility, based on which we selected more precise contexts that improved the MRC model's performance.

We demonstrated our approach on the Question Answering Data Set for COVID-19 and Biomedical Semantic Indexing and Question Answering data sets and showed that the approach benefits the downstream MRC model. First, the method reduced the inference time of the entire question answering system by 6 to 7 times. Second, our approach helped the MRC model predict the answer more correctly compared with using the original context (F1-score increased from 0.724 to 0.744 for the Question Answering Data Set for COVID-19 and from 0.651 to 0.704 for the Biomedical Semantic Indexing and Question Answering data set). We also found a potential problem where extractive transformer MRC models predict poorly despite being given a more precise context in some cases.

The proposed context extraction method allows the MRC model to achieve improved prediction correctness and a significantly reduced MRC inference time. This approach works technically with any MRC model and has potential in tasks involving processing long texts.
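The context extraction idea described in this abstract (score each sentence for its usefulness to the question, then keep a shorter context) can be illustrated with a minimal sketch. This is not the authors' method: the study trains models to estimate sentence utility, whereas the sketch below substitutes a simple token-overlap heuristic as a stand-in. All names (`split_sentences`, `overlap_utility`, `extract_context`, the `keep` parameter) are hypothetical and chosen only for illustration.

```python
import re


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokens(text):
    """Lowercased alphanumeric tokens as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap_utility(question, sentence):
    """Stand-in utility (an assumption, not the trained estimator from the
    study): fraction of question tokens that also appear in the sentence."""
    q = tokens(question)
    return len(q & tokens(sentence)) / len(q) if q else 0.0


def extract_context(question, context, keep=2, utility=overlap_utility):
    """Keep the `keep` highest-utility sentences, in their original order."""
    sentences = split_sentences(context)
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: utility(question, sentences[i]),
        reverse=True,
    )
    chosen = sorted(ranked[:keep])  # restore document order
    return " ".join(sentences[i] for i in chosen)


if __name__ == "__main__":
    ctx = (
        "Fever is a common symptom of COVID-19. "
        "The hospital opened in 1950. "
        "Cough is another common symptom. "
        "Parking is free on weekends."
    )
    question = "What are common symptoms of COVID-19?"
    # The shortened context would then be passed to an MRC model
    # in place of the full article.
    print(extract_context(question, ctx, keep=2))
```

The `utility` argument is left pluggable so that a learned scorer, such as the sentence-utility models the abstract mentions, could replace the heuristic without changing the selection logic.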

Articles published on Answering Task

Comparative Analysis of Denoising Methods to Improve Image Quality for Medical Visual Question Answering

Backpropagation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration

BhashaBlend: Enabling Multilingual understanding of videos through NLP for Deaf and Hearing Impaired users

Robust visual question answering via polarity enhancement and contrast

Comparative Analysis of Single and Multiagent Large Language Model Architectures for Domain-Specific Tasks in Well Construction

Learning to enhance areal video captioning with visual question answering

Relation-Aware Heterogeneous Graph Network for Learning Intermodal Semantics in Textbook Question Answering.

GS-CBR-KBQA: Graph-structured case-based reasoning for knowledge base question answering

Unsupervised literature mining approaches for extracting relationships pertaining to habitats and reproductive conditions of plant species

Multi-Modal Instruction-Tuning Small-Scale Language-and-Vision Assistant for Semiconductor Electron Micrograph Analysis

A Novel Pretrained General-purpose Vision Language Model for the Vietnamese Language

ADVANCEMENTS IN TEXT SUMMARIZATION AND EXTRACTIVE QUESTION-ANSWERING: A MACHINE LEARNING APPROACH

Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction Approach.

Question Calibration and Multi-Hop Modeling for Temporal Question Answering

NuScenes-QA: A Multi-Modal Visual Question Answering Benchmark for Autonomous Driving Scenario

YTCommentQA: Video Question Answerability in Instructional Videos

BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining

Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought

Improving Automatic VQA Evaluation Using Large Language Models

Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
