Small Amount Of Training Data Research Articles

Despite significant success of deep learning in object detection tasks, the standard training of deep neural networks requires access to a substantial quantity of annotated images across all classes. Data annotation is an arduous and time-consuming endeavor, particularly when dealing with infrequent objects. Few-shot object detection (FSOD) methods have emerged as a solution to the limitations of classic object detection approaches based on deep learning. FSOD methods demonstrate remarkable performance by achieving robust object detection using a significantly smaller amount of training data. A challenge for FSOD is that instances from novel classes that do not belong to the fixed set of training classes appear in the background and the base model may pick them up as potential objects. These objects behave similarly to label noise because they are classified as one of the training dataset classes, leading to FSOD performance degradation. We develop a semi-supervised algorithm to detect and then utilize these unlabeled novel objects as positive samples during the FSOD training stage to improve FSOD performance. Specifically, we develop a hierarchical ternary classification region proposal network (HTRPN) to localize the potential unlabeled novel objects and assign them new objectness labels to distinguish these objects from the base training dataset classes. Our improved hierarchical sampling strategy for the region proposal network (RPN) also boosts the perception ability of the object detection model for large objects. We test our approach and COCO and PASCAL VOC baselines that are commonly used in FSOD literature. Our experimental results indicate that our method is effective and outperforms the existing state-of-the-art (SOTA) FSOD methods. Our implementation is provided as a supplement to support reproducibility of the results https://github.com/zshanggu/HTRPN.11Early partial results of this work is presented in the 2023 ICCV Workshop on Visual Continual Learning (Shangguan and Rostami, 2023).

Context. Context-based question answering, a fundamental task in natural language processing, demands a deep understanding of the language’s nuances. While being a sophisticated task, it’s an essential part of modern search systems, intelligent assistants, chatbots, and the whole Conversational AI field. While English, Chinese, and other widely spoken languages have gathered an extensive number of datasets, algorithms, and benchmarks, the Ukrainian language, with its rich linguistic heritage and intricate syntax, has remained among low-resource languages in the NLP community, making the Question Answering problem even harder. Objective. The purpose of this work is to establish and benchmark a set of techniques, leveraging Large Language Models, combined in a single framework for solving the low-resource problem for Context-based question-answering task in Ukrainian. Method. A simple yet flexible framework for leveraging Large Language Models, developed as a part of this research work, enlights two key methods proposed and evaluated in this paper for dealing with a small amount of training data for context-based question-answering tasks. The first one utilizes Zero-shot and Few-shot learning – the two major subfields of N-shot learning, where N corresponds to the number of training samples, to build a bilingual instruction-based prompt strategy for language models inferencing in an extractive manner (find an answer span in context) instead of their natural generative behavior (summarize the context according to question). The second proposed method is based on the first one, but instead of just answering the question, the language model annotates the input context through the generation of question-answer pairs for the given paragraph. This synthetic data is used for extractive model training. This paper explores both augmentation-based training, when there is some annotated data already, and completely synthetic training, when no data is available. The key benefit of these two methods is the ability to obtain comparable prediction quality even without an expensive and long-term human annotation process. Results. Two proposed methods for solving the low-to-zero amount of training data problem for context-based questionanswering tasks in Ukrainian were implemented and combined into the flexible LLM experimentation framework. Conclusions. This research comprehensively studied OpenAI GPT-3.5, OpenAI GPT-4, Cohere Command, and Meta LLaMa-2 language understanding capabilities applied to context-based question answering in low-resource Ukrainian. The thorough evaluation of proposed methods on a diverse set of metrics proves their efficiency, unveiling the possibility of building components of search engines, chatbot applications, and standalone general-domain CBQA systems with Ukrainian language support while having almost zero annotated data. The prospect for further research is to extend the scope from the CBQA task evaluated in this paper to all major NLU tasks with the final goal of establishing a complete benchmark for LLMs’ capabilities evaluation in the Ukrainian language.

Small Amount Of Training Data Research Articles

Articles published on Small Amount Of Training Data

A click-based electrocorticographic brain-computer interface enables long-term high-performance switch scan spelling.

Generative AI-Driven Data Augmentation for Crack Detection in Physical Structures

MS-UNet: Multi-Scale Nested UNet for Medical Image Segmentation with Few Training Data Based on an ELoss and Adaptive Denoising Method

Improved region proposal network for enhanced few-shot object detection

Fine-Tuning SSL-Model to Enhance Detection of Cilioretinal Arteries on Colored Fundus Images.

Hybrid AI framework for the predictions of film cooling effectiveness distribution with various surface curvatures and compound angles

GeDa: Improving training data with large language models for Aspect Sentiment Triplet Extraction

Performance Analysis of Random Forest Algorithm in Automatic Building Segmentation with Limited Data

Development and Validation of an Artificial Intelligence Model for Detecting Rib Fractures on Chest Radiographs.

Gaussian processes for Bayesian inverse problems associated with linear partial differential equations

Meta-learning Achieves High Accuracy with a Small Amount of Training Data

Cross-domain structural damage identification using transfer learning strategy

Real-Time Microgrid Energy Scheduling Using Meta-Reinforcement Learning

Depth estimation and 3D reconstruction from UAV-borne imagery: Evaluation on the UseGeo dataset

Interpretable Prediction of SARS-CoV-2 Epitope-Specific TCR Recognition Using a Pre-Trained Protein Language Model.

Rician Noise Removal in Low-noise Condition via a Deep Unfolding Network

Data Augmentation for Sample Efficient and Robust Document Ranking

UA-LLM: ADVANCING CONTEXT-BASED QUESTION ANSWERING IN UKRAINIAN THROUGH LARGE LANGUAGE MODELS

Capacity estimation of lithium-ion batteries with uncertainty quantification based on temporal convolutional network and Gaussian process regression

Multi-path residual attention network for cancer diagnosis robust to a small number of training data of microscopic hyperspectral pathological images

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Small Amount Of Training Data Research Articles

Articles published on Small Amount Of Training Data

A click-based electrocorticographic brain-computer interface enables long-term high-performance switch scan spelling.

Generative AI-Driven Data Augmentation for Crack Detection in Physical Structures

MS-UNet: Multi-Scale Nested UNet for Medical Image Segmentation with Few Training Data Based on an ELoss and Adaptive Denoising Method

Improved region proposal network for enhanced few-shot object detection

Fine-Tuning SSL-Model to Enhance Detection of Cilioretinal Arteries on Colored Fundus Images.

Hybrid AI framework for the predictions of film cooling effectiveness distribution with various surface curvatures and compound angles

GeDa: Improving training data with large language models for Aspect Sentiment Triplet Extraction

Performance Analysis of Random Forest Algorithm in Automatic Building Segmentation with Limited Data

Development and Validation of an Artificial Intelligence Model for Detecting Rib Fractures on Chest Radiographs.

Gaussian processes for Bayesian inverse problems associated with linear partial differential equations

Meta-learning Achieves High Accuracy with a Small Amount of Training Data

Cross-domain structural damage identification using transfer learning strategy

Real-Time Microgrid Energy Scheduling Using Meta-Reinforcement Learning

Depth estimation and 3D reconstruction from UAV-borne imagery: Evaluation on the UseGeo dataset

Interpretable Prediction of SARS-CoV-2 Epitope-Specific TCR Recognition Using a Pre-Trained Protein Language Model.

Rician Noise Removal in Low-noise Condition via a Deep Unfolding Network

Data Augmentation for Sample Efficient and Robust Document Ranking

UA-LLM: ADVANCING CONTEXT-BASED QUESTION ANSWERING IN UKRAINIAN THROUGH LARGE LANGUAGE MODELS

Capacity estimation of lithium-ion batteries with uncertainty quantification based on temporal convolutional network and Gaussian process regression

Multi-path residual attention network for cancer diagnosis robust to a small number of training data of microscopic hyperspectral pathological images