External Knowledge Bases Research Articles

Retrieval-Augmented Generation (RAG) is a technique that enhances the capabilities of large language models (LLMs) by incorporating external knowledge sources. This method addresses common LLM limitations, including outdated information and the tendency to produce inaccurate “hallucinated” content. However, evaluating RAG systems is a challenge. Most benchmarks focus primarily on question answering applications, neglecting other potential scenarios where RAG could be beneficial. Accordingly, in the experiments, these benchmarks often assess only the LLM components of the RAG pipeline or the retriever in knowledge-intensive scenarios, overlooking the impact of external knowledge base construction and the retrieval component on the entire RAG pipeline in non-knowledge-intensive scenarios. To address these issues, this paper constructs a large-scale and more comprehensive benchmark, and evaluates all the components of RAG systems in various RAG application scenarios. Specifically, we refer to the CRUD actions that describe interactions between users and knowledge bases, and also categorize the range of RAG applications into four distinct types–Create, Read, Update, and Delete (CRUD). “Create” refers to scenarios requiring the generation of original, varied content. “Read” involves responding to intricate questions in knowledge-intensive situations. “Update” focuses on revising and rectifying inaccuracies or inconsistencies in pre-existing texts. “Delete” pertains to the task of summarizing extensive texts into more concise forms. For each of these CRUD categories, we have developed different datasets to evaluate the performance of RAG systems. We also analyze the effects of various components of the RAG system, such as the retriever, context length, knowledge base construction, and LLM. Finally, we provide useful insights for optimizing the RAG technology for different scenarios 1 .

Read full abstract

Abstract The article studies the current text processing tools based on Artificial Intelligence. A literature review is done emphasizing the dynamic evolution of AI-powered text analytics, having as its central tool ChatGPT and its capabilities. The focus is centered on the techniques and methods that are using embeddings in order to improve large language models (LLMs). In this paper is analyzed the current situation of the literature in terms of text processing using Retrieval-Augmented Generation and is highlighted the potential of this technology to enhance the interpretability and trust in applications critical, such as those related to education or business. AI has revolutionized natural language processing (NLP), which facilitated the machines to interpret and generate text efficiently and accurately. In addition, large language models with external knowledge bases have been developed. These are used to produce more accurate and contextually relevant text responses. This approach is called Retrieval-Augmented Generation (RAG is one of the most significant recent advancements in this field. Based on our study, two use cases are implemented to show the applicability of our study: one related to education and one related to business IT-related documents. The methodology describes the techniques used. This includes retrieval-augmented generation and embedding stored using vector databases. Our custom models are evaluated by comparing them to the general ones, without embeddings, showing superior performance. The article highlights remarkable progress in Retrieval-Augmented Generation (RAG), which is used for AI text processing with a focus on business and education fields. Further in this paper, many of the most significant highlights are presented, which include a scalable framework for AI applications, a new integration of Retrieval-Augmented Generation and embeddings, practical application demonstrations, bridging gaps in the analysis op AI text, significant development in AI performance and optimizing educational and business processes.

Read full abstract

External Knowledge Bases Research Articles

Related Topics

Articles published on External Knowledge Bases

Soft Prompt-tuning with Self-Resource Verbalizer for short text streams

UniKDD: A Unified Generative model for Knowledge-driven Dialogue

Knowledge-injected Prompt Learning for Chinese Biomedical Entity Normalization

CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models

QA-RAG: Exploring LLM Reliance on External Knowledge

Generative commonsense knowledge subgraph retrieval for open-domain dialogue response generation

Knowledge-injected prompt learning for actionable information extraction from crisis-related tweets

Testing and Validation of a Custom Retrained Large Language Model for the Supportive Care of HN Patients with External Knowledge Base.

Candidate-Heuristic In-Context Learning: A new framework for enhancing medical visual question answering with LLMs

High‐degree penalty based global statistical network embedding for name disambiguation in anonymized graph

Artificial Intelligence Text Processing Using Retrieval-Augmented Generation: Applications in Business and Education Fields

Temporal validity reassessment: commonsense reasoning about information obsoleteness

Infusing internalized knowledge of language models into hybrid prompts for knowledgeable dialogue generation

Incorporating Entity Type-Aware and Word–Word Relation-Aware Attention in Generative Named Entity Recognition

K-PathVQA: Knowledge-Aware Multimodal Representation for Pathology Visual Question Answering.

Research on Fake News Detection Based on Dual Evidence Perception

From Retrieval to Generation: A Simple and Unified Generative Model for End-to-End Task-Oriented Dialogue

A relational extraction approach based on multiple embedding representations and multi-head self-attention

Neurophysiological and psychophysical references for trends in supervised VQA multimodal deep learning: An interdisciplinary meta-analysis

Hierarchical Understanding in Robotic Manipulation: A Knowledge-Based Framework

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

External Knowledge Bases Research Articles

Related Topics

Articles published on External Knowledge Bases

Soft Prompt-tuning with Self-Resource Verbalizer for short text streams

UniKDD: A Unified Generative model for Knowledge-driven Dialogue

Knowledge-injected Prompt Learning for Chinese Biomedical Entity Normalization

CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models

QA-RAG: Exploring LLM Reliance on External Knowledge

Generative commonsense knowledge subgraph retrieval for open-domain dialogue response generation

Knowledge-injected prompt learning for actionable information extraction from crisis-related tweets

Testing and Validation of a Custom Retrained Large Language Model for the Supportive Care of HN Patients with External Knowledge Base.

Candidate-Heuristic In-Context Learning: A new framework for enhancing medical visual question answering with LLMs

High‐degree penalty based global statistical network embedding for name disambiguation in anonymized graph

Artificial Intelligence Text Processing Using Retrieval-Augmented Generation: Applications in Business and Education Fields

Temporal validity reassessment: commonsense reasoning about information obsoleteness

Infusing internalized knowledge of language models into hybrid prompts for knowledgeable dialogue generation

Incorporating Entity Type-Aware and Word–Word Relation-Aware Attention in Generative Named Entity Recognition

K-PathVQA: Knowledge-Aware Multimodal Representation for Pathology Visual Question Answering.

Research on Fake News Detection Based on Dual Evidence Perception

From Retrieval to Generation: A Simple and Unified Generative Model for End-to-End Task-Oriented Dialogue

A relational extraction approach based on multiple embedding representations and multi-head self-attention

Neurophysiological and psychophysical references for trends in supervised VQA multimodal deep learning: An interdisciplinary meta-analysis

Hierarchical Understanding in Robotic Manipulation: A Knowledge-Based Framework