Maximizing RAG efficiency: A comparative analysis of RAG methods

Tolga Şakar,Hakan Emekci

doi:10.1017/nlp.2024.53

Abstract

Abstract This paper addresses the optimization of retrieval-augmented generation (RAG) processes by exploring various methodologies, including advanced RAG methods. The research, driven by the need to enhance RAG processes as highlighted by recent studies, involved a grid-search optimization of 23,625 iterations. We evaluated multiple RAG methods across different vectorstores, embedding models, and large language models, using cross-domain datasets and contextual compression filters. The findings emphasize the importance of balancing context quality with similarity-based ranking methods, as well as understanding tradeoffs between similarity scores, token usage, runtime, and hardware utilization. Additionally, contextual compression filters were found to be crucial for efficient hardware utilization and reduced token consumption, despite the evident impacts on similarity scores, which may be acceptable depending on specific use cases and RAG methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Maximizing RAG efficiency: A comparative analysis of RAG methods

Abstract

Talk to us

Similar Papers

More From: Natural Language Processing

Lead the way for us

Journal: Natural Language Processing	Publication Date: Oct 30, 2024
License type: CC BY 4.0

Similar Papers

The Role of Personalized Generative AI in Advancing Petroleum Engineering and Energy Industry: A Roadmap to Secure and Cost-Efficient Knowledge Integration: A Case Study
Amr Gharieb ... Mohamed Y Soliman
-
Amr Gharieb, et. al.Amr Gharieb ... Mohamed Y Soliman
20 Sep 2024
20 Sep 2024

Investigating the Impact of Prompt Engineering on the Performance of Large Language Models for Standardizing Obstetric Diagnosis Text: Comparative Study.
Lei Wang ... Suling Zhao
JMIR Formative Research | VOL. 8
Lei Wang, et. al.Lei Wang ... Suling Zhao
08 Feb 2024
JMIR Formative Research | VOL. 8

#2924 Comparison of large language models and traditional natural language processing techniques in predicting arteriovenous fistula failure
Suman Lama ... Luca Neri
Nephrology Dialysis Transplantation | VOL. 39
Suman Lama, et. al.Suman Lama ... Luca Neri
23 May 2024
Nephrology Dialysis Transplantation | VOL. 39

Integrating Retrieval-Augmented Generation with Large Language Model Mistral 7b for Indonesian Medical Herb
Diash Firdaus ... Idi Sumardi
JISKA (Jurnal Informatika Sunan Kalijaga) | VOL. 9
Diash Firdaus, et. al.Diash Firdaus ... Idi Sumardi
25 Sep 2024
JISKA (Jurnal Informatika Sunan Kalijaga) | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Maximizing RAG efficiency: A comparative analysis of RAG methods

Abstract

Talk to us

Similar Papers

More From: Natural Language Processing