Optimizing RAG: Classifying Queries for Dynamic Processing

Kabir Olawore, Michael McTear, Y Bi

Research output: Contribution to conferencePaperpeer-review

Abstract

In Retrieval-Augmented Generation (RAG) systems efficient information retrieval is crucial for enhancing user experience and satisfaction, as response times and computational demands significantly impact performance. RAG can be unnecessarily resource-intensive for frequently asked questions (FAQs) and simple questions. In this paper we introduce an approach in which we categorize user questions into simple queries that do not require RAG processing. Evaluation results show that our proposal reduces latency and improves response efficiency compared to systems relying solely on RAG.
Original languageEnglish
Pages160-165
Number of pages5
Publication statusPublished online - 27 May 2025
Event15th International Workshop on Spoken Dialogue Systems Technology - Bilbao, Spain, Bilbao, Spain
Duration: 27 May 202530 May 2025

Conference

Conference15th International Workshop on Spoken Dialogue Systems Technology
Country/TerritorySpain
CityBilbao
Period27/05/2530/05/25

Fingerprint

Dive into the research topics of 'Optimizing RAG: Classifying Queries for Dynamic Processing'. Together they form a unique fingerprint.

Cite this