Infoscience

conference paper

Efficient Federated Search for Retrieval-Augmented Generation

Guerraoui, Rachid • Kermarrec, Anne-Marie • Petrescu, Diana Andreea
March 31, 2025
EuroMLSys ’25 Proceedings of the 2025 5th Workshop on Machine Learning and Systems [Forthcoming publication]
5th Workshop on Machine Learning and Systems (EuroMLSys)

Large language models (LLMs) have demonstrated remarkable capabilities across various domains but remain susceptible to hallucinations and inconsistencies, limiting their reliability. Retrieval-augmented generation (RAG) mitigates these issues by grounding model responses in external knowledge sources. Existing RAG workflows often leverage a single vector database, which is impractical in the common setting where information is distributed across multiple repositories. We introduce RAGRoute, a novel mechanism for federated RAG search. RAGRoute dynamically selects relevant data sources at query time using a lightweight neural network classifier. By not querying every data source, this approach significantly reduces query overhead, improves retrieval efficiency, and minimizes the retrieval of irrelevant information. We evaluate RAGRoute using the MIRAGE and MMLU benchmarks and demonstrate its effectiveness in retrieving relevant documents while reducing the number of queries. RAGRoute reduces the total number of queries by up to 77.5% and the communication volume by up to 76.2%.
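
The routing idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Source` type, the per-source `centroid`, the hand-set logistic scorer standing in for the trained neural network classifier, the threshold, and the toy two-dimensional embeddings are all assumptions made for illustration only. It shows a query being sent only to the data sources scored as relevant, instead of to every source.

```python
import math
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Source:
    """Hypothetical data source: a name, a summary embedding, and its documents."""
    name: str
    centroid: List[float]
    docs: Dict[str, List[float]]  # document id -> embedding


def dot(a: List[float], b: List[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def relevance(query: List[float], source: Source,
              weight: float = 4.0, bias: float = -2.0) -> float:
    """Stand-in for a learned classifier: logistic score in (0, 1).

    The weight and bias are hand-picked assumptions, not trained values.
    """
    return 1.0 / (1.0 + math.exp(-(weight * dot(query, source.centroid) + bias)))


def route(query: List[float], sources: List[Source],
          threshold: float = 0.5) -> List[Source]:
    """Keep only the sources whose relevance score reaches the threshold."""
    return [s for s in sources if relevance(query, s) >= threshold]


def federated_search(query: List[float], sources: List[Source],
                     k: int = 2, threshold: float = 0.5) -> Tuple[List[str], int]:
    """Query the selected sources only; return top-k doc ids and sources queried."""
    selected = route(query, sources, threshold)
    hits = [(dot(query, emb), doc_id)
            for s in selected
            for doc_id, emb in s.docs.items()]
    hits.sort(reverse=True)
    return [doc_id for _, doc_id in hits[:k]], len(selected)


# Toy corpus with made-up 2-d embeddings.
SOURCES = [
    Source("medicine", [1.0, 0.0], {"med-1": [0.9, 0.1], "med-2": [0.8, 0.2]}),
    Source("law", [0.0, 1.0], {"law-1": [0.1, 0.9]}),
    Source("sports", [-1.0, 0.0], {"sport-1": [-0.9, 0.1]}),
]

top_docs, queried = federated_search([1.0, 0.0], SOURCES)
saved_fraction = 1 - queried / len(SOURCES)
```

In this toy setting only one of the three sources is queried, so two thirds of the per-source queries are avoided. The savings reported in the abstract come from the paper's own evaluation and are unrelated to these made-up numbers.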

Type
conference paper
DOI
10.1145/3721146.3721942
Author(s)
Guerraoui, Rachid  

EPFL

Kermarrec, Anne-Marie  

EPFL

Petrescu, Diana Andreea  

EPFL

Pires, Rafael  

EPFL

Randl, Mathis Benjamin Manuel  

École Polytechnique Fédérale de Lausanne

de Vos, Martijn  

EPFL

Date Issued

2025-03-31

Publisher

ACM

Published in
EuroMLSys ’25 Proceedings of the 2025 5th Workshop on Machine Learning and Systems [Forthcoming publication]
ISBN of the book

979-8-4007-1538-9

Subjects

CCS Concepts:

  • Information systems → Retrieval models and ranking
  • Combination, fusion and federated search
  • Computing methodologies → Natural language processing

Keywords: Retrieval-Augmented Generation, Large Language Models, Federated Search, Resource Selection, Routing

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
SACS  
DCL  
Event name

5th Workshop on Machine Learning and Systems (EuroMLSys)

Event acronym

EuroMLSys 2025

Event place

Rotterdam, The Netherlands

Event date

2025-03-31

Available on Infoscience
March 20, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/248064
  • Contact
  • infoscience@epfl.ch


Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved