Infoscience

doctoral thesis

Collaborative AI Agents in the Era of Large Language Models

Josifoski, Martin  
2025

Developing agents that can reliably act on our behalf is central to artificial intelligence (AI). These agents must seamlessly interact with tools, like search engines and databases, and collaborate. In this thesis, we study the abstractions, methods, and infrastructure needed to enable and support the development of AI agents in the era of large language models (LLMs). The contributions of the thesis are divided into four parts.

Part 1 examines goal-oriented collaboration between two components, at least one of which is LLM-based. For an LLM-based component to interact successfully with others, it must adhere to specified interfaces, especially when interacting with traditional software-based components exposed through an API, and steer the collaboration toward high-utility outcomes. We show that LLM decoding algorithms serve as an efficient strategy to accomplish both objectives without modifying the underlying model.
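As a rough illustration of how a decoding algorithm can enforce an interface without touching the model, the sketch below restricts greedy decoding to token sequences that form a valid API call. It is a toy under stated assumptions, not the thesis's algorithm: the scorer `toy_score` stands in for an LLM's next-token scores, and `VALID_CALLS`, `allowed_next_tokens`, and `constrained_greedy_decode` are hypothetical names introduced only for this example.

```python
EOS = "<eos>"

# Hypothetical interface: the only call shapes the downstream API accepts.
VALID_CALLS = [
    ("search", "(", "query", ")", EOS),
    ("lookup", "(", "id", ")", EOS),
]


def allowed_next_tokens(prefix, valid_sequences):
    """Return the tokens that keep `prefix` a prefix of some valid sequence."""
    n = len(prefix)
    return {seq[n] for seq in valid_sequences if len(seq) > n and seq[:n] == prefix}


def constrained_greedy_decode(score_fn, valid_sequences, max_steps=16):
    """Greedy decoding in which only interface-conforming tokens compete."""
    prefix = ()
    for _ in range(max_steps):
        candidates = allowed_next_tokens(prefix, valid_sequences)
        if not candidates:
            break
        # The model's scores are used as-is; the constraint only filters candidates.
        best = max(sorted(candidates), key=lambda tok: score_fn(prefix, tok))
        prefix += (best,)
        if best == EOS:
            break
    return prefix


def toy_score(prefix, token):
    """Stand-in for an LLM: a fixed preference table (assumption for the demo)."""
    preferences = {"delete": 3.0, "search": 2.0, "lookup": 1.0}
    return preferences.get(token, 0.0)


result = constrained_greedy_decode(toy_score, VALID_CALLS)
print(" ".join(result))
```

Although the toy scorer prefers the token `delete`, that token never appears in the output because it does not begin any valid call; the unmodified scores decide only among conforming continuations.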

Part 2 focuses on scenarios where the underlying model's capabilities are insufficient for effective collaboration, and the training signal necessary for improving the model is not readily available. To address this challenge, we introduce the principle of exploiting asymmetry for synthetic data generation and demonstrate how it can be applied to generate useful data even for tasks that LLMs cannot solve directly. We highlight the generality of this approach by drawing connections to seminal work on self-improvement for LLMs.

Part 3 addresses the collaboration among multiple AI systems, tools, and humans. We propose an abstraction that, in concert with the accompanying library, provides a theoretical and practical infrastructure with a modular and concurrency-friendly design, which enables the modeling, implementation, and systematic study of arbitrarily complex structured interactions. To demonstrate the potential of the framework and the accompanying library, we use them to systematically investigate the benefits of complex interactions for solving competitive coding problems.

Part 4 proposes a novel perspective called semantic decoding that allows us to systematically study the design space of structured interactions. We conclude this part by discussing the research opportunities and questions emerging from the semantic decoding perspective, enabled by the foundation laid in Parts 1, 2, and 3.

Type
doctoral thesis
DOI
10.5075/epfl-thesis-10872
Author(s)
Josifoski, Martin  

EPFL

Advisors
West, Robert  
Jury

Prof. Viktor Kuncak (president); Prof. Robert West (thesis director); Prof. Caglar Gulcehre, Prof. Pasquale Minervini, Dr. Claudiu Musat (examiners)

Date Issued

2025

Publisher

EPFL

Publisher place

Lausanne

Public defense date

2025-05-02

Thesis number

10872

Total pages

203

Subjects

AI agents • large language models • synthetic data generation • decoding algorithms • transformers • natural language processing • artificial intelligence

EPFL units
DLAB  
Faculty
IC  
School
IINFCOM  
Doctoral School
EDIC  
Available on Infoscience
April 16, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/249358
Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved