Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. EPFL thesis
  4. Stable Optimization in Deep Learning: Geometry and Games
 
doctoral thesis

Stable Optimization in Deep Learning: Geometry and Games

Pethick, Thomas Michaelsen  
2026

Training instabilities are ubiquitous in deep learning and have led to many heuristics aimed at stabilizing the learning process. From an optimization standpoint, such instabilities often arise when the local model used by the algorithm fails to faithfully capture the underlying structure of the problem. This thesis takes a principled approach to address this issue by designing optimization methods that better align with the geometry and game-theoretic nature of the problems arising in deep learning.

In Part I, we focus on minimization problems and propose a family of geometry-aware algorithms that adapt to the structure of neural networks. By explicitly incorporating norm constraints and scale-invariant updates, these methods allow for stable and efficient training of large models with large batch sizes, all without incurring additional memory overhead.

In Part II, we move on to the significantly more challenging problem of multi-agent games. These settings are known to exhibit unstable dynamics, such as limit cycles and divergence. We show that such pathologies can be avoided through simple, local update rules, even when classical assumptions like monotonicity fail. We introduce new algorithmic frameworks based on extragradient and proximal methods.

In both parts, we address both the deterministic and stochastic settings. In the stochastic case, we incorporate momentum-based gradient estimators that reduce variance without requiring increasing batch sizes. This plays a central role in ensuring convergence of the proposed methods both in theory and practice.

  • Files
  • Details
  • Metrics
Type
doctoral thesis
DOI
10.5075/epfl-thesis-10377
Author(s)
Pethick, Thomas Michaelsen  
Advisors
Cevher, Volkan  orcid-logo
Jury

Prof. Michel Bierlaire (président) ; Prof. Volkan Cevher (directeur de thèse) ; Prof. Nicolas Flammarion, Prof. Sebastian Pokutta, Prof. Panayotis Mertikopoulos (rapporteurs)

Date Issued

2026

Publisher

EPFL

Publisher place

Lausanne

Public defense year

2026-02-20

Thesis number

10377

Total of pages

246

Subjects

Non-Euclidean

•

Conditional gradient methods

•

Variational Inequalities

•

Monotone Operators

•

Extragradient methods

•

Proximal methods

EPFL units
LIONS  
Faculty
STI  
School
IEM  
Doctoral School
EDEE  
Available on Infoscience
February 18, 2026
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/259763
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés