Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. EPFL thesis
  4. Bayesian and Data-Efficient Strategies for the Optimization of Organic Reactions
 
doctoral thesis

Bayesian and Data-Efficient Strategies for the Optimization of Organic Reactions

Schöpfer, Alexandre Alain  
2026

Efficient optimization of organic reactions remains a central challenge in synthetic chemistry, often hindered by the vast combinatorial space of possible conditions and the limitations of traditional trial-and-error approaches. In this thesis, reaction optimization in homogeneous catalysis is advanced by integrating statistical and computational methodologies across four pillars: data collection and curation, molecular and reaction representation, robust modeling and model selection, and experimental design with a focus on Bayesian optimization. Emphasis is placed on strategies for low-data regimes, interpretability, and systematic decision-making under uncertainty, with outcomes demonstrated on representative case studies.

The first pillar establishes practical guidance for building statistically robust, machine-learning-ready datasets, with ongoing challenges in streamlining these processes discussed in detail. The second pillar develops a reaction-agnostic featurization strategy for bidentate ligands and curates ligand datasets to enable interpretable and transferable modeling. The third pillar introduces ReaFS (reaction feature selection) as a systematic methodology for selecting informative features and validating multivariate linear regression models, with a focus on model stability and interpretability in low-sample regimes. The fourth pillar extends classical Bayesian optimization by incorporating experimental cost into the acquisition strategy, resulting in cost-informed Bayesian optimization (CIBO) that reduces expected expenditure relative to cost-agnostic policies while respecting practical constraints. Application of selected methodologies is demonstrated on reaction scope exploration and optimization, including azidofunctionalization and carboetherification reactions. The thesis concludes by outlining future directions for integrating these tools with broader chemical synthesis planning, highlighting the need for standardized practices, rigorous validation, and continued methodological innovation.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

EPFL_TH11571.pdf

Type

Main Document

Version

Not Applicable (or Unknown)

Access type

restricted

License Condition

N/A

Size

5.21 MB

Format

Adobe PDF

Checksum (MD5)

4bfa57968dbcfcc95db013995540654b

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés