Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Preprints and Working Papers
  4. Robust Out-of-Distribution Prediction of Buchwald-Hartwig Reactions
 
preprint

Robust Out-of-Distribution Prediction of Buchwald-Hartwig Reactions

Neves, Paulo
•
Hao, Bo
•
Aikonen, Santeri
Show more
October 13, 2025

The Buchwald–Hartwig cross-coupling is a cornerstone of modern pharmaceutical synthesis, yet predictive modeling of its outcomes remains limited by data quality and chemical space coverage. Industry electronic laboratory notebooks (ELNs) contain heterogeneous, noisy records, while open-source high-throughput experimentation (HTE) datasets are fragmented and narrow in scope. As a result, models often fail when applied to novel substrate and condition combinations. Here we introduce a unified framework that systematically standardizes and integrates diverse reaction data into a high-quality, unique-structure-per-entity dataset, coupled with active learning to strategically expand chemical space. By merging published Buchwald–Hartwig HTE data with new experimental results, we achieve models that generalize across substrates and conditions, delivering substantially improved out-of-distribution predictions relative to previous approaches. Crucially, model-guided reagent and condition recommendations were validated experimentally, confirming the framework’s utility for exploring unexplored reactivity. This work establishes a blueprint for robust machine learning in synthetic chemistry, with the potential to accelerate pharmaceutical discovery by enabling more reliable and scalable prediction of reaction outcomes.

  • Files
  • Details
  • Metrics
Type
preprint
DOI
10.26434/chemrxiv-2025-xcr46
Author(s)
Neves, Paulo
Hao, Bo

Johnson & Johnson (United States)

Aikonen, Santeri

Johnson & Johnson (United States)

Diccianni, Justin B.

Johnson & Johnson (United States)

Wegner, Jörg K.

Johnson & Johnson (United States)

Schwaller, Philippe  

École Polytechnique Fédérale de Lausanne

Strambeanu, Iulia I.

Johnson & Johnson (United States)

Date Issued

2025-10-13

Publisher

American Chemical Society (ACS)

Written at

EPFL

EPFL units
LIAC  
Available on Infoscience
October 20, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/255116
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés