Physically-constrained machine learning models of effective electronic Hamiltonians
Accurate quantum mechanical (QM) simulations are central to understanding the electronic structure and properties of molecules and materials. Electronic structure methods solve the Schrödinger equation for the electronic Hamiltonian, from which all ground and excited state properties can be derived. However, their steep computational cost limits applications to small systems or short timescales. Machine learning (ML) offers a way forward by creating surrogate models that map structure to properties at much lower cost. Early ML approaches focus on specific observables, energies, forces, charges, dipole moments, polarizabilities, rather than the underlying electronic structure. While effective within their training domain, such models lack transferability and cannot predict properties beyond those in the training data. A more general solution is to learn fundamental electronic quantities, such as electron densities, density matrices, wave functions, or effective single-particle Hamiltonians, which provide access to many properties through inexpensive postprocessing. This thesis focuses on learning an effective single-particle Hamiltonian.
Accurate Hamiltonians require large basis sets, producing high-dimensional matrices that make direct learning difficult. To balance accuracy and efficiency, we introduce an indirect learning framework. Instead of using matrix elements as final targets, the Hamiltonian is treated as an intermediate representation, while learning targets are derived properties such as orbital energies, charges, or observables computed in either the model basis or a larger reference basis. The model remains parametrized in a compact minimal basis, reducing complexity while still guided by information from more complete calculations. This hybrid design improves efficiency without sacrificing accuracy and preserves access to a wide range of properties through postprocessing. Using automatic differentiation, we optimize the effective Hamiltonian to reproduce observables from either the same or larger basis. Coupled with the Tammâ Dancoff approximation, ML-predicted Hamiltonians can predict singlet excited states across molecules. The models generalize well to unseen, larger systems while being orders of magnitude faster than reference methods, enabling applications such as computing spectral densities from molecular dynamics.
We extend this framework by interfacing ML Hamiltonians with PySCFAD, an auto-differentiable electronic structure code supporting density matrix construction and linear response calculations. This greatly expands the scope of indirect models, providing access to many observables without reimplementing differentiable routines. We analyze how design choices, such as adding physical constraints or basis set parametrization, affect accuracy and transferability. Well-regularized models extrapolate reliably to larger molecules, and for properties like dipole moments and polarizabilities, Hamiltonian-based models outperform property-specific ones. We also extend the framework to periodic systems for predicting band energies.
This thesis shows that ML Hamiltonians offer a powerful and generalizable bridge between QM and ML. By targeting an operator central to electronic structure rather than isolated properties, these models deliver efficient surrogates capable of predicting diverse observables with high accuracy. This work points toward hybrid MLâ QM approaches that unify accuracy and efficiency.
EPFL_TH11666.pdf
Main Document
Not Applicable (or Unknown)
openaccess
N/A
21.85 MB
Adobe PDF
c7cc718018556e371ea8514079c1048d