Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. ∇<sup>2</sup>DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials
 
conference paper

∇2DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials

Khrabrov, Kuzma
•
Ber, Anton
•
Tsypin, Artem
Show more
Globerson, Amir
•
Mackey, Lester
Show more
2024
SESAR Innovation Days
38th Conference on Neural Information Processing Systems (NeurIPS 2024)

Methods of computational quantum chemistry provide accurate approximations of molecular properties crucial for computer-aided drug discovery and other areas of chemical science. However, high computational complexity limits the scalability of their applications. Neural network potentials (NNPs) are a promising alternative to quantum chemistry methods, but they require large and diverse datasets for training. This work presents a new dataset and benchmark called ∇2DFT that is based on the nablaDFT. It contains twice as much molecular structures, three times more conformations, new data types and tasks, and state-of-the-art models. The dataset includes energies, forces, 17 molecular properties, Hamiltonian and overlap matrices, and a wavefunction object. All calculations were performed at the DFT level (ωB97X-D/def2-SVP) for each conformation. Moreover, ∇2DFT is the first dataset that contains relaxation trajectories for a substantial number of drug-like molecules. We also introduce a novel benchmark for evaluating NNPs in molecular property prediction, Hamiltonian prediction, and conformational optimization tasks. Finally, we propose an extendable framework for training NNPs and implement 10 models within it.

  • Details
  • Metrics
Type
conference paper
Scopus ID

2-s2.0-105000505708

Author(s)
Khrabrov, Kuzma

AIRI

Ber, Anton

AIRI

Tsypin, Artem

AIRI

Ushenin, Konstantin

AIRI

Rumiantsev, Egor  

EPFL

Telepov, Alexander

AIRI

Protasov, Dmitry

AIRI

Shenbin, Ilya

St. Petersburg Department of V.A.Steklov Institute of Mathematics of the Russian Academy of Sciences

Alekseev, Anton

Saint Petersburg State University

Shirokikh, Mikhail

Saint Petersburg State University

Show more
Editors
Globerson, Amir
•
Mackey, Lester
•
Belgrave, Danielle
•
Fan, Angela
•
Paquet, Ulrich
•
Tomczak, Jakub
•
Zhang, Cheng
Date Issued

2024

Publisher

Neural information processing systems foundation

Published in
SESAR Innovation Days
ISBN of the book

9789462085763

Series title/Series vol.

Advances in Neural Information Processing Systems; 37

ISSN (of the series)

1049-5258

Published in
Transactions on Machine Learning Research
Volume

2023-February

Start page

82

End page

83

Subjects

Air traffic management system

•

Aircraft trajectory optimization

•

Climate impact

•

Constrained Markov decision process

•

Multi-agent reinforcement learning

•

Proximal policy optimization algorithm

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
COSMO  
Event nameEvent acronymEvent placeEvent date
38th Conference on Neural Information Processing Systems (NeurIPS 2024)

NeurIPS 2024

Vancouver, Canada

2024-12-10 - 2024-12-15

FunderFunding(s)Grant NumberGrant URL

Analytical Center for the Government of the Russian Federation

000000D730321P5Q0002

Russian Academy of Sciences

70-2021-00142

Available on Infoscience
April 7, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/248751
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés