Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. A multi-modal pre-training transformer for universal transfer learning in metal-organic frameworks
 
research article

A multi-modal pre-training transformer for universal transfer learning in metal-organic frameworks

Kang, Yeonghun
•
Park, Hyunsoo
•
Smit, Berend  
Show more
March 13, 2023
Nature Machine Intelligence

Metal-organic frameworks (MOFs) are a class of crystalline porous materials that exhibit a vast chemical space owing to their tunable molecular building blocks with diverse topologies. An unlimited number of MOFs can, in principle, be synthesized. Machine learning approaches can help to explore this vast chemical space by identifying optimal candidates with desired properties from structure-property relationships. Here we introduce MOFTransformer, a multi-modal Transformer encoder pre-trained with 1 million hypothetical MOFs. This multi-modal model utilizes integrated atom-based graph and energy-grid embeddings to capture both local and global features of MOFs, respectively. By fine-tuning the pre-trained model with small datasets ranging from 5,000 to 20,000 MOFs, our model achieves state-of-the-art results for predicting across various properties including gas adsorption, diffusion, electronic properties, and even text-mined data. Beyond its universal transfer learning capabilities, MOFTransformer generates chemical insights by analyzing feature importance through attention scores within the self-attention layers. As such, this model can serve as a platform for other MOF researchers that seek to develop new machine learning models for their work.

Metal-organic frameworks are of high interest for a range of energy and environmental applications due to their stable gas storage properties. A new machine learning approach based on a pre-trained multi-modal transformer can be fine-tuned with small datasets to predict structure-property relationships and design new metal-organic frameworks for a range of specific tasks.

  • Details
  • Metrics
Type
research article
DOI
10.1038/s42256-023-00628-2
Web of Science ID

WOS:000948642600001

Author(s)
Kang, Yeonghun
Park, Hyunsoo
Smit, Berend  
Kim, Jihan
Date Issued

2023-03-13

Publisher

NATURE PORTFOLIO

Published in
Nature Machine Intelligence
Volume

5

Issue

3

Start page

309

End page

318

Subjects

Computer Science, Artificial Intelligence

•

Computer Science, Interdisciplinary Applications

•

Computer Science

•

algorithms

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LSMO  
Available on Infoscience
April 10, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/196755
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés