Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. XTab: Cross-table Pretraining for Tabular Transformers
 
conference paper

XTab: Cross-table Pretraining for Tabular Transformers

Zhu, Bingzhao  
•
Shi, Xingjian
•
Erickson, Nick
Show more
Krause, Andreas
•
Brunskill,Emma
2023
ICML'23: Proceedings of the 40th International Conference on Machine Learning
ICML'23: International Conference on Machine Learning

The success of self-supervised learning in computer vision and natural language processing has motivated pretraining methods on tabular data. However, most existing tabular self-supervised learning models fail to leverage information across multiple data tables and cannot generalize to new tables. In this work, we introduce XTab, a framework for cross-table pretraining of tabular transformers on datasets from various domains. We address the challenge of inconsistent column types and quantities among tables by utilizing independent featurizers and using federated learning to pretrain the shared component. Tested on 84 tabular prediction tasks from the OpenML-AutoML Benchmark (AMLB), we show that (1) XTab consistently boosts the generalizability, learning speed, and performance of multiple tabular transformers, (2) by pretraining FT-Transformer via XTab, we achieve superior performance than other state-of-the-art tabular deep learning models on various tasks such as regression, binary, and multiclass classification.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.48550/arXiv.2305.06090
Author(s)
Zhu, Bingzhao  
Shi, Xingjian
Erickson, Nick
Li, Mu
Karypis, George
Shoaran, Mahsa  
Editors
Krause, Andreas
•
Brunskill,Emma
Date Issued

2023

Publisher

JMLR.org

Published in
ICML'23: Proceedings of the 40th International Conference on Machine Learning
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
INL  
Event nameEvent placeEvent date
ICML'23: International Conference on Machine Learning

Honolulu, Hawaii, USA

July 23 - 29, 2023

Available on Infoscience
May 14, 2024
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/207884
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés