Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Stochastic Spectral Descent for Discrete Graphical Models
 
Loading...
Thumbnail Image
research article

Stochastic Spectral Descent for Discrete Graphical Models

Carlson, David
•
Hsieh, Ya-Ping  
•
Collins, Edo
Show more
2016
IEEE Journal of Selected Topics in Signal Processing

Interest in deep probabilistic graphical models has increased in recent years, due to their state-of-the-art perfor- mance on many machine learning applications. Such models are typically trained with the stochastic gradient method, which can take a significant number of iterations to converge. Since the computational cost of gradient estimation is prohibitive even for modestly-sized models, training becomes slow and practically- usable models are kept small. In this paper we propose a new, largely tuning-free algorithm to address this problem. Our approach derives novel majorization bounds based on the Schatten-∞ norm. Intriguingly, the minimizers of these bounds can be interpreted as gradient methods in a non-Euclidean space. We thus propose using a stochastic gradient method in non-Euclidean space. We both provide simple conditions under which our algorithm is guaranteed to converge, and demonstrate empirically that our algorithm leads to dramatically faster training and improved predictive ability compared to stochastic gradient descent for both directed and undirected graphical models.

  • Files
  • Details
  • Metrics
Type
research article
DOI
10.1109/Jstsp.2015.2505684
Web of Science ID

WOS:000370957200007

Author(s)
Carlson, David
•
Hsieh, Ya-Ping  
•
Collins, Edo
•
Carin, Lawrence
•
Cevher, Volkan  orcid-logo
Date Issued

2016

Publisher

Ieee-Inst Electrical Electronics Engineers Inc

Published in
IEEE Journal of Selected Topics in Signal Processing
Volume

10

Issue

2

Start page

296

End page

311

Subjects

Deep Learning

•

Spectral Descent

•

Non-Euclidean Algorithms

Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIONS  
Available on Infoscience
December 22, 2015
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/121876
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés