Abstract

Interest in deep probabilistic graphical models has increased in recent years, due to their state-of-the-art performance on many machine learning applications. Such models are typically trained with the stochastic gradient method, which can take a significant number of iterations to converge. Since the computational cost of gradient estimation is prohibitive even for modestly sized models, training becomes slow and practically usable models are kept small. In this paper we propose a new, largely tuning-free algorithm to address this problem. Our approach derives novel majorization bounds based on the Schatten-∞ norm. Intriguingly, the minimizers of these bounds can be interpreted as gradient methods in a non-Euclidean space. We thus propose using a stochastic gradient method in this non-Euclidean space. We provide simple conditions under which our algorithm is guaranteed to converge, and demonstrate empirically that it leads to dramatically faster training and improved predictive ability compared to stochastic gradient descent, for both directed and undirected graphical models.
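To make the abstract's central idea concrete, the following is a minimal, hedged sketch of a stochastic gradient step taken with respect to the Schatten-∞ (spectral) norm rather than the Euclidean norm: for a matrix-valued gradient G with SVD G = U diag(s) Vᵀ, steepest descent under the spectral norm replaces G by U Vᵀ scaled by the nuclear norm of G (the dual of the spectral norm). The NumPy implementation, the function name spectral_step, and the learning rate lr are illustrative assumptions, not the paper's actual algorithm or code.

    import numpy as np

    def spectral_step(W, G, lr=0.1):
        # One stochastic gradient step in the Schatten-inf (spectral) geometry.
        # G stands in for a minibatch gradient estimate of the weight matrix W.
        U, s, Vt = np.linalg.svd(G, full_matrices=False)
        # Steepest-descent direction under the spectral norm: U @ Vt,
        # scaled by the dual (nuclear) norm of G, i.e. the sum of singular values.
        sharp = s.sum() * (U @ Vt)
        return W - lr * sharp

    # Toy usage with random placeholders for the weights and the gradient estimate.
    rng = np.random.default_rng(0)
    W = rng.standard_normal((50, 20))
    G = rng.standard_normal((50, 20))
    W_new = spectral_step(W, G, lr=0.01)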
