Stochastic Spectral Descent for Restricted Boltzmann Machines

Carlson, David; Cevher, Volkan; Carin, Lawrence

2015

Abstract

Restricted Boltzmann Machines (RBMs) are widely used as building blocks for deep learning models. Learning typically proceeds by stochastic gradient descent, with the gradients estimated by sampling methods. However, gradient estimation is a computational bottleneck, so making better use of each gradient speeds up the descent algorithm. To this end, we first derive upper bounds on the RBM cost function, then show that descent methods can have natural advantages by operating in the ℓ∞ and Schatten-∞ norms. We introduce a new method called "Stochastic Spectral Descent" that updates parameters in the normed space. Empirical results show dramatic improvements over stochastic gradient descent, with only a fractional increase in the per-iteration cost.
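
As a rough illustration of what updating parameters "in the normed space" can look like, the sketch below computes the standard steepest-descent directions under the ℓ∞ norm (for a vector) and the Schatten-∞, i.e. spectral, norm (for a matrix). It is not the authors' implementation: the function names and the `step` parameter are hypothetical, and the step-size constants that would follow from the upper bounds are not given in this record.

```python
import numpy as np

def sharp_linf(g):
    # Steepest-descent direction under the l-infinity norm:
    # the maximizer of <g, x> - 0.5 * ||x||_inf^2 is ||g||_1 * sign(g).
    return np.sum(np.abs(g)) * np.sign(g)

def sharp_schatten_inf(G):
    # Steepest-descent direction under the Schatten-infinity (spectral) norm:
    # with the thin SVD G = U diag(s) V^T, it is (sum of s) * U V^T.
    U, s, Vt = np.linalg.svd(G, full_matrices=False)
    return np.sum(s) * (U @ Vt)

def spectral_step(W, G, step):
    # One hypothetical update of a weight matrix W given a gradient
    # estimate G; `step` is an assumed, user-chosen step size.
    return W - step * sharp_schatten_inf(G)

# Toy usage with a made-up gradient estimate.
W = np.zeros((2, 2))
G = np.diag([3.0, 1.0])
W_new = spectral_step(W, G, step=0.1)
```

Compared with a plain gradient step, the matrix update replaces every singular value of the gradient estimate with their sum, so all singular directions move by the same amount. The extra work per iteration is one SVD of the gradient matrix.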

Details

Title Stochastic Spectral Descent for Restricted Boltzmann Machines

Author(s) Carlson, David ; Cevher, Volkan ; Carin, Lawrence

Conference The 18th International Conference on Artificial Intelligence and Statistics, San Diego, USA, May 9-12, 2015

Date 2015

Keywords

ml-ai

Laboratories LIONS

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIONS - Laboratory for Information and Inference Systems
Conference Papers
Work produced at EPFL

Record creation date 2015-01-27
