Fast Automatic Smoothing for Generalized Additive Models

Davison, Anthony C.

research article

El-Bachir, Yousra

•

Davison, Anthony C.

January 1, 2019

Journal Of Machine Learning Research

Generalized additive models (GAMs) are regression models wherein parameters of probability distributions depend on input variables through a sum of smooth functions, whose degrees of smoothness are selected by L-2 regularization. Such models have become the de-facto standard nonlinear regression models when interpretability and flexibility are required, but reliable and fast methods for automatic smoothing in large data sets are still lacking. We develop a general methodology for automatically learning the optimal degree of L-2 regularization for GAMs using an empirical Bayes approach. The smooth functions are penalized by hyper-parameters that are learned simultaneously by maximization of a marginal likelihood using an approximate expectation-maximization algorithm. The latter involves a double Laplace approximation at the E-step, and leads to an efficient M-step. Empirical analysis shows that the resulting algorithm is numerically stable, faster than the best existing methods and achieves state-of-the-art accuracy. For illustration, we apply it to an important and challenging problem in the analysis of extremal data.

Type

research article

Web of Science ID

WOS:000506403100013

Author(s)

El-Bachir, Yousra

Davison, Anthony C.

Date Issued

2019-01-01

Publisher

MICROTOME PUBL

Published in

Journal Of Machine Learning Research

Volume

20

Start page

173

Subjects

Automation & Control Systems

•

Computer Science, Artificial Intelligence

•

Automation & Control Systems

•

Computer Science

•

automatic l-2 regularization

•

empirical bayes

•

expectation-maximization algorithm

•

generalized additive model

•

laplace approximation

•

marginal maximum likelihood

•

frequency-distribution

•

maximum-likelihood

•

regression

•

selection

•

parameter

•

scale

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

STAT

Available on Infoscience

January 26, 2020

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/164931