Deep Neural Networks With Trainable Activations and Controlled Lipschitz Constant

Aziznejad, Shayan; Gupta, Harshit; Campos, Joaquim; Unser, Michael

doi:10.1109/TSP.2020.3014611

research article

January 1, 2020

IEEE Transactions on Signal Processing

We introduce a variational framework to learn the activation functions of deep neural networks. Our aim is to increase the capacity of the network while controlling an upper bound on the actual Lipschitz constant of the input-output relation. To that end, we first establish a global bound for the Lipschitz constant of neural networks. Based on this bound, we then formulate a variational problem for learning activation functions. Our variational problem is infinite-dimensional and is not computationally tractable. However, we prove that there always exists a solution that has continuous and piecewise-linear (linear-spline) activations. This reduces the original problem to a finite-dimensional minimization in which an ℓ1 penalty on the parameters of the activations favors the learning of sparse nonlinearities. We numerically compare our scheme with standard ReLU networks and their variations, PReLU and LeakyReLU, and we empirically demonstrate the practical aspects of our framework.
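As an illustration of the linear-spline activations and the sparsity-promoting penalty mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes one common way of writing a continuous piecewise-linear function: an affine term plus a weighted sum of shifted ReLUs. The names `linear_spline`, `l1_penalty`, `spline_lipschitz`, `b0`, `b1`, `a`, and `knots` are hypothetical, and the exact parameterization and regularizer used in the paper may differ.

```python
import numpy as np

def linear_spline(x, b0, b1, a, knots):
    """Continuous piecewise-linear activation (assumed form):
    sigma(x) = b0 + b1 * x + sum_k a[k] * max(x - knots[k], 0).
    """
    x = np.asarray(x, dtype=float)
    # Broadcasting gives one column of shifted ReLU values per knot.
    relu_terms = np.maximum(x[..., None] - knots, 0.0)
    return b0 + b1 * x + relu_terms @ a

def l1_penalty(a):
    """l1 norm of the ReLU coefficients. Each nonzero a[k] is a change of
    slope at knots[k], so a sparse `a` means few linear pieces."""
    return float(np.sum(np.abs(a)))

def spline_lipschitz(b1, a):
    """Lipschitz constant of the spline: the largest absolute slope.
    Assumes `knots` is sorted in increasing order, so the slopes are
    b1, b1 + a[0], b1 + a[0] + a[1], and so on."""
    slopes = b1 + np.concatenate(([0.0], np.cumsum(a)))
    return float(np.max(np.abs(slopes)))

# Illustrative values only: three candidate knots, one active coefficient.
knots = np.array([-1.0, 0.0, 1.0])
a = np.array([0.0, 1.0, 0.0])
x = np.array([-2.0, 0.5, 3.0])

relu_like = linear_spline(x, b0=0.0, b1=0.0, a=a, knots=knots)
leaky_like = linear_spline(x, b0=0.0, b1=0.1, a=0.9 * a, knots=knots)
```

With a single active coefficient at the knot 0, the sketch reproduces ReLU (`b1 = 0`) or a LeakyReLU-like shape (`b1 = 0.1`), which suggests how such a parameterization can contain the baselines named in the abstract as special cases. Under this assumed form, penalizing `l1_penalty(a)` during training would push most coefficients to zero and leave only a few active knots.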

Name

aziznejad2001.pdf

Type

Publisher's version

Access type

openaccess

License Condition

CC BY-NC-ND

Size

1.36 MB

Format

Adobe PDF

Checksum (MD5)

d6da229d1a648224c91431f6bdb79d6b