Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Nonsmooth Implicit Differentiation for Machine Learning and Optimization

Abstract : In view of training increasingly complex learning architectures, we establish a nonsmooth implicit function theorem with an operational calculus. Our result applies to most practical problems (i.e., definable problems) provided that a nonsmooth form of the classical invertibility condition is fulfilled. This approach allows for formal subdifferentiation: for instance, replacing derivatives by Clarke Jacobians in the usual differentiation formulas is fully justified for a wide class of nonsmooth problems. Moreover this calculus is entirely compatible with algorithmic differentiation (e.g., backpropagation). We provide several applications such as training deep equilibrium networks, training neural nets with conic optimization layers, or hyperparameter-tuning for nonsmooth Lasso-type models. To show the sharpness of our assumptions, we present numerical experiments showcasing the extremely pathological gradient dynamics one can encounter when applying implicit algorithmic differentiation without any hypothesis.
Complete list of metadata

https://hal.archives-ouvertes.fr/hal-03251332
Contributor : Edouard Pauwels Connect in order to contact the contributor
Submitted on : Monday, June 7, 2021 - 8:30:29 AM
Last modification on : Tuesday, October 19, 2021 - 2:23:19 PM
Long-term archiving on: : Wednesday, September 8, 2021 - 6:05:28 PM

Files

implicitNonsmooth.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-03251332, version 1
  • ARXIV : 2106.04350

Citation

Jérôme Bolte, Tam Le, Edouard Pauwels, Antonio Silveti-Falls. Nonsmooth Implicit Differentiation for Machine Learning and Optimization. 2021. ⟨hal-03251332⟩

Share

Metrics

Record views

78

Files downloads

50