Functional vs. parametric equivalence of ReLU networks

P. Bui Thi Mai, C. Lampert, in:, n.d.

Download
OA main.pdf 405.47 KB
Conference Paper | Accepted | English
Department
Abstract
We address the following question: How redundant is the parameterisation of ReLU networks? Specifically, we consider transformations of the weight space which leave the function implemented by the network intact. Two such transformations are known for feed-forward architectures: permutation of neurons within a layer, and positive scaling of all incoming weights of a neuron coupled with inverse scaling of its outgoing weights. In this work, we show for architectures with non-increasing widths that permutation and scaling are in fact the only function-preserving weight transformations. For any eligible architecture we give an explicit construction of a neural network such that any other network that implements the same function can be obtained from the original one by the application of permutations and rescaling. The proof relies on a geometric understanding of boundaries between linear regions of ReLU networks, and we hope the developed mathematical tools are of independent interest.
Publishing Year
Date Published
2020-02-14
IST-REx-ID

Cite this

Bui Thi Mai P, Lampert C. Functional vs. parametric equivalence of ReLU networks.
Bui Thi Mai, P., & Lampert, C. (n.d.). Functional vs. parametric equivalence of ReLU networks.
Bui Thi Mai, Phuong, and Christoph Lampert. “Functional vs. Parametric Equivalence of ReLU Networks,” n.d.
P. Bui Thi Mai and C. Lampert, “Functional vs. parametric equivalence of ReLU networks.”
Bui Thi Mai P, Lampert C. Functional vs. parametric equivalence of ReLU networks
Bui Thi Mai, Phuong, and Christoph Lampert. Functional vs. Parametric Equivalence of ReLU Networks.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]
Main File(s)
File Name
main.pdf 405.47 KB
Access Level
OA Open Access
Date Uploaded
2020-02-11
MD5 Checksum
8d372ea5defd8cb8fdc430111ed754a9


Export

Marked Publications

Open Data IST Research Explorer

Search this title in

Google Scholar