ReLU Neural Networks with Linear Layers are Biased Towards Single- and Multi-Index Models
Published on arXiv, 2023
Connects the representation cost of neural networks with one ReLU layer and many linear layers to the spectrum of the expected gradient outer product (EGOP) matrix, showing that this architecture is biased towards single- and multi-index models.
Joint work with Greg Ongie and Rebecca Willett.
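For readers unfamiliar with the EGOP, the sketch below may help. It is not code from the paper. It assumes the standard definition, the expectation of the outer product of the gradient of f with itself, and estimates it by Monte Carlo for a hypothetical single-index function f(x) = tanh(w · x). The names `egop`, `grad_f`, and `w`, the choice of tanh, and the Gaussian inputs are all illustrative assumptions.

```python
import numpy as np


def egop(grad_f, X):
    """Monte Carlo estimate of E[grad f(x) grad f(x)^T] over the rows of X."""
    G = np.stack([grad_f(x) for x in X])  # shape (n_samples, d)
    return G.T @ G / len(X)


rng = np.random.default_rng(0)
d = 5

# Hypothetical index direction (unit norm), chosen only for illustration.
w = rng.standard_normal(d)
w /= np.linalg.norm(w)


def grad_f(x):
    # Gradient of the assumed single-index model f(x) = tanh(w . x).
    return (1.0 - np.tanh(w @ x) ** 2) * w


X = rng.standard_normal((2000, d))       # assumed Gaussian inputs
M = egop(grad_f, X)
eigvals = np.linalg.eigvalsh(M)[::-1]    # eigenvalues, largest first
print(eigvals)
```

Because every gradient of a single-index function is a multiple of `w`, the estimated EGOP has one non-negligible eigenvalue. For a multi-index function that depends on x through r directions, the same argument gives rank at most r. This low-rank spectrum is what the summary above refers to.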