Depth Separation in Norm-Bounded Infinite-Width Neural Networks

Published in 37th Annual Conference on Learning Theory (COLT), 2024

Establishes a separation in the representation cost and sample complexity needed to approximate functions with two vs. three layer neural networks.

Joint work with Greg Ongie, Rebecca Willett, Ohad Shamir, and Nati Srebro.

https://proceedings.mlr.press/v247/parkinson24a/parkinson24a.pdf