Margins, perceptrons, and deep networks

This talk surveys the role of margins in the analysis of deep networks. As a concrete highlight, it sketches a perceptron-based analysis establishing that shallow ReLU networks can achieve small test error even when they are quite narrow, sometimes even logarithmic in the sample size and inverse target error. The analysis and bounds depend on a certain nonlinear margin quantity due to Nitanda and Suzuki, and can lead to tight upper and lower sample complexity bounds. Joint work with Ziwei Ji.

Date

March 26, 2020

Speakers

Matus Telgarsky

Affiliation

University of Illinois

School of Mathematics

Seminar on Theoretical Machine Learning