Kavli Affiliate: Jing Wang | First 5 Authors: Devansh Bisla, Jing Wang, Anna Choromanska, , | Summary: In this paper, we study the sharpness of a deep learning (DL) loss landscape around local minima in order to reveal systematic mechanisms underlying the generalization abilities of DL models. Our analysis is performed across varying network and […]
Continue.. Low-Pass Filtering SGD for Recovering Flat Optima in the Deep Learning Optimization Landscape