Identifying Generalization Properties in Neural Networks
Huan Wang · #research
It has been empirically observed that different local optima obtained from training deep neural networks do not generalize equally well to unseen data, even when they achieve the same training loss.