research
          
      
      ∙
      01/28/2022
    Interplay between depth of neural networks and locality of target functions
It has been recognized that heavily overparameterized deep neural networ...
          
            research
          
      
      ∙
      05/20/2021
    Logarithmic landscape and power-law escape rate of SGD
Stochastic gradient descent (SGD) undergoes complicated multiplicative n...
          
            research
          
      
      ∙
      02/10/2021
    On Minibatch Noise: Discrete-Time SGD, Overparametrization, and Bayes
The noise in stochastic gradient descent (SGD), caused by minibatch samp...
          
            research
          
      
      ∙
      09/28/2020
    Improved generalization by noise enhancement
Recent studies have demonstrated that noise in stochastic gradient desce...
          
            research
          
      
      ∙
      05/26/2020