research ∙ 07/28/2020
Stochastic Normalized Gradient Descent with Momentum for Large Batch Training
Stochastic gradient descent (SGD) and its variants have been the dominat...
          
research ∙ 02/26/2020
Stagewise Enlargement of Batch Size for SGD-based Learning
Existing research shows that the batch size can seriously affect the per...
          
research ∙ 05/30/2019