research
          
      
      ∙
      06/09/2021
    Communication-efficient SGD: From Local SGD to One-Shot Averaging
We consider speeding up stochastic gradient descent (SGD) by parallelizi...
          
            research
          
      
      ∙
      06/03/2020