research
          
      
      ∙
      05/31/2023
    Mildly Overparameterized ReLU Networks Have a Favorable Loss Landscape
We study the loss landscape of two-layer mildly overparameterized ReLU n...
          
            research
          
      
      ∙
      01/17/2023
    Expected Gradients of Maxout Networks and Consequences to Parameter Initialization
We study the gradients of a maxout network with respect to inputs and pa...
          
            research
          
      
      ∙
      07/01/2021