research
          
      
      ∙
      05/31/2021
    A unified view of likelihood ratio and reparameterization gradients
Reparameterization (RP) and likelihood ratio (LR) gradient estimators ar...
          
            research
          
      
      ∙
      10/14/2019
    A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme
Reparameterization (RP) and likelihood ratio (LR) gradient estimators ar...
          
            research
          
      
      ∙
      02/05/2019
    Total stochastic gradient algorithms and applications in reinforcement learning
Backpropagation and the chain rule of derivatives have been prominent; h...
          
            research
          
      
      ∙
      02/04/2019