research ∙ 01/30/2023
    A Novel Framework for Policy Mirror Descent with General Parametrization and Linear Convergence
Modern policy optimization methods in applied reinforcement learning, su...
          
research ∙ 09/30/2022
    Linear Convergence for Natural Policy Gradient with Log-linear Policy Parametrization
We analyze the convergence rate of the unregularized natural policy grad...
          
research ∙ 09/23/2021