research ∙ 09/09/2020
    Improved Exploration in Factored Average-Reward MDPs
We consider a regret minimization task under the average-reward criterio...
          
research ∙ 04/20/2020
    Tightening Exploration in Upper Confidence Reinforcement Learning
The upper confidence reinforcement learning (UCRL2) strategy introduced ...
          
research ∙ 10/09/2019
    Model-Based Reinforcement Learning Exploiting State-Action Equivalence
Leveraging an equivalence property in the state-space of a Markov Decisi...
          
research ∙ 03/05/2018