research
          
      
      ∙
      06/05/2020
    Logical Team Q-learning: An approach towards factored policies in cooperative MARL
We address the challenge of learning factored policies in cooperative MA...
          
            research
          
      
      ∙
      09/13/2019
    ISL: Optimal Policy Learning With Optimal Exploration-Exploitation Trade-Off
Traditionally, off-policy learning algorithms (such as Q-learning) and e...
          
            research
          
      
      ∙
      10/17/2018
    Multi-Agent Fully Decentralized Value Function Learning with Linear Convergence Rates
This work develops a fully decentralized multi-agent algorithm for polic...
          
            research
          
      
      ∙
      10/17/2018