research ∙ 02/23/2023
Targeted Search Control in AlphaZero for Effective Policy Improvement
AlphaZero is a self-play reinforcement learning algorithm that achieves ...
          
research ∙ 12/19/2019