research
          
      
      ∙
      01/26/2023
    Collaborative Regret Minimization in Multi-Armed Bandits
In this paper, we study the collaborative learning model, which concerns...
          
            research
          
      
      ∙
      08/18/2022
    Communication-Efficient Collaborative Best Arm Identification
We investigate top-m arm identification, a basic problem in bandit theor...
          
            research
          
      
      ∙
      07/16/2022
    Collaborative Best Arm Identification with Limited Communication on Non-IID Data
In this paper, we study the tradeoffs between time-speedup and the numbe...
          
            research
          
      
      ∙
      08/15/2021
    Batched Thompson Sampling for Multi-Armed Bandits
We study Thompson Sampling algorithms for stochastic multi-armed bandits...
          
            research
          
      
      ∙
      12/02/2020
    Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit
Motivated by real-world applications such as fast fashion retailing and ...
          
            research
          
      
      ∙
      04/20/2020
    Collaborative Top Distribution Identifications with Limited Interaction
We consider the following problem in this paper: given a set of n distri...
          
            research
          
      
      ∙
      03/13/2019