research
          
      
      ∙
      01/21/2021
    Distilling Large Language Models into Tiny and Effective Students using pQRNN
Large pre-trained multilingual models like mBERT, XLM-R achieve state of...
          
            research
          
      
      ∙
      04/30/2012