research ∙ 06/08/2023
    Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS
Existing sentence textual similarity benchmark datasets only use a singl...
          
research ∙ 05/03/2023
    Can Large Language Models Be an Alternative to Human Evaluations?
Human evaluation is indispensable and inevitable for assessing the quali...
          
research ∙ 10/06/2022
    How Far Are We from Real Synonym Substitution Attacks?
In this paper, we explore the following question: how far are we from re...
          
research ∙ 04/10/2022
    Re-Examining Human Annotations for Interpretable NLP
Explanation methods in Interpretable NLP often explain the model's decis...
          
research ∙ 04/09/2022
    Understanding, Detecting, and Separating Out-of-Distribution Samples and Adversarial Samples in Text Classification
In this paper, we study the differences and commonalities between statis...
          
research ∙ 09/08/2021
    On the Transferability of Pre-trained Language Models: A Study from Artificial Datasets
Pre-training language models (LMs) on large-scale unlabeled text data ma...
          
research ∙ 12/22/2020