首页> 外文会议>Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge
【24h】

Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge

机译:吹狗哨子:中国数据集以常识和世界知识无法理解

获取原文

摘要

Cant is important for understanding advertising, comedies and dog-whistle politics. However, computational research on cant is hindered by a lack of available datasets. In this paper, we propose a large and diverse Chinese dataset for creating and understanding cant from a computational linguistics perspective. We formulate a task for cant understanding and provide both quantitative and qualitative analysis for tested word embedding similarity and pretrained language models. Experiments suggest that such a task requires deep language understanding, common sense, and world knowledge and thus can be a good testbed for pretrained language models and help models perform better on other tasks.
机译:对理解广告,喜剧和狗哨政治非常重要。 然而,对缺乏可用的数据集来阻碍对无法阻碍的计算研究。 在本文中,我们提出了一个大型和多样化的中国数据集,用于从计算语言学视角创建和解无法创建和解。 我们为无法理解的任务制定,并为测试的单词嵌入相似性和预用语言模型提供定量和定性分析。 实验表明,这样的任务需要深入的语言理解,常识和世界知识,因此可以是预训练语言模型的好的测试平台,并帮助模型在其他任务中更好地表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号