CUSATNLP@DravidianLangTech-EACL2021: Language Agnostic Classification of Offensive Content in Tweets

机译：Cusatnlp @ Dravidianlangtech-EACL2021：推文中的语言可靠分类进攻内容

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Identifying offensive information from tweets is a vital language processing task. This task concentrated more on English and other foreign languages these days. In this shared task on Offensive Language Identification in Dra-vidian Languages, in the First Workshop of Speech and Language Technologies for Dra-vidian Languages in EACL 2021, the aim is to identify offensive content from code mixed Dravidian Languages Kannada, Malay-alam, and Tamil. Our team used language-agnostic BERT (Bidirectional Encoder Representation from Transformers) for sentence embedding and a Softmax classifier. The language-agnostic representation based classification helped obtain good performance for all the three languages, out of which results for the Malayalam language are good enough to obtain a third position among the participating teams.

机译：识别推文中的攻击信息是一个重要的语言处理任务。这项任务这些目前更多地集中在英语和其他外语上。在这项共同任务中，关于DRA-Vidian语言的攻击性语言识别，在EACL 2021中的DRA-Vidian语言的第一个演讲和语言技术研讨会中，目的是从CODE MADIC DRAVIDIAN语言Kannada，Malay-Alam识别攻击内容和泰米尔。我们的团队使用语言 - 不可忽视的BERT（来自变压器的双向编码器表示）用于句子嵌入和软MAX分类器。基于语言无神不可知的分类有助于获得所有三种语言的良好性能，其中MALAYALAM语言的结果足以获得参与团队中的第三位。

著录项

来源
《Workshop on Speech and Language Technologies for Dravidian Languages》|2021年|236-242|共7页
会议地点
作者
Sara Renjit; Sumam Mary Idicula;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Hate Speech Classification in Indonesian Language Tweets by Using Convolutional Neural Network [J] . Dewa Ayu Nadia Taradhita, I Ketut Gede Darma Putra ITB Journal of Information and Communication Technology . 2021,第3期

机译：使用卷积神经网络讨厌印度尼西语语言推文的讲话分类
2. Sentiment Classification of Tweets with Non-Language Features [J] . Akilandeswari J, Jothi G Procedia Computer Science . 2018,第5期

机译：具有非语言功能的推文的情感分类
3. How well do hate speech, toxicity, abusive and offensive language classification models generalize across datasets? [J] . Paula Fortuna, Juan Soler-Company, Leo Wanner Information Processing & Management . 2021,第3期

机译：仇恨言语，毒性，滥用和令人反感的语言分类模型如何概括到数据集？
4. Did you offend me? Classification of Offensive Tweets in Hinglish Language [C] . Puneet Mathur, Ramit Sawhney, Meghna Ayyar, Second workshop on abusive language online 2018 . 2018

机译：你有冒犯我吗英语语言中的攻击性推文分类
5. Language Agnostic Model: Detecting Islamophobic Content on Social Media [D] . Khan, Heena. 2021

机译：语言无关型模型：检测社交媒体上的伊斯兰语含量
6. Suicide Note Classification Using Natural Language Processing: A Content Analysis [O] . John Pestian, Henry Nasrallah, Pawel Matykiewicz, -1

机译：自由语言处理的自杀式注意分类：内容分析
7. Large-Scale, Language-Agnostic Discourse Classification of Tweets During COVID-19 [O] . Oguzhan Gencoglu 2020

机译：Covid-19期间，大规模，语言 - 无话的话语分类推文分类

CUSATNLP@DravidianLangTech-EACL2021: Language Agnostic Classification of Offensive Content in Tweets

摘要

著录项

相似文献

相关主题

期刊订阅