Identifying offensive information from tweets is a vital language processing task. This task concentrated more on English and other foreign languages these days. In this shared task on Offensive Language Identification in Dra-vidian Languages, in the First Workshop of Speech and Language Technologies for Dra-vidian Languages in EACL 2021, the aim is to identify offensive content from code mixed Dravidian Languages Kannada, Malay-alam, and Tamil. Our team used language-agnostic BERT (Bidirectional Encoder Representation from Transformers) for sentence embedding and a Softmax classifier. The language-agnostic representation based classification helped obtain good performance for all the three languages, out of which results for the Malayalam language are good enough to obtain a third position among the participating teams.
展开▼