首页> 外文期刊>Information Processing & Management >Deep sentiments in Roman Urdu text using Recurrent Convolutional Neural Network model
【24h】

Deep sentiments in Roman Urdu text using Recurrent Convolutional Neural Network model

机译:使用反复卷积神经网络模型的罗马乌尔都语文本的深刻情绪

获取原文
获取原文并翻译 | 示例
           

摘要

Although over 64 million people worldwide speak Urdu language and are well aware of its Roman script, limited research and efforts have been made to carry out sentiment analysis and build language resources for the Roman Urdu language. This article proposes a deep learning model to mine the emotions and attitudes of people expressed in Roman Urdu - consisting of 10,021 sentences from 566 online threads belonging to the following genres: Sports; Software; Food & Recipes; Drama; and Politics. The objectives of this research are twofold: (1) to develop a human-annotated benchmark corpus for the under-resourced Roman Urdu language for the sentiment analysis; and (2) to evaluate sentiment analysis techniques using the Rule-based, N-gram, and Recurrent Convolutional Neural Network (RCNN) models. Using Corpus, annotated by three experts to be positive, negative, and neutral with 0.557 Cohen's Kappa score, we run two sets of tests, i.e., binary classification (positive and negative) and tertiary classification (positive, negative and neutral). Finally, the results of the RCNN model are analyzed by comparing it with the outcome of the Rule-based and N-gram models. We show that the RCNN model outperforms baseline models in terms of accuracy of 0.652 for binary classification and 0.572 for tertiary classification.
机译:虽然全世界超过6400万人说乌尔都语语言,并充分了解其罗马剧本,有限的研究和努力,为罗马乌尔都语进行了情感分析和建立语言资源。本文提出了深入的学习模式来挖掘罗马乌尔都人的情感和态度 - 由属于以下类型的566个在线线程组成的10,021个句子:体育;软件;食物与食谱;戏剧;和政治。这项研究的目标是双重的:(1)为出于情绪分析开发用于资源罗马乌尔都语语言的人类注释的基准语料; (2)使用基于规则的N-GRAM和经常性卷积神经网络(RCNN)模型来评估情绪分析技术。使用语料库,由三位专家注释为正,负,中性,与0.557 Cohen的Kappa评分,我们运行两组测试,即二进制分类(正负)和三级分类(正,负离子和中性)。最后,通过将其与规则的基于N-GRAM模型的结果进行比较来分析RCNN模型的结果。我们表明RCNN模型在二进制分类的精度为0.652的准确性方面优于基线模型和第0.572次进行三级分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号