首页> 外文会议>Annual Computing and Communication Workshop and Conference >Analyzing Stance and Topic of E-Cigarette Conversations on Twitter: Case Study in Indonesia
【24h】

Analyzing Stance and Topic of E-Cigarette Conversations on Twitter: Case Study in Indonesia

机译:在推特上分析电子烟对话的立场和主题:印度尼西亚的案例研究

获取原文

摘要

To control the use of e-cigarette, Indonesia plan to establish a regulation that embodies all the concerns, sentiments, and opinions of public This study aims to identify public opinions in social media Twitter by classifying tweets into group in favor or against e-cigarette and explore dominant topics of each group. This research obtained 15,373 tweets between June 2019 – May 2020 that is classified into 4 labels: Against, Favor, Neutral, and Irrelevant. The best model was selected with specification: 3 features (Count, Unigram, and Bigram), Logistic Regression algorithm, and three-stage classification pipeline ($mathrm{F}1-ext{score}=0.807$). As for topic modelling, corpus Against and Favor are used to retrieve dominant topics. We chose Non-negative Matrix Factorization algorithm with $mathrm{k}=6$ and achieve high coherence scores, which are 0.962004 for corpus Against and 0.999736 for corpus Favor.
机译:为了控制使用电子香烟,印尼计划建立一个法规,体现了一切顾虑,情绪和公共这项研究的目的的意见,通过微博进行分类识别社交媒体Twitter的民意成组赞成或反对电子烟探索各组的主导议题。这项研究获得的15373个鸣叫2019六月 - 五月2020分为4个标签:反对,赞成,中立和不相关的。最佳模型用规格选自:3个特性(计数,单字组和两字组),Logistic回归算法,和三个阶段分类管道( $ mathrm {F} 1 - {文本得分} = 0.807 $ )。至于主题建模,胼反对和青睐用于检索主导话题。我们选择非负矩阵分解算法 $ mathrm {K} = 6 $ 和实现高相干性的分数,这是0.962004用于语料库靠并0.999736为语料库青睐。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号