Conference: International Conference on Advanced Computer Science and Information Systems

A corpus-based lexicon building in Indonesian political context through Indonesian online news media



Abstract

Taking public opinion into account has always been a necessity for most people, including governments and politicians. Such information provides a more direct means of determining public views, which are important for their decision-making process. With today's technology and the Internet, people can assess public opinion by using opinion mining or sentiment analysis. Several methods are known for this task; one is the lexicon-based method, which derives from the sentiment classification approach. This method uses a lexicon to determine the sentiment toward a particular object within related data sets. This paper concentrates solely on building the lexicon for that method. Focusing on Indonesian politics, we create a corpus-based approach to building a contextual lexicon that uses news articles as corpora. We determine the initial seed words and have them validated by domain experts for our experiment. Based on the tests we have carried out, we find that 51.79 percent of the terms in our lexicon are relevant to our research domain. We use this finding to evaluate and improve our method as we continue the research to obtain more relevant results.
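
The abstract does not say how the lexicon is expanded from the seed words, so the following is only a minimal sketch of one common corpus-based technique: scoring candidate terms by how often they co-occur with polarity-labelled seeds inside a small token window. The function names (`expand_lexicon`, `relevance_ratio`), the `window` and `min_count` parameters, the toy headlines, and the seed polarities are all illustrative assumptions, not the authors' method or data. The second function shows how a relevance share such as the reported 51.79 percent could be computed once domain experts have marked which terms are relevant.

```python
from collections import Counter


def expand_lexicon(documents, seeds, window=2, min_count=2):
    """Score non-seed terms by co-occurrence with polarity-labelled seeds.

    documents: list of token lists (assumed already tokenized).
    seeds: dict mapping seed word -> +1 (positive) or -1 (negative).
    Returns a dict mapping term -> score in [-1.0, 1.0].
    NOTE: hypothetical scoring rule, not taken from the paper.
    """
    pos_hits, neg_hits = Counter(), Counter()
    for tokens in documents:
        for i, token in enumerate(tokens):
            polarity = seeds.get(token)
            if polarity is None:
                continue  # only seed words open a context window
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                neighbour = tokens[j]
                if j == i or neighbour in seeds:
                    continue  # skip the seed itself and other seeds
                if polarity > 0:
                    pos_hits[neighbour] += 1
                else:
                    neg_hits[neighbour] += 1

    lexicon = {}
    for term in set(pos_hits) | set(neg_hits):
        total = pos_hits[term] + neg_hits[term]
        if total >= min_count:  # drop terms seen too rarely near seeds
            lexicon[term] = (pos_hits[term] - neg_hits[term]) / total
    return lexicon


def relevance_ratio(lexicon, relevant_terms):
    """Share of lexicon terms that experts marked as domain-relevant."""
    if not lexicon:
        return 0.0
    return sum(1 for term in lexicon if term in relevant_terms) / len(lexicon)


# Toy, made-up headlines and seeds, for illustration only.
documents = [
    ["menteri", "terlibat", "korupsi", "anggaran"],
    ["gubernur", "berhasil", "membangun", "jalan"],
    ["pejabat", "terlibat", "korupsi", "proyek"],
    ["presiden", "berhasil", "membangun", "sekolah"],
]
seeds = {"korupsi": -1, "berhasil": +1}

lexicon = expand_lexicon(documents, seeds)
ratio = relevance_ratio(lexicon, relevant_terms={"terlibat"})
```

In this sketch a term enters the lexicon only if it appears near seed words at least `min_count` times, which trades coverage for less noise; a real pipeline would also need tokenization, stop-word removal, and stemming suited to Indonesian.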
