首页> 外文会议>International Conference on ICT for Smart Society >Twitter Information Extraction for Smart City
【24h】

Twitter Information Extraction for Smart City

机译:Twitter智慧城市信息提取

获取原文

摘要

In Indonesia, Bandung is the second most active Twitter user, which means a lot of tweets have been shared among Bandung people on Twitter. Tweets can be used as a data source to explore information related to the city. One example is information related to traffic congestion, such as information of location, date, and time when the traffic congestion happened. In this study, we proposed a method to filter the tweets related to traffic congestion in Bandung and to extract the information of location, time, date and image (if any). SVM with several variations of the weighting and the selection of features is used for filtering process. The results showed that the greatest accuracy rate is 83% use Binary weighting method in top-2000 features. Meanwhile, information extraction process carried out by a rule-based approach, gave satisfactory results, around 98%-100% for the extraction of date, time and URL. However, the extraction of location information only gave accuracy of about 62%. It was caused by OOV (Out of Vocabulary) and OOR (Out of Rules).
机译:在印度尼西亚,万隆是第二大活跃的Twitter用户,这意味着万隆人在Twitter上分享了很多推文。推文可用作数据源,以探索与城市有关的信息。一个示例是与交通拥堵有关的信息,例如交通拥堵发生的位置,日期和时间的信息。在这项研究中,我们提出了一种方法来过滤与万隆市交通拥堵相关的推文,并提取位置,时间,日期和图像(如果有)的信息。具有权重和特征选择的几种变体的SVM用于过滤过程。结果表明,在前2000个特征中,使用Binary加权方法的最大准确率是83%。同时,通过基于规则的方法进行的信息提取过程给出了令人满意的结果,大约98%-100%的日期,时间和URL的提取。但是,位置信息的提取仅给出约62%的准确性。它是由OOV(词汇量超出)和OOR(规则外)引起的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号