首页> 外文期刊>Journal of digital information management >Sina Weibo Incident Monitor and Chinese Disaster Microblogging Classification
【24h】

Sina Weibo Incident Monitor and Chinese Disaster Microblogging Classification

机译:新浪微博事件监控器与中国灾难微博分类

获取原文
获取原文并翻译 | 示例
           

摘要

This paper describes the initial work on developing an all-hazards emergency event detector using messages obtained in near-real-time from the public timeline of the Chinese Sina Weibo microblogging service. The system filters target keywords corresponding to emergency events of earthquakes, floods, typhoons, fires and storms and then uses classifiers to identify messages from people experiencing the corresponding emergency event. Then, this study carried out experiments that compare the performance of four different classification methods and also explore to improve the classifier by the new training data captured by SWIM recently. After Chinese text pre-processing, feature selection and training set size, the experimental results demonstrate Random forests classifier could get best performance but need more long time to run in R, thus the potential to improve this classifier for setting up the SWIM system need to be explored in the future. While similar work has been reported using Twitter content, this is the first time these techniques have been applied to the Sina Weibo microblogging service for multiple emergency event types. This paper also outline the experience of accessing Sina Weibo messages, provide a summary of their structure and content, note the challenges faced in processing this text using Natural Language Processing packages and outline the developed website for users to view the processed messages. The long term aim is to develop a general emergency notification and monitoring system for various disaster event types in China reported by the public on Sina Weibo which can be used by the appropriate emergency services as a source of improved situational awareness.
机译:本文介绍了使用从中国新浪微博服务的公共时间线近实时获取的消息开发全危害紧急事件检测器的初步工作。该系统过滤与地震,洪水,台风,火灾和暴风雨等紧急事件相对应的目标关键字,然后使用分类器从经历相应紧急事件的人员中识别消息。然后,本研究进行了比较四种不同分类方法性能的实验,并尝试通过SWIM最近捕获的新训练数据来改进分类器。经过中文文本预处理,特征选择和训练集大小后,实验结果表明随机森林分类器可以获得最佳性能,但需要更长的时间才能在R中运行,因此有必要改进该分类器以建立SWIM系统。在将来被探索。尽管使用Twitter内容报道了类似的工作,但这是首次将这些技术应用于多种紧急事件类型的新浪微博服务。本文还概述了访问新浪微博消息的经验,提供了它们的结构和内容的摘要,指出了使用自然语言处理程序包处理文本时面临的挑战,并概述了供用户查看已处理消息的网站。长期目标是为公众在新浪微博上报道的中国各种灾害事件类型开发通用的紧急通知和监视系统,该系统可以由适当的紧急服务部门用作提高态势意识的来源。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号