首页> 外文会议>International Conference on Data and Software Engineering >Public facilities recommendation system based on structured and unstructured data extraction from multi-channel data sources
【24h】

Public facilities recommendation system based on structured and unstructured data extraction from multi-channel data sources

机译:公共设施推荐系统基于来自多通道数据源的结构化和非结构化数据提取

获取原文

摘要

Nowadays social media data has grown very rapidly by producing a huge amount and variety of data everyday. Those data can be analyzed and processed to deliver useful information especially for public needs. However, most of the data available in social media are unstructured. This paper proposes a recommendation system for public facilities by utilizing both structured and unstructured data gathered from multi-channel data sources. The system uses single-criteria rating, multi-criteria-rating, and text data as the inputs. The challenge is how to handle data variety such that any kind of data from any channel can be integrated. The second challenge is how to extract location-related data from the raw data. There are four data channels used in the system. Three of them are social media channels, i.e. Twitter, Instagram, and Foursquare, while the other is internal data channel built as a part of the system itself. The system deals with three categories of public facility, i.e. park, hospital, and mosque. The whole system consists of two sub systems, i.e. the extractor system including the rating input module and the recommendation system. The recommendation system is implemented as end-user mobile application such that the users are able to use it anytime and anywhere. The system successfully integrate data from different social media channels and in different format to provide users with useful information concerning public facilities in the form of recommendation (rating) and popularity of the facilities. The experiment has shown that above 90% of the data collected from the social media contains location-related information that is useful for further processing. The system has been tested using usability test, and it obtained an average users score 3.9 on a scale of 1 to 5.
机译:如今,社交媒体数据通过每天产生大量数据以及各种数据来增长非常迅速。可以分析和处理这些数据,以便为公共需求提供有用的信息。但是,社交媒体中可用的大多数数据都是非结构化的。本文提出了通过利用来自多通道数据源收集的结构化和非结构化数据来提出公共设施的推荐系统。系统使用单标准评级,多标准评级和文本数据作为输入。挑战是如何处理数据变化,以便可以集成任何来自任何频道的任何类型的数据。第二个挑战是如何从原始数据中提取与位置相关的数据。系统中使用了四个数据通道。其中三个是社交媒体频道,即推特,Instagram和Foursquare,而另一个是作为系统本身的一部分构建的内部数据通道。该系统涉及三类公共设施,即公园,医院和清真寺。整个系统由两个子系统组成,即提取器系统,包括评级输入模块和推荐系统。推荐系统实现为最终用户移动应用程序,使得用户可以随时随地使用它。该系统成功将数据从不同的社交媒体渠道和不同的格式集成,以向用户提供有关建议(评级)和设施人气的公共设施的有用信息。实验表明,从社交媒体收集的90%以上包含与进一步处理有用的位置相关信息。系统已经使用可用性测试进行了测试,并且它获得的平均用户在1到5的等级中得分3.9。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号