首页> 外文会议>International Conference on Progress in Informatics and Computing >A Method for Discovering and Obtaining Company Hot Events from Internet News
【24h】

A Method for Discovering and Obtaining Company Hot Events from Internet News

机译:一种从互联网新闻中发现和获取公司热点事件的方法

获取原文

摘要

With the rapid development and popularization, Internet is becoming the most convenient way to publish and obtain information, which causes an extremely increasing quantity and variety of data. It is difficult to find out potentially valuable information from these data, which is the primary problem of data mining. Mining company hot events from Internet news can effectively reflect how its business works. Thus, we propose a method for discovering and obtaining hot events from Internet news. In the proposed method, we use Gaussian kernel to update clustering center instead of global cluster to modify Single-Pass clustering algorithm. It is a dynamic incremental clustering algorithm which does not need to initialize the number of clusters. Then, Top-N hot events can be obtained through the clustering centers. Experimental comparison shows that the improved algorithm has higher clustering efficiency than the classic algorithm. Case studies from Shanghai pilot free-trade zone (FTZ) also show the effectiveness of our proposed method.
机译:随着快速发展和普及,Internet成为发布和获取信息的最便捷方式,这导致数据量和种类的急剧增加。很难从这些数据中找到潜在有价值的信息,这是数据挖掘的主要问题。互联网新闻中的矿业公司热点事件可以有效地反映其业务运作方式。因此,我们提出了一种从互联网新闻中发现和获取热点事件的方法。在提出的方法中,我们使用高斯核来更新聚类中心而不是全局聚类来修改单遍聚类算法。它是一种动态增量聚类算法,不需要初始化聚类数。然后,可以通过聚类中心获得前N个热点事件。实验比较表明,改进算法比经典算法具有更高的聚类效率。上海自由贸易试验区(FTZ)的案例研究也显示了我们提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号