首页> 外文期刊>VINE >News analysis through text mining: a case study
【24h】

News analysis through text mining: a case study

机译:通过文本挖掘进行新闻分析:案例研究

获取原文
获取原文并翻译 | 示例
       

摘要

Purpose – This paper seeks to provide a tangible example of the use of text-mining techniques in a real world setting, i.e. using real, as opposed to test, data. Design/methodology/approach – News stories are modeled using the vector space model, with the similarity between documents quantified using the cosine measure. For data analysis, three clustering algorithms are used, and the results from the best-performing algorithm retained. Findings – Agglomerative clustering performed poorly, while direct k-way clustering and k-way clustering through repeated bisections yielded similar results, with the former performing marginally better in terms of external isolation and internal cohesion of the clusters produced. A number of themes that dominated news coverage during the period under consideration were identified, some of which were noticeably only topical during certain parts of the year.
机译:目的–本文力图提供在实际环境中使用文本挖掘技术的具体示例,即使用真实数据而不是测试数据。设计/方法/方法–新闻故事使用向量空间模型建模,文档之间的相似性使用余弦度量进行量化。对于数据分析,使用了三种聚类算法,并且保留了性能最佳的算法的结果。研究结果–聚集聚类的表现不佳,而直接k-way聚类和通过重复二等分的k-way聚类产生了相似的结果,其中前者在所产生的类的外部隔离和内部凝聚方面表现略佳。在所讨论的时期内,确定了一些主导新闻报道的主题,其中一些主题仅在一年中的某些时期才是热门话题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号