首页> 外文会议>Principles of data mining and knowledge discovery >Mining text archives: creating readable maps to structure and describe document collections
【24h】

Mining text archives: creating readable maps to structure and describe document collections

机译:挖掘文本档案:创建可读的地图以构造和描述文档集合

获取原文
获取原文并翻译 | 示例

摘要

With the ever-growing amount of unstrutured textual data on the web,mining these text collections is of increasing importance ofr the understanding of document archives.Particularly the self-organizing map has shown to be very well suited for this task.however,the interpretation of the resulting document maps still requires a tremendous effort,especially as far as the analysis of the features learned and the characteristics of identified text clusters are concerned learned and the characteristics of identified text clusters are concerned.In this paper we present the LabelSOM method which,based on the features learned by the map,automatically assigns a set of keywords to the units of the map to describe the concepts of the underlying text clusters,thus making the characteristics of the various topical areas on the map explicit.
机译:随着网络上非结构化文本数据的不断增长,挖掘这些文本集合对于理解文档档案的重要性越来越高。特别是,自组织图已非常适合此任务。生成的文档图谱仍然需要付出巨大的努力,尤其是在学习学习的特征和识别的文本簇的特征以及关注识别的文本簇的特征方面。本文提出了LabelSOM方法,根据地图学习到的特征,自动将一组关键字分配给地图的各个单元,以描述基础文本集群的概念,从而使地图上各个主题区域的特征都明确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号