首页> 外文会议>IEEE International Conference on Artificial Intelligence and Computer Applications >An Ensemble Classification Algorithm for Text Data Stream based on Feature Selection and Topic Model
【24h】

An Ensemble Classification Algorithm for Text Data Stream based on Feature Selection and Topic Model

机译:基于特征选择和主题模型的文本数据流集成分类算法

获取原文

摘要

How to mine valuable information that users are interested in from a continuous text data stream, text data stream classification has received widespread attention as a core technology to solve the problem. This paper proposes a text data stream ensemble classification algorithm that combines feature selection and topic model. Firstly, the mutual information feature selection method is used to remove features that are not related to classification. Secondly, the LDA topic model is used to establish the document-topic distribution. Finally, the pre-processed text data stream is classified by an ensemble classification model. The experimental results show that the proposed text data stream ensemble classification algorithm can improve the classification performance of text data stream.
机译:如何从连续的文本数据流中挖掘用户感兴趣的有价值的信息,文本数据流分类作为解决该问题的核心技术已受到广泛关注。提出了一种结合特征选择和主题模型的文本数据流集成分类算法。首先,互信息特征选择方法用于去除与分类无关的特征。其次,使用LDA主题模型建立文档主题分布。最后,预处理后的文本数据流通过集成分类模型进行分类。实验结果表明,本文提出的文本数据流集成分类算法可以提高文本数据流的分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号