首页> 外文期刊>Pattern recognition letters >Classification of text documents based on score level fusion approach
【24h】

Classification of text documents based on score level fusion approach

机译:基于分数层次融合方法的文本文档分类

获取原文
获取原文并翻译 | 示例
           

摘要

Text document classification is a well known theme in the field of the information retrieval and text mining. Selection of most desired features in the text document plays a vital role in classification problem. This research article addresses the problem of text classification by considering Sentence-Vector Space Model (S-VSM) and Unigram representation models for the text document. An enhanced S-VSM model will be considered for the constructive representation of text documents. A neural network based representation for text documents is proposed for effective capturing of semantic information of the text data. Two different classifiers are designed based on the two different representation models of the text documents. Score level fusion is applied on two proposed models to find out the overall accuracy of the proposed model. Key contributions of the paper are an enhanced S-VSM model, an interval valued representation model for the proposed S-VSM approach. A word level representation model for semantic information preserving of the text document and score level fusion approach. (C) 2017 Elsevier B.V. All rights reserved.
机译:文本文档分类是信息检索和文本挖掘领域中众所周知的主题。文本文档中最需要的功能的选择在分类问题中起着至关重要的作用。本文通过考虑文本文件的句子向量空间模型(S-VSM)和Unigram表示模型来解决文本分类问题。将考虑使用增强的S-VSM模型来构造文本文档。提出了一种基于神经网络的文本文档表示方法,以有效地捕获文本数据的语义信息。基于文本文档的两种不同表示模型,设计了两种不同的分类器。将分数水平融合应用于两个提议的模型,以找出提议的模型的整体准确性。本文的主要贡献是增强的S-VSM模型,这是所提出的S-VSM方法的区间值表示模型。一种用于文本文档语义信息保存的词级表示模型和分数级别融合方法。 (C)2017 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号