首页> 外文会议>Seventh joint conference on lexical and computational semantics >Solving Feature Sparseness in Text Classification using Core-Periphery Decomposition
【24h】

Solving Feature Sparseness in Text Classification using Core-Periphery Decomposition

机译:使用核心外围分解解决文本分类中的特征稀疏性

获取原文
获取原文并翻译 | 示例

摘要

Feature sparseness is a problem common to cross-domain and short-text classification tasks. To overcome this feature sparseness problem, we propose a novel method based on graph decomposition to find candidate features for expanding feature vectors. Specifically, we first create a feature-rclatcdncss graph, which is subsequently decomposed into core-periphery (CP) pairs and use the peripheries as the expansion candidates of the cores. We expand both training and test instances using the computed related features and use them to train a text classifier. We observe that prioritising features that are common to both training and test instances as cores during the CP decomposition to further improve the accuracy of text classification. We evaluate the proposed CP-decomposition-based feature expansion method on benchmark datasets for cross-domain sentiment classification and short-text classification. Our experimental results show that the proposed method consistently outperforms all baselines on short-text classification tasks, and perform competitively with pivot-based cross-domain sentiment classification methods.
机译:特征稀疏是跨域和短文本分类任务常见的问题。为了解决该特征稀疏问题,我们提出了一种基于图分解的新方法,以找到用于扩展特征向量的候选特征。具体来说,我们首先创建一个feature-rclatcdncss图,随后将其分解为核心-外围(CP)对,并使用外围作为核心的扩展候选对象。我们使用计算出的相关功能来扩展训练和测试实例,并使用它们来训练文本分类器。我们观察到,在CP分解期间,将训练和测试实例所共有的优先功能作为核心,可以进一步提高文本分类的准确性。我们在基准数据集上针对跨域情感分类和短文本分类评估了基于CP分解的特征扩展方法。我们的实验结果表明,该方法在短文本分类任务上始终优于所有基线,并且与基于枢轴的跨域情感分类方法相比具有竞争优势。

著录项

  • 来源
  • 会议地点 New Orleans(US)
  • 作者单位

    Department of Computer Science, University of Liverpool Department of Engineering Mathematics, University of Bristol;

    Department of Computer Science, University of Liverpool Department of Engineering Mathematics, University of Bristol;

    Department of Computer Science, University of Liverpool Department of Engineering Mathematics, University of Bristol;

    Department of Computer Science, University of Liverpool Department of Engineering Mathematics, University of Bristol;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号