...
首页> 外文期刊>Journal of intelligent & fuzzy systems: Applications in Engineering and Technology >A gating context-aware text classification model with BERT and graph convolutional networks
【24h】

A gating context-aware text classification model with BERT and graph convolutional networks

机译:具有BERT和图形卷积网络的Gating上下文感知文本分类模型

获取原文
获取原文并翻译 | 示例
           

摘要

Graph convolutional networks (GCNs), which are capable of effectively processing graph-structural data, have been successfully applied in text classification task. Existing studies on GCN based text classification model largely concerns with the utilization of word co-occurrence and Term Frequency-Inverse Document Frequency (TF-IDF) information for graph construction, which to some extent ignore the context information of the texts. To solve this problem, we propose a gating context-aware text classification model with Bidirectional Encoder Representations from Transformers (BERT) and graph convolutional network, named as Gating Context GCN (GC-GCN). More specifically, we integrate the graph embedding with BERT embedding by using a GCN with gating mechanism to enable the acquisition of context coding. We carry out text classification experiments to show the effectiveness of the proposed model. Experimental results shown our model has respectively obtained 0.19%, 0.57%, 1.05% and 1.17% improvements over the Text-GCN baseline on the 20NG, R8, R52, and Ohsumed benchmark datasets. Furthermore, to overcome the problem that word co-occurrence and TF-IDF are not suitable for graph construction for short texts, Euclidean distance is used to combine with word co-occurrence and TF-IDF information. We obtain an improvement by 1.38% on the MR dataset compared to Text-GCN baseline.
机译:图卷积网络(GCN)能够有效地处理图结构数据,已成功地应用于文本分类任务中。现有的基于GCN的文本分类模型的研究主要关注于利用单词共现和术语频率逆文档频率(TF-IDF)信息构建图形,这在一定程度上忽略了文本的上下文信息。为了解决这个问题,我们提出了一个基于双向编码器表示的门控上下文感知文本分类模型,称为门控上下文GCN(GC-GCN)。更具体地说,我们通过使用带有选通机制的GCN,将图嵌入与伯特嵌入结合起来,以实现上下文编码的获取。我们进行了文本分类实验,证明了该模型的有效性。实验结果表明,在20NG、R8、R52和Ohsumed基准数据集上,我们的模型比文本GCN基线分别提高了0.19%、0.57%、1.05%和1.17%。此外,为了克服单词共现和TF-IDF不适合短文本的图形构造问题,使用欧几里德距离将单词共现和TF-IDF信息结合起来。与文本GCN基线相比,我们在MR数据集上获得了1.38%的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号