首页> 外文期刊>Knowledge-Based Systems >Sparse attention based separable dilated convolutional neural network for targeted sentiment analysis
【24h】

Sparse attention based separable dilated convolutional neural network for targeted sentiment analysis

机译:基于稀疏注意力的可分离式卷积神经网络用于目标情感分析

获取原文
获取原文并翻译 | 示例
       

摘要

Long short-term memory networks (LSTM) and classical convolutional neural networks (CNN) are two critical methods for the task of targeted sentiment analysis, but LSTM are difficult to parallelize and time-inefficient, and classical CNN can only capture local semantic features. To this end, this paper first proposes a sparse attention based separable dilated convolutional neural network (SA-SDCCN), which consists of multichannel embedding layer, separable dilated convolution module, sparse attention layer, and output layer. Specifically, our work is mainly concentrated on the first three parts. In multichannel embedding layer, semantic and sentiment embeddings are incorporated into an embedding tensor, which builds richer representations over the input sequence. In separable dilated convolution module, long-range contextual semantic information is explored and multi-scale contextual semantic dependencies are aggregated simultaneously through diverse dilation rates. Moreover, the separable structure further reduces the model parameters. In sparse attention layer, sentiment-oriented components are noticed according to the features of specific target entity. Finally, some experiments on three benchmark datasets demonstrate that SA-SDCCN achieves comparable or even better performance than state-of-the-art methods in terms of higher parallelism and lower computational cost. (C) 2019 Elsevier B.V. All rights reserved.
机译:长短期记忆网络(LSTM)和经典卷积神经网络(CNN)是目标情感分析任务的两种关键方法,但LSTM难以并行化且时间效率低,而经典CNN只能捕获局部语义特征。为此,本文首先提出了一种基于稀疏注意的可分离扩张卷积神经网络(SA-SDCCN),该网络由多通道嵌入层,可分离扩张的卷积模块,稀疏注意层和输出层组成。具体来说,我们的工作主要集中在前三个部分。在多通道嵌入层中,将语义和情感嵌入合并到嵌入张量中,从而在输入序列上构建更丰富的表示。在可分离的扩展卷积模块中,探索远程上下文语义信息,并通过不同的扩展速率同时聚合多尺度上下文语义依赖性。而且,可分离结构进一步减小了模型参数。在稀疏关注层中,根据特定目标实体的特征注意到了面向情感的组件。最后,在三个基准数据集上进行的一些实验表明,SA-SDCCN在更高的并行性和更低的计算成本方面,可以达到与最新技术相当甚至更好的性能。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号