首页> 外文会议>IEEE International Conference on Semantic Computing >Poetry Generation Model via Deep learning incorporating Extended Phonetic and Semantic Embeddings
【24h】

Poetry Generation Model via Deep learning incorporating Extended Phonetic and Semantic Embeddings

机译:诗歌生成模型通过深入学习融入扩展语音和语义嵌入

获取原文

摘要

Arabic poetry generation is a very challenging task since the linguistic structure of the Arabic language is considered a severe challenge for many researchers and developers in the Natural Language Processing field. In this paper, we propose a poetry generation model with extended phonetic and semantic embeddings (Phonetic CNNsubword embeddings). The proposed approach consists of three stages: (1.) Word Embedding, (2.) Keywords Extraction and Expansion, and (3.) Poetry Generation stage. Our model is able to generate the first verse which explicitly incorporates the theme related phrase using the Backward and Forward Language Model (B/F-LM) with Gated Recurrent Unit (GRU) cell, and then generates other verses of the poem sequentially, where each verse is composed based on a keyword and all previous generated verses using our proposed Hierarchy-Attention Sequence-to-Sequence model (HASS). We show that Phonetic CNNsubword embeddings have an effective contribution to the overall model performance. The Keywords Extraction and Expansion stage can ensure that the generated poem is coherent and semantically consistent with the input query intent. A comprehensive human evaluation confirms that the poems generated by our model outperform the base models in criteria including Meaning, Coherence, Fluency, and Poeticness. Extensive quantitative experiments using Bi-Lingual Evaluation Understudy (BLEU) scores also demonstrate significant improvements over strong baselines.
机译:阿拉伯语诗歌一代是一个非常具有挑战性的任务,因为阿拉伯语的语言结构被认为是自然语言处理领域的许多研究人员和开发人员的严峻挑战。在本文中,我们提出了一种诗歌生成模型,具有扩展语音和语义嵌入式(语音CNNSubWord Embeddings)。所提出的方法包括三个阶段:(1.)字嵌入,(2.)关键词提取和扩展,以及(3)诗歌生成阶段。我们的模型能够生成第一verse使用带有门控复发单元(GRU)单元的后向和前向语言模型(B / F-LM)明确地将主题相关的短语结合在一起,然后顺序地生成其他经文的尾声每个verse都是根据关键字和所有先前生成的经文组成,使用我们提出的层级 - 注意序列到序列模型(Hass)。我们展示了语音CNN子字 Embeddings对整体模型性能具有有效的贡献。关键字提取和扩展阶段可以确保所生成的诗是与输入查询意图相干和语义一致的。全面的人类评估证实,我们的模型产生的诗歌在标准中表现出基础模型,包括意义,一致性,流畅性和诗意。使用双语言评估的广泛的定量实验估计(BLEU)评分也表现出对强基线的显着改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号