Automatic Translating Between Ancient Chinese and Contemporary Chinese with Limited Aligned Corpora

机译：古代汉语与当代汉语与汉语 - 当代语料库中的自动翻译

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Chinese language has evolved a lot during the long-term development. Therefore, native speakers now have trouble in reading sentences written in ancient Chinese. In this paper, we propose to build an end-to-end neural model to automatically translate between ancient and contemporary Chinese. However, the existing ancient-contemporary Chinese parallel corpora are not aligned at the sentence level and sentence-aligned corpora are limited, which makes it difficult to train the model. To build the sentence level parallel training data for the model, we propose an unsupervised algorithm that constructs sentence-aligned ancient-contemporary pairs by using the fact that the aligned sentence pair shares many of the tokens. Based on the aligned corpus, we propose an end-to-end neural model with copying mechanism and local attention to translate between ancient and contemporary Chinese. Experiments show that the proposed unsupervised algorithm achieves 99.4% F1 score for sentence alignment, and the translation model achieves 26.95 BLEU from ancient to contemporary, and 36.34 BLEU from contemporary to ancient.

机译：在长期发展中，汉语演变了很多。因此，母语人士现在遇到古代汉语读书的奇迹。在本文中，我们建议建立一个端到端的神经模型，自动翻译古代和当代汉语。然而，现有的古代中国平行语料库在句子级没有对齐，句子对齐的是有限的，这使得训练模型很难。为了构建模型的句子级并行培训数据，我们提出了一种无监督算法，通过使用对齐的句子对许多令牌来构造句子对齐的古代当代对。基于对齐的语料库，我们提出了一个端到端的神经模型，复制机制和当地关注古代和当代汉语之间的翻译。实验表明，提出的无监督算法达到了句子对齐的99.4％F1分数，而翻译模型从古代到当代的古代达到26.95个BLEU，以及来自当代到古代的36.34 Bleu。

著录项

来源
《CCF International Conference on Natural Language Processing and Chinese Computing》|2019年|xxxii 850 p.|共11页
会议地点
作者
Zhiyuan Zhang; Wei Li; Qi Su;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机软件;
关键词
Sentence alignment; Neural machine translation;

机译：句子对齐;神经机翻译;

相似文献

外文文献
中文文献
专利

1. 基于语料库的当代汉语剧本中的请求策略研究 [J] . 王胜利中国应用语言学：英文版 . 2014,第001期
2. Automatic induction of bilingual resources from aligned parallel corpora:application to shallow-transfer machine translation [J] . Helena M. Caseli, Maria das Gracas V. Nunes, Mikel L. Forcada Machine translation . 2006,第4期

机译：从对齐的并行语料库中自动提取双语资源：在浅传输机器翻译中的应用
3. Discussion on Chinese Ancient Literature Translation Based on the English Translation of the Book of Songs [J] . Shuai Wu, Xiuzhang Yang, Huan Xia, 教育理论综述(英文) . 2020,第002期

机译：从诗经英译看中国古代文学翻译。
4. Discussion on Chinese Ancient Literature Translation Based on the English Translation of the Book of Songs [J] . Shuai Wu, Xiuzhang Yang, Huan Xia, 教育理论综述(英文) . 2020,第002期

机译：从诗经英译看中国古代文学翻译。
5. Automatic Translating Between Ancient Chinese and Contemporary Chinese with Limited Aligned Corpora [C] . Zhiyuan Zhang, Wei Li, Qi Su CCF International Conference on Natural Language Processing and Chinese Computing . 2019

机译：有限对齐语料库自动在古汉语和当代汉语之间进行翻译
6. New Export China: Translations Across Time and Place in Contemporary Chinese Porcelain Art (1996–2016) [D] . Burchmore, Alexander Thomas 2019

机译：新出口中国：当代中国瓷器艺术的时空翻译（1996–2016）
7. Semi-Automatic Construction of the Chinese-English MeSH Using Web-BasedTerm Translation Method [O] . Wen-Hsiang Lu, Shih-Jui Lin, Yi-Che Chan, 2005

机译：基于Web的汉英MeSH半自动构建术语翻译法
8. Automatic Acquisition of a High-Precision Translation Lexicon from Parallel Chinese-English Corpora [O] . Gao Zhao-Ming 1998

机译：从汉英平行语料库中自动获取高精度翻译词典

Automatic Translating Between Ancient Chinese and Contemporary Chinese with Limited Aligned Corpora

摘要

著录项

相似文献

相关主题

期刊订阅