首页> 外文会议>Workshop on Asian translation >Similar Southeast Asian Languages: Corpus-Based Case Study on Thai-Laotian and Malay-Indonesian
【24h】

Similar Southeast Asian Languages: Corpus-Based Case Study on Thai-Laotian and Malay-Indonesian

机译:相似的东南亚语言:基于泰语-老挝语和马来语-印尼语的语料库案例研究

获取原文

摘要

This paper illustrates the similarity between Thai and Laotian, and between Malay and Indonesian, based on an investigation on raw parallel data from Asian Language Treebank. The cross-lingual similarity is investigated and demonstrated on metrics of correspondence and order of tokens, based on several standard statistical machine translation techniques. The similarity shown in this study suggests a possibility on harmonious annotation and processing of the language pairs in future development.
机译:本文基于对亚洲语言树库的原始并行数据的调查,说明了泰国和老挝之间,马来人和印度尼西亚之间的相似性。基于几种标准的统计机器翻译技术,对跨语言相似性进行了调查,并根据标记的对应性和顺序度量进行了证明。这项研究显示的相似性表明,在未来的发展中,有可能对语言对进行和谐的注释和处理。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号