METHOD FOR CONSTRUCTING CHINESE-ENGLISH BILINGUAL CORPUS, AND RELATED DEVICE

Abstract

A method for constructing a Chinese-English bilingual corpus, and a related device, in the field of computer technology, applicable to smart cities and specifically to smart living. The method comprises: obtaining a Chinese entity, an English entity, and the mapping relationship and intertranslation relationship between the two, and constructing a bilingual entity word network according to a preset requirement; calculating a single-language representation estimated value and a cross-language entity estimated value for the bilingual entity word network from the Chinese entity, the English entity, contextual words, a preset hyperlink set, and a preset sentence set; using a training sentence, calculating a cross-language sentence estimated value for an obtained comparable sentence network; calculating a target estimated value from the three estimated values; and, according to the target estimated value, combining the bilingual entity word network and the comparable sentence network into the Chinese-English bilingual corpus and storing that corpus on a blockchain. Exploiting the correlation between the two networks improves the accuracy of the entries in the Chinese-English bilingual corpus.
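
The abstract does not state how the three estimated values are combined into the target estimated value. As a minimal sketch only, assuming a weighted sum, the combination step might look like the following Python. The class `Estimates`, the function `target_estimate`, and the weights are hypothetical names and choices introduced for illustration; none of them come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Estimates:
    """The three estimated values named in the abstract (field names are hypothetical)."""
    monolingual: float      # single-language representation estimated value
    cross_entity: float     # cross-language entity estimated value
    cross_sentence: float   # cross-language sentence estimated value


def target_estimate(e: Estimates, weights=(1.0, 1.0, 1.0)) -> float:
    """Combine the three values into one target value.

    ASSUMPTION: a weighted sum. The abstract only says the target value is
    calculated "according to the three estimated values".
    """
    w_mono, w_entity, w_sentence = weights
    return (w_mono * e.monolingual
            + w_entity * e.cross_entity
            + w_sentence * e.cross_sentence)


if __name__ == "__main__":
    example = Estimates(monolingual=0.5, cross_entity=0.25, cross_sentence=0.25)
    print(target_estimate(example))
    print(target_estimate(example, weights=(2.0, 0.0, 4.0)))
```

With equal weights this reduces to a plain sum of the three values. The weights are left as a parameter because the relative importance of the word network and the sentence network is not given in the abstract.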

Bibliographic details

  • Publication number: WO2021218012A1
  • Publication date: 2021-11-04
  • Original format: PDF
  • Applicant/patentee: PING AN TECHNOLOGY (SHENZHEN) CO. LTD.
  • Application number: WO2020CN117388
  • Inventors: DENG YUE; JIN GE; XU LIANG
  • Filing date: 2020-09-24
  • Classification: G06F40/58
  • Country: CN
  • Date indexed: 2022-08-24 22:07:19
