首页> 外国专利> AUTOMATED DICTIONARY CREATION FOR SCIENTIFIC TERMS

AUTOMATED DICTIONARY CREATION FOR SCIENTIFIC TERMS

机译:科学术语的自动词典创建

摘要

Systems and methods for automated creation of a dictionary of scientific terms are described herein. Initially, input data is filtered to obtain a primary file having a plurality of term-ID pairs with each term-ID pair having a unique term ID and a scientific term. Further, a remove-term file is generated based on one or more term-ID pairs identified from the primary file such that the scientific terms of each term-ID pair corresponds to one of additional terms, frequent scientific terns, and undesirable terms. At least one term-ID pair from among the one or more term-ID pairs is altered to obtain a modified term-ID pair based on modification rules. The modified term-ID pair is added to an add-term file and a modified file is obtained based on the remove-term file and the add-term file. Duplicate term-ID pairs present in the modified file are removed to obtain the dictionary of scientific terms.
机译:本文描述了用于自动创建科学术语词典的系统和方法。最初,过滤输入数据以获得具有多个术语-ID对的主文件,其中每个术语-ID对具有唯一的术语ID和科学术语。此外,基于从主文件中标识的一个或多个术语-ID对生成删除术语文件,以使每个术语-ID对的科学术语对应于其他术语,频繁的科学术语和不良术语之一。基于修改规则,改变一个或多个术语-ID对中的至少一个术语-ID对,以获得修改后的术语-ID对。修改的术语-ID对被添加到添加术语文件,并且基于移除术语文件和添加术语文件获得修改的文件。删除已修改文件中存在的重复术语ID对,以获得科学术语词典。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号