Conference: International Conference on Information Technology for Manufacturing Systems

An Improved Method for Mathematical Formula Extraction in Printed English and Chinese Documents



Abstract

Accurately locating mathematical formulas in scientific documents is the basis of their recognition. Most existing formula extraction methods target documents in a single language and do not adapt well to documents in other languages. This paper describes an improved method that extracts formulas from both Chinese and English documents. First, run-number features are used to identify the document's language; then, based on the differences between Chinese and English documents, the corresponding features and parameters are chosen for formula extraction. The experimental results show that this method can improve the robustness of formula extraction.
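
The abstract does not define its run-number feature or the decision rule built on it, so the following is only a minimal sketch under stated assumptions. It takes "run-number" to mean the count of consecutive foreground-pixel runs along a scanline of a binarized image, and it assumes that denser Chinese glyphs yield a higher average count than English text. The function names and the `threshold` parameter are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def run_number(scanline: Sequence[int]) -> int:
    """Count runs of consecutive foreground (1) pixels in one scanline."""
    runs = 0
    previous = 0
    for pixel in scanline:
        # A new run starts at every background-to-foreground transition.
        if pixel == 1 and previous == 0:
            runs += 1
        previous = pixel
    return runs


def mean_run_number(image: Sequence[Sequence[int]]) -> float:
    """Average run-number over the rows that contain foreground pixels."""
    counts = [run_number(row) for row in image]
    non_empty = [count for count in counts if count > 0]
    return sum(non_empty) / len(non_empty) if non_empty else 0.0


def guess_language(image: Sequence[Sequence[int]], threshold: float) -> str:
    """Hypothetical rule (an assumption): a higher mean run-number means Chinese."""
    return "chinese" if mean_run_number(image) > threshold else "english"
```

In practice the threshold would have to be tuned on labeled samples of each language; the value is left as a parameter here because the abstract gives no figure for it.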
