首页> 外文期刊>Journal of Computer Science & Technology >Predicting Chinese Abbreviations from Definitions: An Empirical Learning Approach Using Support Vector Regression
【24h】

Predicting Chinese Abbreviations from Definitions: An Empirical Learning Approach Using Support Vector Regression

机译:根据定义预测中文缩写:使用支持向量回归的经验学习方法

获取原文
获取原文并翻译 | 示例
           

摘要

In Chinese, phrases and named entities play a central role in information retrieval. Abbreviations, however, make keyword-based approaches less effective. This paper presents an empirical learning approach to Chinese abbreviation prediction. In this study, each abbreviation is taken as a reduced form of the corresponding definition (expanded form), and the abbreviation prediction is formalized as a scoring and ranking problem among abbreviation candidates, which are automatically generated from the corresponding definition. By employing Support Vector Regression. (SVR) for scoring, we can obtain multiple abbreviation candidates together with their SVR values, which are used for candidate ranking. Experimental results show that the SVR method performs better than the popular heuristic rule of abbreviation prediction. In addition, in abbreviation prediction, the SVR method outperforms the hidden Markov model (HMM).
机译:在中文中,短语和命名实体在信息检索中起着核心作用。但是,缩写使基于关键字的方法不太有效。本文提出了一种经验学习方法,用于中文缩写预测。在本研究中,将每个缩写作为对应定义的简化形式(扩展形式),并将缩写预测形式化为缩写候选中的评分和排名问题,并根据对应的定义自动生成。通过采用支持向量回归。 (SVR)进行评分,我们可以获得多个缩写候选词及其SVR值,用于候选词排名。实验结果表明,SVR方法的性能优于流行的缩写预测启发式规则。另外,在缩写预测中,SVR方法优于隐马尔可夫模型(HMM)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号