首页> 外文期刊>Journal of Computer Science & Technology >A Method to Build a Super Small but Practically Accurate Language Model for Handheld Devices
【24h】

A Method to Build a Super Small but Practically Accurate Language Model for Handheld Devices

机译:一种用于手持设备的超小但实用的语言模型的构建方法

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, an important question, whether a small language model can be practically accurate enough, is raised. Afterwards, the purpose of a language model, the problems that a language model faces, and the factors that affect the performance of a language model, are analyzed. Finally, a novel method for language model compression is proposed, which makes the large language model usable for applications in handheld devices, such as mobiles, smart phones, personal digital assistants (PDAs), and handheld personal computers (HPCs). In the proposed language model compression method, three aspects are included. First, the language model parameters are analyzed and a criterion based on the importance measure of n-grams is used to determine which n-grams should be kept and which removed. Second, a piecewise linear warping method is proposed to be used to compress the uni-gram count values in the full language model. And third, a rank-based quantization method is adopted to quantize the bi-gram probability values. Experiments show that by using this compression method the language model can be reduced dramatically to only about 1M bytes while the performance almost does not decrease. This provides good evidence that a language model compressed by means of a well-designed compression technique is practically accurate enough, and it makes the language model usable in handheld devices.
机译:在本文中,提出了一个重要的问题,即小语言模型是否可以在实际上足够准确。然后,分析语言模型的目的,语言模型面临的问题以及影响语言模型性能的因素。最后,提出了一种新的语言模型压缩方法,该方法使大型语言模型可用于手持设备中的应用,例如移动电话,智能电话,个人数字助理(PDA)和手持个人计算机(HPC)。在所提出的语言模型压缩方法中,包括三个方面。首先,分析语言模型参数,并使用基于n-gram重要性度量的标准来确定应保留和删除哪些n-gram。其次,提出了一种分段线性变形方法,用于压缩全语言模型中的单字母组计数值。第三,采用基于秩的量化方法对二元语法概率值进行量化。实验表明,通过使用这种压缩方法,语言模型可以显着减少到大约1M字节,而性能几乎没有下降。这提供了充分的证据,表明通过精心设计的压缩技术压缩的语言模型实际上足够准确,并且使该语言模型可在手持设备中使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号