Journal of Software

Training MEMM with PSO: A Tool for Part-of-Speech Tagging

Abstract

Maximum Entropy Markov Models (MEMM) avoid the independence assumption of traditional Hidden Markov Models (HMM) and can therefore exploit context information in most text mining tasks. Because the convergence rate of the classic Generalized Iterative Scaling (GIS) algorithm is too low to be tolerated, researchers have proposed many improved methods, such as IIS, SCGIS, and L-BFGS, for parameter training in MEMM. However, these methods sometimes do not meet task requirements for efficiency and robustness. This article modifies the traditional Particle Swarm Optimization (PSO) algorithm by introducing a dynamic global mutation probability (DGMP) to address the local-optimum and infinite-loop problems, and uses the modified PSO to estimate the MEMM parameters. We apply the MEMM trained with the modified PSO to Chinese Part-of-Speech (POS) tagging, analyze the experimental results, and find that it achieves a higher convergence rate and accuracy than the traditionally trained MEMM.
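
The abstract does not spell out the DGMP schedule or the MEMM feature set, so the sketch below is only an illustration of the general idea: a NumPy particle swarm that maximizes a toy maximum-entropy log-likelihood (standing in for the MEMM per-state objective) and mutates the global best with a probability that rises while the search stagnates and resets when it improves. The synthetic data, the `pso_dgmp` function, and all hyperparameters are assumptions made for illustration, not the paper's actual configuration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy multinomial maximum-entropy objective standing in for the MEMM
# per-state log-likelihood (hypothetical synthetic data: 6 features, 3 tags).
X = rng.normal(size=(200, 6))
y = np.argmax(X @ rng.normal(size=(6, 3)) + rng.gumbel(size=(200, 3)), axis=1)

def log_likelihood(w_flat):
    """Mean log-likelihood of the toy maxent model for flattened weights."""
    scores = X @ w_flat.reshape(6, 3)
    scores -= scores.max(axis=1, keepdims=True)          # numerical stability
    log_probs = scores - np.log(np.exp(scores).sum(axis=1, keepdims=True))
    return log_probs[np.arange(len(y)), y].mean()

def pso_dgmp(fitness, dim, n_particles=30, iters=300,
             inertia=0.7, c1=1.5, c2=1.5, p_mut0=0.05, p_mut_max=0.5):
    """PSO maximizer with a dynamic global mutation probability: the
    probability of mutating the global best grows while the global best
    stagnates and resets once it improves (one plausible reading of DGMP,
    not necessarily the paper's exact schedule)."""
    pos = rng.normal(size=(n_particles, dim))
    vel = np.zeros_like(pos)
    pbest = pos.copy()
    pbest_val = np.array([fitness(p) for p in pos])
    g = pbest_val.argmax()
    gbest, gbest_val = pbest[g].copy(), pbest_val[g]
    p_mut = p_mut0
    for _ in range(iters):
        r1, r2 = rng.random((2, n_particles, dim))
        vel = inertia * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
        pos = pos + vel
        # Occasionally mutate the global best to escape local optima.
        if rng.random() < p_mut:
            cand = gbest + rng.normal(scale=0.5, size=dim)
            cand_val = fitness(cand)
            if cand_val > gbest_val:
                gbest, gbest_val = cand, cand_val
        vals = np.array([fitness(p) for p in pos])
        improved = vals > pbest_val
        pbest[improved], pbest_val[improved] = pos[improved], vals[improved]
        g = pbest_val.argmax()
        if pbest_val[g] > gbest_val:
            gbest, gbest_val = pbest[g].copy(), pbest_val[g]
            p_mut = p_mut0                        # progress: reset mutation rate
        else:
            p_mut = min(p_mut_max, p_mut * 1.1)   # stagnation: raise mutation rate
    return gbest, gbest_val

weights, ll = pso_dgmp(log_likelihood, dim=6 * 3)
print(f"best mean log-likelihood: {ll:.4f}")
```

A real implementation would replace the toy objective with the MEMM's per-state conditional log-likelihood over the tagged POS corpus and use Viterbi decoding at tagging time; only the optimizer structure above reflects the approach described in the abstract.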
