Journal of Computer Science & Technology
Scaling Conditional Random Fields by One-Against-the-Other Decomposition

Abstract

As a powerful sequence labeling model, conditional random fields (CRFs) have been applied successfully to many natural language processing (NLP) tasks. However, the high computational complexity of CRF training permits only a very small tag (or label) set, because training becomes intractable as the tag set grows. This paper proposes an improved decomposed training and joint decoding algorithm for CRF learning. Instead of training a single CRF model over all tags, it trains a binary sub-CRF independently for each tag. An optimal tag sequence is then produced by a joint decoding algorithm based on the probabilistic outputs of all the sub-CRFs involved. To test its effectiveness, we apply this approach to Chinese word segmentation (CWS) cast as a sequence labeling problem. Our evaluation shows that it can reduce the computational cost of this task by 40-50% without any significant performance loss on various large-scale data sets.
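The joint decoding step described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes each binary sub-CRF emits, for every position, the probability that its tag fires, and runs a Viterbi pass over the per-tag log-probabilities with a transition mask restricting the path to legal BMES tag bigrams (the standard tag set for CWS). The function name `joint_decode` and the zero/minus-infinity transition scores are illustrative assumptions.

```python
import numpy as np

TAGS = ["B", "M", "E", "S"]  # Begin / Middle / End of word, Single-char word

# Legal BMES tag bigrams: a word opened by B must continue (M) or close (E);
# a completed word (E or S) must be followed by a new word (B or S).
LEGAL = {
    ("B", "M"), ("B", "E"), ("M", "M"), ("M", "E"),
    ("E", "B"), ("E", "S"), ("S", "B"), ("S", "S"),
}

def joint_decode(prob):
    """Viterbi over per-tag sub-CRF outputs.

    prob[t, k] = probability from sub-CRF k that tag TAGS[k] applies at
    position t.  Returns the highest-scoring legal tag sequence.
    """
    T, K = prob.shape
    logp = np.log(np.clip(prob, 1e-12, None))

    # Transition mask: 0 for a legal bigram, -inf for an illegal one.
    trans = np.full((K, K), -np.inf)
    for i, a in enumerate(TAGS):
        for j, b in enumerate(TAGS):
            if (a, b) in LEGAL:
                trans[i, j] = 0.0

    score = logp[0].copy()               # best score ending in each tag
    back = np.zeros((T, K), dtype=int)   # backpointers
    for t in range(1, T):
        cand = score[:, None] + trans    # cand[prev, cur]
        back[t] = cand.argmax(axis=0)
        score = cand.max(axis=0) + logp[t]

    # Backtrack from the best final tag.
    path = [int(score.argmax())]
    for t in range(T - 1, 0, -1):
        path.append(int(back[t][path[-1]]))
    return [TAGS[k] for k in reversed(path)]
```

For example, for a three-character input where the sub-CRFs strongly prefer B, E, S at positions 0, 1, 2 respectively, the decoder returns `["B", "E", "S"]`, i.e., a two-character word followed by a single-character word. A full joint decoder would also calibrate the independently trained sub-CRF probabilities before combining them; this sketch takes them at face value.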
