UCSG Shallow Parser

机译：UCSG浅解析器

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recently, there is an increasing interest in integrating rule based methods with statistical techniques for developing robust, wide coverage, high performance parsing systems. In this paper, we describe an architecture, called UCSG shallow parser architecture, which combines linguistic constraints expressed in the form of finite state grammars with statistical rating using HMMs built from a POS-tagged corpus and an A* search for global optimization for determining the best shallow parse for a given sentence. The primary aim of the design of the UCSG parsing architecture is developing a judicious combination of linguistic and statistical methods to develop wide coverage robust shallow parsing systems, without the need for large scale manually parsed training corpora. The UCSG architecture uses a grammar to specify all valid structures and a statistical component to rate and rank the possible alternatives, so as to produce the best parse first without compromising on the ability to produce all possible parses. The architecture supports bootstrapping with an aim to reduce the need for parsed training corpora. The complete system has been implemented in Perl under Linux. In this paper we first describe the UCSG shallow parsing architecture and then focus on the evaluation of the UCSG finite state grammar for the chunking task for English. Recall of 91.16% and 93.73% have been obtained on the Susanne parsed corpus and CoNLL 2000 chunking task test data set respectively. Extensive experimentation is under way to evaluate the other modules.

机译：最近，人们越来越关注将基于规则的方法与统计技术相集成，以开发健壮，覆盖面广的高性能解析系统。在本文中，我们描述了一种称为UCSG浅层解析器体系结构的体系结构，该体系结构将有限状态文法形式的语言约束与具有统计等级的统计信息结合使用，该HMM使用POS标签语料库构建的HMM和A *搜索全局优化来确定给定句子的最佳浅层分析。 UCSG解析体系结构设计的主要目的是开发一种语言学和统计方法的明智组合，以开发广泛覆盖的健壮的浅层解析系统，而无需大规模的手动解析训练语料库。 UCSG体系结构使用语法指定所有有效结构，并使用统计成分对可能的备选方案进行评级和排名，以便首先产生最佳解析，而又不影响产生所有可能解析的能力。该体系结构支持自举，目的是减少对解析后的训练语料库的需求。完整的系统已在Linux下的Perl中实现。在本文中，我们首先描述了UCSG浅层解析体系结构，然后重点介绍了针对英语分块任务的UCSG有限状态语法的评估。在Susanne解析的语料库和CoNLL 2000分块任务测试数据集上，分别获得了91.16％和93.73％的调用率。正在进行广泛的实验以评估其他模块。

著录项

来源
《International Conference on Computational Linguistics and Intelligent Text Processing(CICLing 2006); 20060219-25; Mexico City(MX)》|2006年|P.156-167|共12页
会议地点 Mexico City(MX)
作者
Guntur Bharadwaja Kumar; Kavi Narayana Murthy;
展开▼
作者单位

Department of Computer and Information Siences, University of Hyderabad, India;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类程序语言、算法语言;
关键词
chunking; shallow parsing; finite state grammar; HMM; A~* search; UCSG architecture;

机译：分块;浅层解析;有限状态语法; HMM; A〜*搜索; UCSG体系结构;

相似文献

外文文献
中文文献
专利

1. Chinese Shallow Semantic Parsing Based on Multi-method of Machine Learning [J] . Wan Fucheng, He Xiangzhen, Zhang Dongjiao, Journal of web engineering . 2020,第5a6期

机译：基于机器学习多方法的中国浅层语义解析
2. BIDIRECTIONAL GATED RECURRENT UNIT FOR SHALLOW PARSING [J] . Medari Janai Tham Indian Journal of Computer Science and Engineering . 2020,第5期

机译：用于浅析的双向门控复发单元
3. Shallow Parsing Approach to Automated Grammaticality Evaluation [J] . AREGBESOLA Kehinde, GANIYU Adesina, OLABIYISI Olatunde, Journal of Computer Science and Control Systems . 2020,第1期

机译：自动语法评价的浅层解析方法
4. UCSG Shallow Parser [C] . Guntur Bharadwaja Kumar, Kavi Narayana Murthy International Conference on Computational Linguistics and Intelligent Text Processing . 2006

机译：UCSG肤色解释器
5. Faceted Search and Browsing of Indonesian Text Collection Using Shallow Parsing Techniques. [D] . Sanaka, Srinivasa Raviteja. 2010

机译：使用浅层解析技术对印度尼西亚文本集合进行多面搜索和浏览。
6. Shallow Semantic Parsing of Randomized Controlled Trial Reports [O] . Hyung Paek, Yacov Kogan, Prem Thomas, 2006

机译：随机对照试验报告的浅语义分析
7. Shallow Discourse Parsing Using Constituent Parsing Tree [O] . Changge Chen, Peilu Wang, Hai Zhao 2015

机译：使用构成解析树解析浅话题
8. Skeletons in the Parser: Using a Shallow Parser to Improve Deep Parsing [R] . Swift, M. , Allen, J. , Gildea, D. 2004

机译：解析器中的骷髅：使用浅层解析器来改善深度解析

UCSG Shallow Parser

摘要

著录项

相似文献

相关主题

期刊订阅