Parenthetical Classification for Information Extraction

机译：信息提取的括号分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The article focuses on a rather unexplored topic in NLP: parenthetical classification. Parenthet-icals are defined as any text sequence between parentheses. They have been approached from isolated perspectives, like translation pairs extraction, but a full account of their syntactic and semantic properties is lacking. This article proposes a new comprehensive scheme drawn from corpus-based linguistic studies on French news. This research is part of a project investigating the structural aspects of punctuation signs and their usefulness for Information Extraction. Parenthetical classification is approached as a relation extraction problem split into three correlated subtasks: syntactic and semantic classification and head recognition. Corpus-based studies singled out 11 syntactic and 18 semantic relation subtypes. The article addresses automatic classification, using a combination of CRF and SVM. This baseline system reports 0.674 (head recognition). 0.908 (syntax), 0.734 (semantics), and 0.518 (end-to-end) of F1.

机译：本文重点介绍NLP中一个尚未开发的主题：括号分类。括号定义为括号之间的任何文本序列。已经从孤立的角度（例如翻译对提取）着手处理它们，但是仍缺乏对它们的句法和语义属性的完整说明。本文提出了一种新的综合方案，该方案是从基于语料库的法国新闻语言研究中得出的。这项研究是研究标点符号的结构方面及其对信息提取的实用性的项目的一部分。括号分类法是一种关系提取问题，分为三个相关的子任务：句法和语义分类以及头部识别。基于语料库的研究选择了11种句法和18种语义关系子类型。本文介绍了结合使用CRF和SVM进行自动分类的方法。该基线系统报告为0.674（头部识别）。 F1的0.908（语法），0.734（语义）和0.518（端对端）。

著录项

来源
《International conference on computational linguistics》|2012年|297-308|共12页
会议地点
作者
Ismail EL MAAROUF; Jeanne VILLANEAU;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Parentheticals; Punctuation; Information Extraction;

机译：括号;标点;信息提取;

相似文献

外文文献
中文文献
专利

1. Collective Web-Based Parenthetical Translation Extraction Using Markov Logic Networks [J] . RICHARD TZONG-HAN TSAI ACM transactions on Asian language information processing . 2016,第2期

机译：基于马尔可夫逻辑网络的基于Web的集体括号翻译提取
2. Parenthetical verb constructions, fragment answers, and constituent modification [J] . James Griffiths Natural language & linguistic theory . 2015,第1期

机译：括号动词结构，片段答案和成分修饰
3. Pushed aside: parentheticals, memory and processing [J] . Brian Dillon, Charles Clifton Jr., Lyn Frazier Language, cognition and neuroscience . 2014,第4期

机译：放在一边：括号，内存和处理
4. Parenthetical Classification for Information Extraction [C] . Ismail EL MAAROUF, Jeanne VILLANEAU International conference on computational linguistics . 2012

机译：关于信息提取的括号分类
5. Concerning American parenthetical expressions in syntax [D] . Grubb, Teresa R. 2016

机译：关于语法中的美国括号表达
6. Morphological Classification of Extraction Sockets and Clinical Decision Tree for Socket Preservation/Augmentation after Tooth Extraction: a Systematic Review [O] . Gintaras Juodzbalys, Arturas Stumbras, Samir Goyushov, 2019

机译：拔牙后牙槽的形态分类和拔牙后牙槽保存/增强的临床决策树：系统评价
7. To Improve Feature Extraction and Opinion Classification Issues in Customer Product Reviews Utilizing an Efficient Feature Extraction and Classification (EFEC) Algorithm [O] . Palaiyah Solainayagi, Ramalingam Ponnusamy 2018

机译：利用有效特征提取和分类（efec）算法，改善客户产品评论中的特征提取和意见分类问题
8. Extraction of Shrimp Ponds Using Object Oriented Classification vis-a- vis Pixel Based Classification [R] . Chauhan, R. , Tripathi, N. K. , Chowdhury, S. R. 2004

机译：基于面向对象分类的面向对象分类提取虾塘

Parenthetical Classification for Information Extraction

摘要

著录项

相似文献

相关主题

期刊订阅