Conference: International Conference on Language Resources and Evaluation

Semantic annotation of French corpora: animacy and verb semantic classes


Abstract

This paper presents a first corpus of French annotated for animacy and for verb semantic classes. The resource consists of 1,346 sentences extracted from three corpora: the French Treebank (Abeillé and Barrier, 2004), the Est-Républicain corpus (CNRTL) and the ESTER corpus (ELRA). Each sentence is parsed and contains a verbal head that subcategorizes for two complements; the verb and both complements carry annotations, encoded in the TIGER XML format (Mengel and Lezius, 2000). The resource was manually annotated and manually corrected by three annotators. Animacy was annotated following the categories of Zaenen et al. (2004). Inter-annotator agreement is good: Multi-π = 0.82 and Multi-κ = 0.86 (k = 3, N = 2360). For verb semantic classes, we used three of the five classification levels of an existing dictionary, Les Verbes du Français (Dubois and Dubois-Charlier, 1997). For the higher level (generic classes), agreement is Multi-π = 0.84 and Multi-κ = 0.87 (k = 3, N = 1346). These agreement scores show that the annotated data are reliable for both animacy and verb semantic classes.
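The abstract reports Multi-π and Multi-κ without giving their formulas. The sketch below shows one common way such coefficients are computed, assuming the usual definitions: observed agreement is the mean proportion of agreeing annotator pairs per item, multi-π takes expected agreement from the pooled label distribution, and multi-κ takes it from each annotator's own distribution. It also assumes that k is the number of annotators and N the number of annotated items. The function names and the toy labels are hypothetical and are not taken from the paper or its data.

```python
from collections import Counter
from itertools import combinations


def observed_agreement(labels):
    """Mean proportion of agreeing annotator pairs per item.

    `labels` is a list of items; each item is a list holding one label
    per annotator, in a fixed annotator order.
    """
    n_coders = len(labels[0])
    ordered_pairs = n_coders * (n_coders - 1)
    total = 0.0
    for item in labels:
        counts = Counter(item)
        total += sum(n * (n - 1) for n in counts.values()) / ordered_pairs
    return total / len(labels)


def expected_multi_pi(labels):
    """Expected agreement from the label distribution pooled over annotators."""
    pooled = Counter(label for item in labels for label in item)
    total = sum(pooled.values())
    return sum((n / total) ** 2 for n in pooled.values())


def expected_multi_kappa(labels):
    """Expected agreement from each annotator's individual label distribution."""
    n_items = len(labels)
    n_coders = len(labels[0])
    per_coder = [Counter(item[c] for item in labels) for c in range(n_coders)]
    categories = set().union(*per_coder)
    coder_pairs = list(combinations(range(n_coders), 2))
    total = 0.0
    for a, b in coder_pairs:
        total += sum(
            (per_coder[a][k] / n_items) * (per_coder[b][k] / n_items)
            for k in categories
        )
    return total / len(coder_pairs)


def chance_corrected(a_o, a_e):
    """Generic chance-corrected coefficient (A_o - A_e) / (1 - A_e)."""
    return (a_o - a_e) / (1 - a_e)


def multi_pi(labels):
    return chance_corrected(observed_agreement(labels), expected_multi_pi(labels))


def multi_kappa(labels):
    return chance_corrected(observed_agreement(labels), expected_multi_kappa(labels))


if __name__ == "__main__":
    # Toy data only: 4 items, 3 annotators, two made-up categories.
    toy = [
        ["a", "a", "a"],
        ["a", "a", "b"],
        ["b", "b", "b"],
        ["b", "b", "b"],
    ]
    print(f"multi-pi    = {multi_pi(toy):.3f}")
    print(f"multi-kappa = {multi_kappa(toy):.3f}")
```

The two coefficients differ only in how chance agreement is estimated, which is why the abstract can report two slightly different values for the same annotations.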
