基于测度优化Laplacian SVM的中文指代消解方法

周炫余; 刘娟; 邵鹏; 卢笑; 罗飞

首页> 中文期刊> 《电子学报》 >基于测度优化Laplacian SVM的中文指代消解方法

基于测度优化Laplacian SVM的中文指代消解方法

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

相比于传统的基于半监督学习的指代消解方法，Laplacian SVM（Support Vector Machine）能有效的挖掘已标注样本和未标注样本的相似性和关联性，更好的推导模型的分类边界。而传统Laplacian SVM采用欧式距离度量样本之间的距离，使得异类样本之间的相似性可能过大，不利于样本的准确分类。对此，提出一种基于数据驱动学习最优测度Laplacian SVM算法以解决中文指代消解语料不足的问题。该方法通过优化样本对之间的相似性约束条件和引入Fisher判别项，增大同类样本间的相似性，并突出强判别能力的特征。此外，提出核嵌入的测度优化方法将以上线性测度优化推广到非线性空间，有利于Laplacian SVM利用核函数实现非线性分类。在ACE2005中文语料库上的测评结果表明，所提出测度优化的Laplacian SVM（包括线性和核嵌入两种形式）的方法只需少量标注样本就可以获得与经典的有监督学习模型相当甚至更好的消解性能，同时也优于其他传统的半监督学习方法。%Compared to the traditional semi-supervised based anaphora resolution methods,Laplacian SVM(Support Vector Machine)can efficiently explore the similarity and correlations between labeled and unlabeled samples for deriving more accurate classification model.However,traditional Laplacian SVM simply uses Euclidean distance to calculate the distance between two samples,which may result that two samples from different classes may have false high similarity.To address the problem of insufficient Chinese annotated corpus,a data-driven based method is proposed to learn the optimal distance metric.The proposed method takes similarity constraints between sample-pairs into consideration and introduces the Fisher discrimination criterion,so that the similarities of in-class samples are higher than those of between-class sam-ples,and the discriminant features are highlighted in the new metric space.Furthermore,the proposed metric-optimized method is generalized from linear to nonlinear space by the use of kernel,so that it can be used for non-linear classifica-tion.Compared with the classical supervised method and other four traditional semi-supervised methods on the ACE2005 Chinese corpus,the proposed method,both the linear form and kernel form,achieves the comparatively better or best per-formance,with fewer labeled samples.

著录项

来源
《电子学报》 |2016年第12期|3064-3072|共9页
作者
周炫余; 刘娟; 邵鹏; 卢笑; 罗飞;
展开▼
作者单位

武汉大学软件国家重点实验室;

湖北武汉430072;

武汉大学计算机学院;

湖北武汉 430072;

武汉大学软件国家重点实验室;

湖北武汉430072;

武汉大学计算机学院;

湖北武汉 430072;

武汉大学软件国家重点实验室;

湖北武汉430072;

武汉大学计算机学院;

湖北武汉 430072;

湖南大学电气与信息工程学院;

湖南长沙410082;

武汉大学软件国家重点实验室;

湖北武汉430072;

武汉大学计算机学院;

湖北武汉 430072;

展开▼
原文格式 PDF
正文语种 chi
中图分类信息处理（信息加工）;
关键词
测度优化; Laplacian SVM; 中文指代消解; 半监督学习; 自然语言处理;

相似文献

中文文献
外文文献
专利

1. 基于SVM的中文名词短语指代消解研究 [J] . 高俊伟 ,孔芳 ,朱巧明 . 计算机科学 . 2012,第010期
2. 基于全局优化的中文事件同指消解方法 [J] . 滕佳月 ,李培峰 ,朱巧明 . 北京大学学报：自然科学版 . 2016,第1期
3. 基于双代价参数 SVM 的生物医学文本指代消解研究 [J] . 张丽君 ,李丽双 ,范国龙 . 大连理工大学学报 . 2015,第004期
4. 基于结构化表示的中文事件同指消解方法 [J] . 宦敏 ,程昊熠 ,李培峰 . 中文信息学报 . 2019,第012期
5. 基于特征分选策略的中文共指消解方法 [J] . 李渝勤 ,甘润生 ,杨永红 . 计算机工程 . 2011,第018期
6. 基于结构化表示的中文事件同指消解方法 [C] . HUAN Min ,宦敏 ,CHENG Haoyi . 第十八届中国计算语言学大会暨中国中文信息学会2019学术年会 . 2018
7. 中文零指代消解方法研究与实现 [A] . 王立凯 . 2020

基于测度优化Laplacian SVM的中文指代消解方法

摘要

著录项

相似文献

相关主题

期刊订阅