首页> 外文会议>IEEE International Conference on Cognitive Infocommunications >Can time dependencies and ensemble classification improve content-free dialogue segmentation?
【24h】

Can time dependencies and ensemble classification improve content-free dialogue segmentation?

机译:时间依赖性和整体分类能否改善无内容的对话细分?

获取原文

摘要

We present an extended study of content-free topic segmentation of conversational (meeting) data based on classification of vocalisation events. In previous work, content-free topic segmentation achieved good accuracy through a modified naïve Bayes classifier and vocalisation horizon features. In this study, we attempted to improve on those results by incorporating time (sequential) dependency information into the topic boundary detection process through the use of conditional random fields and ensemble classifiers. We expected that incorporating such information would help reduce the number of false positives generated by the naïve Bayes method. We introduce a new metric in the assessment of performance, in addition to the usual Pk and WindowDiff (WD) metrics in order to account for the under-detection bias of the segmentation task. Although a boosting model showed fairly good performance using a simple base classifier and limited contextual features, the more elaborate methods still trailed the Bayesian method.
机译:我们提出了基于发声事件分类的会话(会议)数据的无内容主题细分的扩展研究。在以前的工作中,通过修改朴素的贝叶斯分类器和发声范围功能,实现了无内容的主题细分。在这项研究中,我们尝试通过使用条件随机字段和集成分类器将时间(顺序)依赖项信息纳入主题边界检测过程来改善这些结果。我们预计,合并此类信息将有助于减少由朴素贝叶斯方法产生的误报的数量。除了通常的Pk和WindowDiff(WD)指标外,我们还引入了一种新的性能评估指标,以解决细分任务的检测不足偏差。尽管使用简单的基础分类器和有限的上下文功能,增强模型显示出相当不错的性能,但更为复杂的方法仍落后于贝叶斯方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号