INTERSPEECH 2012

Dynamic Conditional Random Fields for Joint Sentence Boundary and Punctuation Prediction



Abstract

The use of dynamic conditional random fields (DCRF) has been shown to outperform linear-chain conditional random fields (LCRF) for punctuation prediction on conversational speech texts [1]. In this paper, we combine lexical, prosodic, and modified n-gram score features into the DCRF framework for a joint sentence boundary and punctuation prediction task on TDT3 English broadcast news. We show that the joint prediction method outperforms the conventional two-stage method using LCRF or the maximum entropy model (MaxEnt). We show the importance of various features using DCRF, LCRF, MaxEnt, and the hidden-event n-gram model (HEN) respectively. In addition, we address the practical issue of feature explosion by introducing lexical pruning, which reduces model size and improves the F1-measure. We adopt incremental local training to overcome memory size limitations without incurring a significant performance penalty. Our results show that adding prosodic and n-gram score features gives about 20% relative error reduction in all cases. Overall, DCRF gives the best accuracy, followed by LCRF, MaxEnt, and HEN.
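The abstract casts boundary and punctuation prediction as a single joint sequence-labeling problem rather than two cascaded stages. As a rough illustration only (not the paper's DCRF, and with invented scores), the sketch below decodes joint punctuation tags with Viterbi, where a PERIOD tag doubles as a sentence boundary; the `<pause>` token standing in for a prosodic cue is likewise hypothetical.

```python
# Toy sketch of joint sentence-boundary and punctuation prediction as
# sequence labeling. NOT the paper's DCRF: a plain Viterbi decoder over
# joint tags, with hand-picked illustrative log-scores.

# Joint tags: punctuation event after each word; PERIOD also marks a
# sentence boundary, which is what makes the prediction "joint".
TAGS = ["NONE", "COMMA", "PERIOD"]

def viterbi(words, emit_score, trans_score):
    """Return the highest-scoring joint tag sequence for `words`."""
    n = len(words)
    best = [{t: emit_score(words[0], t) for t in TAGS}]  # best score per tag
    back = [{}]                                          # backpointers
    for i in range(1, n):
        best.append({})
        back.append({})
        for t in TAGS:
            prev_t, s = max(
                ((p, best[i - 1][p] + trans_score(p, t)) for p in TAGS),
                key=lambda x: x[1],
            )
            best[i][t] = s + emit_score(words[i], t)
            back[i][t] = prev_t
    # Backtrace from the best final tag.
    t = max(TAGS, key=lambda tag: best[-1][tag])
    tags = [t]
    for i in range(n - 1, 0, -1):
        t = back[i][t]
        tags.append(t)
    return list(reversed(tags))

# Toy scores standing in for the lexical / prosodic / n-gram features.
def emit_score(word, tag):
    if word == "<pause>":  # long pause: prosodic cue for a boundary
        return 0.0 if tag == "PERIOD" else -2.0
    return 0.0 if tag == "NONE" else -1.0

def trans_score(prev, cur):
    # Discourage two punctuation events in a row.
    return -1.0 if (prev != "NONE" and cur != "NONE") else 0.0

print(viterbi(["okay", "so", "<pause>", "we", "begin"],
              emit_score, trans_score))
# → ['NONE', 'NONE', 'PERIOD', 'NONE', 'NONE']
```

In the paper, the single hand-tuned score above is replaced by learned feature weights over lexical, prosodic, and n-gram score features, and the DCRF additionally factors the label into coupled boundary and punctuation chains instead of one flat joint tag set.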


