Structure-based voiced/usable speech detection using state space embedding

Abstract

Speech production in humans is a highly complex, nonlinear process that can be modeled precisely only in terms of nonlinear dynamics. A nonlinear speech classification approach is proposed that classifies speech using features extracted with Takens' method of delays, a technique that reconstructs a signal as a trajectory in a multidimensional state space. Two types of speech detection are presented: voiced speech detection and usable speech detection (for speaker identification purposes). By comparing the structures of embedded voiced speech frames with those of embedded unvoiced frames, and of embedded usable speech frames with those of unusable frames, the proposed approach yields a probability of error of 12% in noisy environments for voiced speech detection and 78% correct usable speech detection. Applications of this speech detection technique include the enhancement of speaker identification and speech recognition systems.
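The embedding step named in the abstract can be sketched as follows. This is a minimal illustration of delay-coordinate embedding in general, not the authors' implementation. The function name `delay_embed`, the embedding dimension, the delay, and the synthetic test frame are all assumptions chosen for the example. The abstract does not state the parameters or the structural features used in the paper.

```python
import numpy as np


def delay_embed(signal, dim=3, delay=1):
    """Return the delay-coordinate embedding of a 1-D signal.

    Row i is the state vector
    [x[i], x[i + delay], ..., x[i + (dim - 1) * delay]].
    """
    x = np.asarray(signal, dtype=float)
    n_points = x.size - (dim - 1) * delay
    if n_points <= 0:
        raise ValueError("frame too short for this dim and delay")
    # Stack delayed copies of the frame as columns of the trajectory matrix.
    return np.column_stack(
        [x[k * delay : k * delay + n_points] for k in range(dim)]
    )


# Hypothetical frame: 240 samples of a 200 Hz sinusoid at 8 kHz (assumed values).
fs = 8000
frame = np.sin(2 * np.pi * 200 * np.arange(240) / fs)

# Assumed parameters: 3-dimensional embedding with a 10-sample delay.
trajectory = delay_embed(frame, dim=3, delay=10)
print(trajectory.shape)  # (220, 3)
```

A periodic frame such as this one traces a closed loop in the embedded space, whereas a noise-like frame fills the space irregularly. Structural differences of this kind are what the abstract describes comparing between frame classes.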

Bibliographic details

Source
Intelligent Signal Processing and Communication Systems, 2004. ISPACS 2004. Proceedings of 2004 International Symposium on | 2004 | pp. 811-815 | 5 pages
Authors
Ofoegbu U.O.; Smolenski B.Y.; Yantorno R.E.;
Original format: PDF
Chinese Library Classification: radio electronics, telecommunication technology
Keywords
speech enhancement; state-space methods; signal classification; signal reconstruction; multidimensional signal processing; feature extraction; structure-based voice detection; usable speech detection; state space embedding; nonlinear dynamics; nonlinear speech classification; Takens' method of delays; multidimensional state space trajectory; speaker identification enhancement; noisy environments; embedded voiced speech frames; embedded unvoiced speech frames; usable speech frames; unusable speech; speech recognition;

Similar literature

1. A subspace approach based on embedded prewhitening for voice activity detection [J] . Kim D.K., Chang J.-H. The Journal of the Acoustical Society of America . 2011, No. 5aPta1

2. Instantaneous voiced/non-voiced detection in speech signals based on variational mode decomposition [J] . Upadhyay Abhay, Pachori Ram Bilas. Journal of the Franklin Institute . 2015, No. 7

3. Detection of Voiced, Unvoiced and Silence Regions of Assamese Speech by Using Acoustic Features [J] . Bidyut Kumar Das, Ajit Das, Utpal Bhattacharjee. International Journal of Computer Trends and Technology . 2014, No. 2

4. Structure-based voiced/usable speech detection using state space embedding [C] . Ofoegbu U.O., Smolenski B.Y., Yantorno R.E. International Symposium on Intelligent Signal Processing and Communication Systems . 2004

5. Advances in Audiovisual Speech Processing for Robust Voice Activity Detection and Automatic Speech Recognition [D] . Tao, Fei. 2018

6. Existence detection and embedding rate estimation of blended speech in covert speech communications [O] . Lijuan Li, Yong Gao

7. A fast method for high-resolution voiced/unvoiced detection and glottal closure/opening instant estimation of speech [O] . Koutrouvelis, A., Kafentzis, GP, Gaubitch, N.D., 2015
