首页> 外文会议>INTERSPEECH 2012 >Speaker Clustering for a Mixture of Singing and Reading

【24h】

Speaker Clustering for a Mixture of Singing and Reading

机译：扬声器聚类，用于唱歌和阅读的混合

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this study, we propose a speaker clustering algorithm based on reading and singing speech samples for each speaker. As a speaking style, singing introduces changes in the time-frequency structure of a speaker's voice. The purpose of this study is to introduce advancements into speech systems such as speech indexing and retrieval which improve robustness to intrinsic variations in speech production. Clustering is performed within a GMM mean supervector space. The proposed method includes two stages. First, initial clusters are obtained using traditional clustering techniques such as k-means, and hierarchical. Next, each cluster is refined in a PLDA subspace resulting in a more speaker dependent representation that is less sensitive to speaking style. The proposed algorithm improves the average clustering accuracy of the k-means baseline by +9.3% absolute.

机译：在这项研究中，我们提出了一种基于每个扬声器读取和唱歌语音样本的扬声器聚类算法。作为说话的风格，唱歌引入了扬声器语音时频结构的变化。本研究的目的是将进步引入语音系统，例如语音索引和检索，这改善了语音生产中内在变化的鲁棒性。群集在GMM平均监控器空间内执行。该方法包括两个阶段。首先，使用诸如K-Means等传统聚类技术获得初始集群和分层。接下来，每个群集都在PLDA子空间中精制，导致更多的扬声器依赖表示，对话框不太敏感。该算法提高了K-Means基线的平均聚类精度+ 9.3％绝对。

著录项

来源
《INTERSPEECH 2012》|2012年||共4页
会议地点
作者
Mahnoosh Mehrabani; John H. L. Hansen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 73.4136083;
关键词
speaker clustering; singing;

机译：扬声器聚类;唱歌;

相似文献

外文文献
中文文献
专利

1. Nested Gibbs sampling for mixture-of-mixture model and its application to speaker clustering [J] . Naohiro Tawara, Shinji Watanabe, Tetsuji Ogawa, APSIPA Transactions on Signal and Information Processing . 2016,第2016期

机译：混合模型的嵌套Gibbs采样及其在说话人聚类中的应用
2. Intra-speaker and inter-speaker variability in speech sound pressure level across repeated readings [J] . Castellana Antonella, Carullo Alessio, Astolfi Arianna, The Journal of the Acoustical Society of America . 2017,第4期

机译：在重复读数中，扬声器和扬声器间变异性在语音压力水平中
3. Low Power Speaker Identification by Integrated Clustering and Gaussian Mixture Model Scoring [J] . Iliev Nick, Gianelli Alberto, Trivedi Amit Ranjan Embedded Systems Letters, IEEE . 2020,第1期

机译：通过集成聚类和高斯混合模型评分低功率扬声器识别
4. Speaker Clustering for a Mixture of Singing and Reading [C] . Mahnoosh Mehrabani, John H. L. Hansen Annual conference of the International Speech Communication Association . 2012

机译：说话者聚类，混合唱歌和阅读
5. The impact of singing-integrated reading instruction on the oral reading fluency and motivation of elementary students in an out-of-school time program. [D] . Moorehead-Carter, Yvette Marie. 2015

机译：在校外时间计划中，歌唱综合阅读教学对小学生口语阅读流利度和学习动机的影响。
6. Attendance at cultural events reading books or periodicals and making music or singing in a choir as determinants for survival: Swedish interview survey of living conditions. [O] . L. O. Bygren, B. B. Konlaan, S. E. Johansson 1996

机译：参加文化活动阅读书籍或期刊以及在合唱团中演唱音乐或唱歌是决定生存的因素：瑞典对生活条件的访谈。
7. Singing speaker clustering based on subspace learning in the GMM mean supervector space [O] . Mehrabani Mahnoosh, Hansen John H.L. 2013

机译：GMM平均超向量空间中基于子空间学习的歌唱者聚类。
8. Speaker Clustering for a Mixture of Singing and Reading (Preprint). [R] . Mehrabani, M., Hansen, J. H. 2012

机译：用于歌唱和阅读混合的说话者聚类（预印本）。

Speaker Clustering for a Mixture of Singing and Reading

摘要

著录项

相似文献

相关主题

期刊订阅