Foldering Vociemall Messages by Caller Using Text Independent Speaker Recognition

机译：呼叫者使用独立于文本的说话者识别功能将Vociemall邮件夹入文件夹

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The ability to automatically scan voicemail messages for content and caller identity cues would be a useful service. This paper describes a system which automatcally files voicemail messages into caller folders using text independent speaker recognition techniques. Callers are represented by Gaussian mixture models (GMM's). The speech for an incoming message is processed and scored against caller models created for a subscriber. A message whose matching score exceeds a threshold is filed in the matching matching score exceeds a threshold is filed in the matching caller folder; otherwise it is tagged as "unknown". The subscriber has the ability to listen to an "unknown" message and file it in the proper folder, if it exists, or create a new folder, if it does not. Such subscriber labelled messages are used to train and adapt caller models. The system has been evalauted on a database of voicemail messages collected at AT&T Labs. A set of 20 callers from this database is designated as "ingroup". Each of these callers has recorded at least 20 messages totalling 10 or more minutes in duration. A distinct set of 220 messages, each from a different caller, are designated as "outgroup". representative performance figures with threshold parameters set to ensure that out-group acceptance is low compared with ingroup rejection are the following. The average ingroup message rejection rate is 11.0

机译：自动扫描语音邮件中的内容和呼叫者身份提示的功能将是一项有用的服务。本文介绍了一种系统，该系统使用独立于文本的说话者识别技术将语音邮件自动归档到呼叫者文件夹中。呼叫者由高斯混合模型（GMM）表示。根据为订户创建的呼叫者模型对传入消息的语音进行处理并对其评分。匹配得分超过阈值的消息被存储在匹配的呼叫者文件夹中;匹配得分超过阈值的消息被存储在匹配的呼叫者文件夹中。否则将其标记为“未知”。订户有能力收听“未知”消息并将其归档在适当的文件夹（如果存在）中，或者创建一个新的文件夹（如果不存在）。这种带有用户标记的消息用于训练和调整呼叫者模型。该系统在AT＆T实验室收集的语音邮件消息数据库中得到了高度评价。该数据库中的一组20个呼叫者称为“组内”。这些呼叫者中的每一个都记录了至少20条消息，总计持续10分钟或更长时间。分别来自不同呼叫者的220条消息的不同集合被指定为“ outgroup”。以下是设置阈值参数以确保组外接受度比组内拒绝低的代表性性能数据。组内平均邮件拒绝率为11.0

著录项

来源
《6th International Conference on Spoken Language Processing ICSLP 2000 Oct.16-Oct.20 2000 Beijing International Convention Center, Beijing, China》|2000年|p.474-478|共5页
会议地点
作者
Aaron E.Rosenberg; S.Parthasarathy; Julia Hirschberg; Stephen Whittaker;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类世界各国文化与文化事业;
关键词

相似文献

外文文献
专利

1. Text-Independent/Text-Prompted Speaker Recognition by Combining Speaker-Specific GMM with Speaker Adapted Syllable-Based HMM [J] . Seiichi NAKAGAWA, Wei ZHANG, Mitsuo TAKAHASHI IEICE Transactions on Information and Systems . 2006,第3期

机译：通过结合特定于说话人的GMM和基于说话人的基于音节的HMM来实现与文本无关/提示文字的说话人识别
2. Investigation of the effect of data duration and speaker gender on text-independent speaker recognition [J] . Cemal Hanilci, Figen Ertas Computers and Electrical Engineering . 2013,第2期

机译：研究数据持续时间和说话人性别对与文本无关的说话人识别的影响
3. Speaker-specific mapping for text-independent speaker recognition [J] . Hemant Misra, Shajith Ikbal, B. Yegnanarayana Speech Communication . 2003,第3a4期

机译：特定于说话人的映射，用于与文本无关的说话人识别
4. Foldering Vociemall Messages by Caller Using Text Independent Speaker Recognition [C] . Aaron E.Rosenberg, S.Parthasarathy, Julia Hirschberg, International conference on spoken language processing . 2000

机译：使用文本独立扬声器识别来折叠Vociemall消息
5. Text-independent Speaker Recognition Using Discriminative Subspace Analysis [D] . Jiang, Weiwu 2012

机译：区分子空间分析的文本无关说话人识别
6. Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition [O] . Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali -1

机译：识别消息和使者：仿生频谱分析可增强语音和说话者识别能力
7. Frame-Level Speaker Embeddings for Text-Independent Speaker Recognition and Analysis of End-to-End Model [O] . Suwon Shon, Hao Tang, James Glass 2018

机译：帧级扬声器嵌入文本独立扬声器识别和结束模型分析

Foldering Vociemall Messages by Caller Using Text Independent Speaker Recognition

摘要

著录项

相似文献

相关主题

期刊订阅