An Experimental Study of an Audio Indexing System for the Web

Abstract

We have developed a speech-recognition-based audio search engine for indexing spoken documents found on the World Wide Web. Our site (http://www.compaq.com/speechbot) indexes around 20 news and talk radio shows covering a wide range of topics, speaking styles and acoustic conditions from a selection of public Web sites with multimedia archives. In this paper, we describe our system and its performance, focusing on the speech recognition and retrieval aspects. We describe our training procedure in some detail and report our historical error rate since the site launch. We also investigate the impact of Out Of Vocabulary (OOV) words. Finally, we report the results of retrieval experiments which demonstrate that our system can index effectively.
