Conference: SYNAT Workshop

Automatic Transcription of Polish Radio and Television Broadcast Audio


Abstract

This paper describes a Large-Vocabulary Continuous Speech Recognition (LVCSR) system for transcribing television and radio broadcast audio in Polish. It is one of the first attempts at speech recognition of Polish broadcast audio. The system uses a hybrid connectionist recognizer based on a recurrent neural network architecture. Training relies on an extensive set of manually transcribed and verified recordings of television and radio shows, supplemented by a large collection of text from online sources, mostly up-to-date news articles. The paper describes and evaluates some of the key components of the architecture and compares the system to a conventional HMM-based architecture. It also describes an application of the system to indexing and searching for terms within audio and video transcripts.
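The abstract mentions indexing and searching for terms within transcripts but gives no details. As a rough illustration only, the sketch below assumes that each transcript is available as a list of (word, start time in seconds) pairs and builds a simple inverted index over them. The function names, the data layout, and the sample recordings are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def build_index(transcripts):
    """Map each lower-cased word to a list of (recording_id, start_seconds).

    `transcripts` is assumed (hypothetically) to be a dict from a recording id
    to a list of (word, start_seconds) pairs produced by a recognizer.
    """
    index = defaultdict(list)
    for rec_id, words in transcripts.items():
        for word, start in words:
            index[word.lower()].append((rec_id, start))
    return index


def search(index, term):
    """Return all occurrences of `term`, sorted by recording id and time."""
    return sorted(index.get(term.lower(), []))


# Made-up sample data, for illustration only.
transcripts = {
    "news_001": [("dzień", 0.0), ("dobry", 0.4), ("państwu", 0.9)],
    "radio_007": [("Dobry", 12.5), ("wieczór", 13.0)],
}

index = build_index(transcripts)
hits = search(index, "dobry")
print(hits)
```

Each hit points to a recording and a time offset, which is what lets a search result jump to the matching position in the audio or video. Because Polish is highly inflected, exact matching as done here would miss other forms of the same word; a practical system would likely need lemmatization or a similar normalization step, which this sketch does not attempt.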
