首页> 外文期刊>Research journal of applied science, engineering and technology >Arabic Audio News Retrieval System Using Dependent Speaker Mode, Mel Frequency Cepstral Coefficient and Dynamic Time Warping Techniques
【24h】

Arabic Audio News Retrieval System Using Dependent Speaker Mode, Mel Frequency Cepstral Coefficient and Dynamic Time Warping Techniques

机译:阿拉伯音频新闻检索系统,使用相关的扬声器模式,梅尔频率倒谱系数和动态时间扭曲技术

获取原文
           

摘要

Recently, audio data has increasingly becomes one of the prevalent source of information, especially after the exponential growth of using Internet, digital libraries systems and digital mobile devices. The currently massive amount of audio data stimulates working on developing custom audio retrieval tools to facilitate the audio retrieval tasks. The most familiar audio retrieval systems are based on searching using keyword, title or authors. This study presents the feasibility of using MEL Frequency Cepstral Coefficients (MFCCs) to extract features and Dynamic Time Warping (DTW) to compare the test patterns for Arabic audio news. The study proposes and implements architecture for content based audio retrieval system that is dedicated for the Arabic Audio News. The proposed architecture (ARANEWS) utilizes automatic speech recognition for isolated Arabic keyword speech mode; template based automatic speech recognition approach, MFCCs and DTW. ARANEWS presents a style of retrieval system that based on modeling signal waves and measuring the similarity between features that are extracted from spoken queries and spoken keywords. One of the major components that compose ARANEWS system is feature Database (ARANEWSDB). ARANEWSDB stores the extracted features (MFCCs) from the spoken keywords that are prepared to retrieve Arabic audio news. ARANEWS supports using Query by Humming (QBH) and Query by Example (QBE) instead of using query by text.
机译:最近,音频数据已越来越成为一种普遍的信息来源之一,尤其是在使用Internet,数字图书馆系统和数字移动设备的指数增长之后。当前大量的音频数据刺激了开发定制音频检索工具以促进音频检索任务的工作。最熟悉的音频检索系统基于使用关键字,标题或作者的搜索。这项研究提出了使用MEL频率倒谱系数(MFCC)提取特征以及使用动态时间规整(DTW)来比较阿拉伯音频新闻的测试模式的可行性。该研究提出并实现了基于内容的音频检索系统的体系结构,该体系专用于阿拉伯语音频新闻。拟议的架构(ARANEWS)利用自动语音识别功能来隔离阿拉伯语关键字语音模式;基于模板的自动语音识别方法,MFCC和DTW。 ARANEWS提出了一种检索系统样式,该系统基于对信号波进行建模并测量从语音查询和语音关键字中提取的特征之间的相似性。组成ARANEWS系统的主要组件之一是功能数据库(ARANEWSDB)。 ARANEWSDB存储从准备检索阿拉伯音频新闻的语音关键字中提取的功能(MFCC)。 ARANEWS支持使用“按嗡嗡声查询(QBH)”和“按示例查询”(QBE),而不是按文本查询。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号