Open Source Speech Recognition on Edge Devices

Abstract

Deep learning has revived the field of automatic speech recognition (ASR) in the last ten years and pushed recognition rates to levels on par with humans. Applications like Siri, Amazon Alexa and Google Assistant are very popular, but have inherent privacy problems. In this paper, we evaluate state-of-the-art open source ASR models regarding their usability in a smart speaker without a cloud connection, both in terms of accuracy and runtime performance on cost-effective, low-power edge devices. We found Kaldi to be the most accurate solution and also among the fastest ones. It runs more than fast enough on an Nvidia Jetson Nano. It is still not on par with commercial cloud services, but is getting close.

Bibliographic details

Source: International Conference on Advanced Computer Information Technologies, 2020, pp. 441-445 (5 pages)
Authors: René Peinl; Basem Rizk; Robert Szabad
Original format: PDF
Keywords: Speech recognition; Graphics processing units; Hardware; Machine learning; Performance evaluation; Hidden Markov models; Random access memory

Similar literature

1. Privacy-Preserving Outsourced Speech Recognition for Smart IoT Devices [J]. Ma Zhuo, Liu Yang, Liu Ximeng. IEEE Internet of Things Journal, 2019, No. 5
2. Memory Efficient and Fast Speech Recognition System for Low-Resource Mobile Devices [J]. Hoon Chung, Ikjoo Chung. IEEE Transactions on Consumer Electronics, 2006, No. 3
3. Recognition Unit Determination of Interactive Chinese Speech Recognition for Embedded Devices [J]. Jang G.-J., Pan C., Park J.-H. IEEE Transactions on Consumer Electronics, 2012, No. 4
4. Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices [C]. Yuekai Zhang, Sining Sun, Long Ma. IEEE International Conference on Acoustics, Speech and Signal Processing, 2021
5. Source and Channel Coding for Speech Transmission and Remote Speech Recognition [D]. Bernard, Alexis Pascal. 2002
6. Effects of Active and Passive Hearing Protection Devices on Sound Source Localization, Speech Recognition, and Tone Detection [O]. Andrew D. Brown, Brianne T. Beemer, Nathaniel T. Greene. 2015
