首页> 外文期刊>International journal of human-computer interaction >Nomadic Speech-Based Text Entry: A Decision Model Strategy for improved Speech to Text Processing
【24h】

Nomadic Speech-Based Text Entry: A Decision Model Strategy for improved Speech to Text Processing

机译:游牧基于语音的文本输入:一种改进的语音到文本处理的决策模型策略

获取原文
获取原文并翻译 | 示例
           

摘要

Speech text entry can be problematic during ideal dictation conditions, but difficulties are magnified when external conditions deteriorate. Motion during speech is an extraordinary condition that might have detrimental effects on automatic speech recognition. This research examined speech text entry while mobile. Speech enrollment profiles were created by participants in both a seated and walking environment. Dictation tasks were also completed in both the seated and walking conditions. Although results from an earlier study suggested that completing the enrollment process under more challenging conditions may lead to improved recognition accuracy under both challenging and less challenging conditions, the current study provided contradictory results. A detailed review of error rates confirmed that some participants minimized errors by enrolling under more challenging conditions while others benefited by enrolling under less challenging conditions. Still others minimized errors when different enrollment models were used under the opposing condition. Leveraging these insights, we developed a decision model to minimize recognition error rates regardless of the conditions experienced while completing dictation tasks. When applying the model to existing data, error rates were reduced significantly but additional research is necessary to effectively validate the proposed solution.
机译:在理想的听写条件下,语音文本输入可能会出现问题,但是当外部条件恶化时,困难会放大。语音期间的运动是一种特殊情况,可能会对自动语音识别产生不利影响。这项研究检查了移动时的语音文本输入。语音注册配置文件是由参与者在就座和步行环境中创建的。在就座和步行条件下听写任务也都完成了。尽管较早研究的结果表明,在更具挑战性的条件下完成注册过程可能会在具有挑战性和较低挑战性的条件下提高识别准确性,但当前的研究提供了矛盾的结果。对错误率的详细审查证实,一些参与者通过在更具挑战性的条件下进行注册来最大程度地减少错误,而其他参与者则从在挑战性较小的条件下进行注册受益。当在相反条件下使用不同的注册模型时,还有其他方法将错误最小化。利用这些见解,我们开发了一种决策模型,以最小化识别错误率,无论完成听写任务时遇到的条件如何。当将模型应用于现有数据时,错误率显着降低,但是必须进行其他研究才能有效地验证所提出的解决方案。

著录项

  • 来源
  • 作者单位

    Interactive Systems Research Center, UMBC, Baltimore, MD;

    Interactive Systems Research Center, UMBC, Baltimore, MD;

    Interactive Systems Research Center, UMBC, Baltimore, MD;

    Interactive Systems Research Center, UMBC, Baltimore, MD;

    UMBC, Information Systems Department, 1000 Hilltop Circle, Baltimore MD 21250;

    Georgia Institute of Technology, Atlanta;

  • 收录信息 美国《科学引文索引》(SCI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号