首页> 外文会议>Annual conference of the International Speech Communication Association >Language Modeling for Voice-Enabled Social TV Using Tweets
【24h】

Language Modeling for Voice-Enabled Social TV Using Tweets

机译:使用推文的启用语音的社交电视的语言建模

获取原文

摘要

Social TV is a recent trend that integrates social media access and TV viewing. In this paper, we investigate approaches for building effective language models for a voice-enabled social TV application, where viewers can speak their social media updates while watching TV. We propose to take advantage of social media data, more specifically TV-related Twitter messages (tweets). The challenge is the noisy nature of Twitter data. Our contributions are as follows. First, we collect TV show related tweets and provide a detailed analysis of the style mismatch between written tweets and spoken language. Second, we propose a learning based approach for transforming tweets to be more suitable for language modeling. This transformation considers lexical, phonetic and contextual similarity between the misspellings and the canonical form. Third, we build the language models from normalized TV-related tweets along with other data resources that are weighted to optimize speech recognition performance. The model created via normalized tweets achieved higher performance.
机译:社交电视是融合社交媒体访问和电视观看的最新趋势。在本文中,我们研究了为具有语音功能的社交电视应用程序建立有效的语言模型的方法,在这种模式下,观众可以边看电视边说自己的社交媒体更新。我们建议利用社交媒体数据,尤其是与电视相关的Twitter消息(推文)。挑战在于Twitter数据的嘈杂性。我们的贡献如下。首先,我们收集与电视节目相关的推文,并详细分析书面推文和口头语言之间的风格不匹配。其次,我们提出了一种基于学习的方法来转换推文,使其更适合于语言建模。这种转换考虑了拼写错误和规范形式之间的词汇,语音和上下文相似性。第三,我们从标准化的电视相关推文以及其他加权优化语音识别性能的数据资源构建语言模型。通过规范化推文创建的模型具有更高的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号