首页> 外国专利> System and method for inserting a description of images into audio recordings

System and method for inserting a description of images into audio recordings

机译:用于将图像的描述插入到音频记录中的系统和方法

摘要

There is disclosed a system and method for interpreting and describing graphic images. In an embodiment, the method of inserting a description of an image into an audio recording includes: interpreting an image and producing a word description of the image including at least one image keyword; parsing an audio recording into a plurality of audio clips, and producing a transcription of each audio clip, each audio clip transcription including at least one audio keyword; calculating a similarity distance between the at least one image keyword and the at least one audio keyword of each audio clip; and selecting the audio clip transcription having a shortest similarity distance to the at least one image keyword as a location to insert the word description of the image. The word description of the image can then be appended to the selected audio clip to produce an augmented audio recording including the interpreted word description of the image.
机译:公开了一种用于解释和描述图形图像的系统和方法。在一个实施例中,将图像的描述插入到音频记录中的方法包括:解释图像并产生包括至少一个图像关键词的图像的词描述;将音频记录解析成多个音频片段,并产生每个音频片段的转录,每个音频片段转录包括至少一个音频关键词;计算每个音频片段的至少一个图像关键字和至少一个音频关键字之间的相似距离;选择与所述至少一个图像关键词具有最短相似距离的音频片段转录作为插入图像词描述的位置。然后可以将图像的单词描述附加到所选的音频剪辑,以产生包括图像的解释的单词描述的增强音频记录。

著录项

  • 公开/公告号US7996227B2

    专利类型

  • 公开/公告日2011-08-09

    原文格式PDF

  • 申请/专利权人 PETER C. BOYLE;YU ZHANG;

    申请/专利号US20070866495

  • 发明设计人 PETER C. BOYLE;YU ZHANG;

    申请日2007-10-03

  • 分类号G10L11;G10L15/26;G06F17/27;G06K9/72;

  • 国家 US

  • 入库时间 2022-08-21 18:08:41

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号