首页> 外文会议>Advances in multimedia modeling >Semantic Concept Detection for User-Generated Video Content Using a Refined Image Folksonomy
【24h】

Semantic Concept Detection for User-Generated Video Content Using a Refined Image Folksonomy

机译:使用精细图像Folksonomy对用户生成的视频内容进行语义概念检测

获取原文
获取原文并翻译 | 示例

摘要

The automatic detection of semantic concepts is a key technology for enabling efficient and effective video content management. Conventional techniques for semantic concept detection in video content still suffer from several interrelated issues: the semantic gap, the imbalanced data set problem, and a limited concept vocabulary size. In this paper, we propose to perform semantic concept detection for user-created video content using an image folksonomy in order to overcome the aforementioned problems. First, an image folksonomy contains a vast amount of user-contributed images. Second, a significant portion of these images has been manually annotated by users using a wide variety of tags. However, user-supplied annotations in an image folksonomy are often characterized by a high level of noise. Therefore, we also discuss a method that allows reducing the number of noisy tags in an image folksonomy. This tag refinement method makes use of tag co-occurrence statistics. To verify the effectiveness of the proposed video content annotation system, experiments were performed with user-created image and video content available on a number of social media applications. For the datasets used, video annotation with tag refinement has an average recall rate of 84% and an average precision of 75%, while video annotation without tag refinement shows an average recall rate of 78% and an average precision of 62%.
机译:语义概念的自动检测是实现高效,有效的视频内容管理的关键技术。用于视频内容中的语义概念检测的常规技术仍然遭受几个相互关联的问题:语义鸿沟,数据集不平衡问题以及概念词汇量有限。在本文中,我们提出使用图像分类法对用户创建的视频内容执行语义概念检测,以克服上述问题。首先,图像分类法包含大量用户提供的图像。其次,这些图像的很大一部分已经由用户使用各种各样的标签手动注释。但是,图像民用解剖学中用户提供的注释通常以高噪声为特征。因此,我们还讨论了一种方法,该方法可以减少图像分类术中的噪声标签数量。这种标签细化方法利用了标签共现统计。为了验证所提出的视频内容注释系统的有效性,对用户创建的图像和视频内容在许多社交媒体应用程序上可用进行了实验。对于所使用的数据集,带有标签优化的视频注释的平均召回率为84%,平均精度为75%,而没有标签优化的视频注释的平均召回率为78%,平均精度为62%。

著录项

  • 来源
    《Advances in multimedia modeling》|2010年|p.511-521|共11页
  • 会议地点 Chongqing(CN);Chongqing(CN)
  • 作者单位

    Image and Video Systems Lab, Korea Advanced Institute of Science and Technology (KAIST),Yuseong-gu, Daejeon, 305-732, Republic of Korea;

    Image and Video Systems Lab, Korea Advanced Institute of Science and Technology (KAIST),Yuseong-gu, Daejeon, 305-732, Republic of Korea;

    Image and Video Systems Lab, Korea Advanced Institute of Science and Technology (KAIST),Yuseong-gu, Daejeon, 305-732, Republic of Korea;

    Image and Video Systems Lab, Korea Advanced Institute of Science and Technology (KAIST),Yuseong-gu, Daejeon, 305-732, Republic of Korea;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 多媒体技术与多媒体计算机;
  • 关键词

    folksonomy; semantic concept detection; tag refinement; UCC;

    机译:民间疗法语义概念检测;标签细化; UCC;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号