...
首页> 外文期刊>Multimedia, IEEE Transactions on >Exploiting Web Images for Dataset Construction: A Domain Robust Approach
【24h】

Exploiting Web Images for Dataset Construction: A Domain Robust Approach

机译:利用Web图像进行数据集构建:一种领域稳健的方法

获取原文
获取原文并翻译 | 示例
           

摘要

Labeled image datasets have played a critical role in high-level image understanding. However, the process of manual labeling is both time-consuming and labor intensive. To reduce the cost of manual labeling, there has been increased research interest in automatically constructing image datasets by exploiting web images. Datasets constructed by existing methods tend to have a weak domain adaptation ability, which is known as the “dataset bias problem.” To address this issue, we present a novel image dataset construction framework that can be generalized well to unseen target domains. Specifically, the given queries are first expanded by searching the Google Books Ngrams Corpus to obtain a rich semantic description, from which the visually nonsalient and less relevant expansions are filtered out. By treating each selected expansion as a “bag” and the retrieved images as “instances,” image selection can be formulated as a multi-instance learning problem with constrained positive bags. We propose to solve the employed problems by the cutting-plane and concave-convex procedure algorithm. By using this approach, images from different distributions can be kept while noisy images are filtered out. To verify the effectiveness of our proposed approach, we build an image dataset with 20 categories. Extensive experiments on image classification, cross-dataset generalization, diversity comparison, and object detection demonstrate the domain robustness of our dataset.
机译:标记的图像数据集在高级图像理解中发挥了关键作用。但是,手动贴标签的过程既费时又费力。为了减少人工标记的成本,人们对利用Web图像自动构建图像数据集的研究兴趣越来越高。通过现有方法构造的数据集往往具有较弱的域适应能力,这被称为“数据集偏差问题”。为了解决这个问题,我们提出了一种新颖的图像数据集构建框架,可以很好地推广到看不见的目标领域。具体而言,首先通过搜索Google图书Ngrams语料库来扩展给定的查询,以获得丰富的语义描述,从中过滤掉视觉上不显眼和相关性较小的扩展。通过将每个选定的扩展视为“袋”,并将检索到的图像视为“实例”,可以将图像选择公式化为约束正袋的多实例学习问题。我们建议通过切割平面和凹凸程序算法来解决所使用的问题。通过使用这种方法,可以保留来自不同分布的图像,同时过滤掉嘈杂的图像。为了验证我们提出的方法的有效性,我们建立了一个包含20个类别的图像数据集。在图像分类,跨数据集概括,多样性比较和对象检测方面的大量实验证明了我们数据集的领域稳健性。

著录项

  • 来源
    《Multimedia, IEEE Transactions on》 |2017年第8期|1771-1784|共14页
  • 作者单位

    Global Big Data Technologies Center, University of Technology Sydney, Sydney, NSW, Australia;

    Global Big Data Technologies Center, University of Technology Sydney, Sydney, NSW, Australia;

    School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China;

    Alibaba Group, Hangzhou, China;

    Global Big Data Technologies Center, University of Technology Sydney, Sydney, NSW, Australia;

    School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Noise measurement; Manuals; Search engines; Robustness; Visualization; Google; Labeling;

    机译:噪声测量;手册;搜索引擎;稳健性;可视化;Google;标签;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号