Exploiting Web Images for Dataset Construction: A Domain Robust Approach

Yazhou Yao; Jian Zhang; Fumin Shen; Xiansheng Hua; Jingsong Xu; Zhenmin Tang

首页> 外文期刊>Multimedia, IEEE Transactions on >Exploiting Web Images for Dataset Construction: A Domain Robust Approach

【24h】

Exploiting Web Images for Dataset Construction: A Domain Robust Approach

机译：利用Web图像进行数据集构建：一种领域稳健的方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Labeled image datasets have played a critical role in high-level image understanding. However, the process of manual labeling is both time-consuming and labor intensive. To reduce the cost of manual labeling, there has been increased research interest in automatically constructing image datasets by exploiting web images. Datasets constructed by existing methods tend to have a weak domain adaptation ability, which is known as the “dataset bias problem.” To address this issue, we present a novel image dataset construction framework that can be generalized well to unseen target domains. Specifically, the given queries are first expanded by searching the Google Books Ngrams Corpus to obtain a rich semantic description, from which the visually nonsalient and less relevant expansions are filtered out. By treating each selected expansion as a “bag” and the retrieved images as “instances,” image selection can be formulated as a multi-instance learning problem with constrained positive bags. We propose to solve the employed problems by the cutting-plane and concave-convex procedure algorithm. By using this approach, images from different distributions can be kept while noisy images are filtered out. To verify the effectiveness of our proposed approach, we build an image dataset with 20 categories. Extensive experiments on image classification, cross-dataset generalization, diversity comparison, and object detection demonstrate the domain robustness of our dataset.

机译：标记的图像数据集在高级图像理解中发挥了关键作用。但是，手动贴标签的过程既费时又费力。为了减少人工标记的成本，人们对利用Web图像自动构建图像数据集的研究兴趣越来越高。通过现有方法构造的数据集往往具有较弱的域适应能力，这被称为“数据集偏差问题”。为了解决这个问题，我们提出了一种新颖的图像数据集构建框架，可以很好地推广到看不见的目标领域。具体而言，首先通过搜索Google图书Ngrams语料库来扩展给定的查询，以获得丰富的语义描述，从中过滤掉视觉上不显眼和相关性较小的扩展。通过将每个选定的扩展视为“袋”，并将检索到的图像视为“实例”，可以将图像选择公式化为约束正袋的多实例学习问题。我们建议通过切割平面和凹凸程序算法来解决所使用的问题。通过使用这种方法，可以保留来自不同分布的图像，同时过滤掉嘈杂的图像。为了验证我们提出的方法的有效性，我们建立了一个包含20个类别的图像数据集。在图像分类，跨数据集概括，多样性比较和对象检测方面的大量实验证明了我们数据集的领域稳健性。

著录项

来源
《Multimedia, IEEE Transactions on》 |2017年第8期|1771-1784|共14页
作者
Yazhou Yao; Jian Zhang; Fumin Shen; Xiansheng Hua; Jingsong Xu; Zhenmin Tang;
展开▼
作者单位

Global Big Data Technologies Center, University of Technology Sydney, Sydney, NSW, Australia;

Global Big Data Technologies Center, University of Technology Sydney, Sydney, NSW, Australia;

School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China;

Alibaba Group, Hangzhou, China;

Global Big Data Technologies Center, University of Technology Sydney, Sydney, NSW, Australia;

School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Noise measurement; Manuals; Search engines; Robustness; Visualization; Google; Labeling;

机译：噪声测量;手册;搜索引擎;稳健性;可视化;Google;标签;

相似文献

外文文献
中文文献
专利

1. Construction of Diverse Image Datasets From Web Collections With Limited Labeling [J] . Niluthpol Chowdhury Mithun, Rameswar Panda, Amit K. Roy-Chowdhury Circuits and Systems for Video Technology, IEEE Transactions on . 2020,第4期

机译：使用有限标签的网络收集构建不同的图像数据集
2. Corrections to “Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss” [J] . Yang Y., Zha Z.-J., Gao Y., Multimedia, IEEE Transactions on . 2015,第2期

机译：对“通过健壮的特定于样本的损失利用Web图像进行语义视频索引”的更正
3. Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss [J] . Yang Y., Zha Z.-J., Gao Y., Multimedia, IEEE Transactions on . 2014,第6期

机译：利用Web图片获取语义视频，通过针对特定样本的强大损失进行索引
4. Exploiting web images for event recognition in consumer videos: A multiple source domain adaptation approach [C] . Duan Lixin, Xu Dong, Chang Shih-Fu Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on . 2012

机译：利用网络图像进行消费者视频中的事件识别：多源域适应方法
5. Scale domains and scale domain thresholds: Evaluating the scalability of spatial datasets. [D] . Joss, Brent N. J. 2003

机译：比例域和比例域阈值：评估空间数据集的可伸缩性。
6. Transcription network construction for large-scale microarray datasets using a high-performance computing approach [O] . Mengxia (Michelle) Zhu, Qishi Wu 2008

机译：使用高性能计算方法构建大规模微阵列数据集的转录网络
7. Exploiting Web Images for Dataset Construction: A Domain Robust Approach [O] . Yao, Yazhou, Zhang, Jian, Shen, Fumin, 2017

机译：利用Web图像进行数据集构建：一种领域稳健的方法

Exploiting Web Images for Dataset Construction: A Domain Robust Approach

摘要

著录项

相似文献

相关主题

期刊订阅