Fusing Visual and Textual Information to Determine Content Safety

机译：融合视觉和文本信息以确定内容安全性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In advertising, identifying the content safety of web pages is a significant concern since advertisers do not want brands to be associated with threatening content. At the same time, publishers would like to maximize the number of web pages on which they can place ads. Thus, a fine balance must be achieved while classifying content safety in order to satisfy both advertisers and publishers. In this paper, we propose a multimodal machine learning framework that fuses visual and textual information from web pages to improve current predictions of content safety. The primary focus is on late fusion, which involves combining final model outputs of separate modalities, such as images and text, to arrive at a single decision. This paper presents a fully automated machine learning framework that performs binary and multilabel classification using late fusion techniques. We also introduce additional work in early fusion, which involves extracting and fusing intermediate features from the two separate models. Our algorithms are applied to data extracted from relevant web pages in the advertising industry. Both of our late and early fusion methods obtain significant improvements over algorithms currently in use.

机译：在广告中，识别网页的内容安全是一个重要的问题，因为广告商不希望品牌与威胁内容相关联。与此同时，发布商希望最大化它们可以放置广告的网页数量。因此，必须在分类内容安全的同时实现精细平衡，以满足广告商和发布者。在本文中，我们提出了一种多模式机器学习框架，其融合来自网页的视觉和文本信息，以改善内容安全的当前预测。主要焦点是晚期融合，这涉及将单独模式的最终模型输出组合在一起，以单一决定。本文介绍了一种全自动的机器学习框架，使用后期融合技术进行二进制和多标签分类。我们还在早期融合中介绍了额外的工作，涉及从两个独立型号中提取和融合中间特征。我们的算法应用于广告业中相关网页提取的数据。我们的晚期和早期的融合方法都获得了目前正在使用的算法的显着改进。

著录项

来源
《IEEE International Conference on Machine Learning and Applications》|2019年|1 v.|共6页
会议地点
作者
Rodrigo Leonardo; Amber Hu; Mohammad Uzair; Qiujing Lu; Iris Fu; Keishin Nishiyama; Sooraj Mangalath Subrahmannian; Divyaa Ravichandran;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机软件;
关键词
Web pages; Visualization; Safety; Training; Machine learning; Feature extraction; Machine learning algorithms;

机译：网页;可视化;安全;培训;机器学习;特征提取;机器学习算法;

相似文献

外文文献
中文文献
专利

1. Fusing audio, visual and textual clues for sentiment analysis from multimodal content [J] . Poria Soujanya, Cambria Erik, Howard Newton, Neurocomputing . 2016,第JANa22PTaA期

机译：融合音频，视觉和文本线索以从多模式内容中进行情感分析
2. Learning analytics techniques and visualisation with textual data for determining causes of academic failure [J] . Nkhoma Clara, Duy Dang-Pham, Ai-Phuong Hoang, Behaviour & Information Technology . 2020,第7a9期

机译：学习分析技术与文本数据的可视化，以确定学术失败的原因
3. A decisive content based image retrieval approach for feature fusion in visual and textual images [J] . Unar Salahuddin, Wang Xingyuan, Wang Chunpeng, Knowledge-Based Systems . 2019,第SEPa1期

机译：基于决定性内容的图像检索方法，用于视觉和文本图像中的特征融合
4. Fusing Visual and Textual Information to Determine Content Safety [C] . Rodrigo Leonardo, Amber Hu, Mohammad Uzair, IEEE International Conference on Machine Learning and Applications . 2019

机译：融合视觉和文字信息来确定内容安全
5. Localizing Content in Videos Via Textual and Visual Queries [D] . Feng, Yang. 2020

机译：通过文本和视觉查询本地化视频中的内容
6. Automatic Detection of Pornographic and Gambling Websites Based on Visual and Textual Content Using a Decision Mechanism [O] . Yang Chen, Rongfeng Zheng, Anmin Zhou, 2020

机译：基于使用决策机制的视觉和文本内容自动检测色情和赌博网站
7. Enhanced Video Analytics for Sentiment Analysis Based on Fusing Textual, Auditory and Visual Information [O] . Sadam Al-Azani, El-Sayed M. El-Alfy 2020

机译：基于融合文本，听觉和视觉信息的情感分析增强了视频分析
8. MSHA's (Mine Safety and Health Administration's) Procedure for Determining Quartz Content of Respirable Coal Mine Dust [R] . Goldberg, S. A. , Tomb, T. F. , Kacsmar, P. M. , 1984

机译：msHa（矿山安全和健康管理局）确定可吸入煤矿粉尘石英含量的程序

Fusing Visual and Textual Information to Determine Content Safety

摘要

著录项

相似文献

相关主题

期刊订阅