首页> 外文会议>IEEE International Conference on Machine Learning and Applications >Fusing Visual and Textual Information to Determine Content Safety
【24h】

Fusing Visual and Textual Information to Determine Content Safety

机译:融合视觉和文本信息以确定内容安全性

获取原文

摘要

In advertising, identifying the content safety of web pages is a significant concern since advertisers do not want brands to be associated with threatening content. At the same time, publishers would like to maximize the number of web pages on which they can place ads. Thus, a fine balance must be achieved while classifying content safety in order to satisfy both advertisers and publishers. In this paper, we propose a multimodal machine learning framework that fuses visual and textual information from web pages to improve current predictions of content safety. The primary focus is on late fusion, which involves combining final model outputs of separate modalities, such as images and text, to arrive at a single decision. This paper presents a fully automated machine learning framework that performs binary and multilabel classification using late fusion techniques. We also introduce additional work in early fusion, which involves extracting and fusing intermediate features from the two separate models. Our algorithms are applied to data extracted from relevant web pages in the advertising industry. Both of our late and early fusion methods obtain significant improvements over algorithms currently in use.
机译:在广告中,识别网页的内容安全是一个重要的问题,因为广告商不希望品牌与威胁内容相关联。与此同时,发布商希望最大化它们可以放置广告的网页数量。因此,必须在分类内容安全的同时实现精细平衡,以满足广告商和发布者。在本文中,我们提出了一种多模式机器学习框架,其融合来自网页的视觉和文本信息,以改善内容安全的当前预测。主要焦点是晚期融合,这涉及将单独模式的最终模型输出组合在一起,以单一决定。本文介绍了一种全自动的机器学习框架,使用后期融合技术进行二进制和多标签分类。我们还在早期融合中介绍了额外的工作,涉及从两个独立型号中提取和融合中间特征。我们的算法应用于广告业中相关网页提取的数据。我们的晚期和早期的融合方法都获得了目前正在使用的算法的显着改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号