
Aligning symbols and objects using co-attention for understanding visual content


Abstract

A method, apparatus and system for understanding visual content includes determining at least one region proposal for an image, attending at least one symbol of the proposed image region, attending a portion of the proposed image region using information regarding the attended symbol, extracting appearance features of the attended portion of the proposed image region, fusing the appearance features of the attended image region and features of the attended symbol, projecting the fused features into a semantic embedding space having been trained using fused attended appearance features and attended symbol features of images having known descriptive messages, computing a similarity measure between the projected, fused features and fused attended appearance features and attended symbol features embedded in the semantic embedding space having at least one associated descriptive message and predicting a descriptive message for an image associated with the projected, fused features.
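The flow described in the abstract (attend to symbols, attend to a portion of the region using the attended symbol, fuse, project, compare, predict) can be illustrated with a minimal NumPy sketch. Everything specific in it is an assumption made for illustration and is not taken from the patent: region proposals and feature extractors are replaced by random arrays, attention is plain dot-product softmax, fusion is concatenation, the "trained" projection is a random matrix `W`, and similarity is cosine. The function names (`co_attend`, `embed`, `predict_message`) are hypothetical.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - np.max(x))
    return e / e.sum()


def co_attend(symbol_feats, patch_feats):
    """Attend symbols first, then attend image patches using the attended symbol.

    symbol_feats: (S, d) features of symbols found in a proposed region.
    patch_feats:  (P, d) appearance features of patches in the same region.
    """
    # Assumption: symbols are scored against the mean patch feature.
    sym_weights = softmax(symbol_feats @ patch_feats.mean(axis=0))
    attended_symbol = sym_weights @ symbol_feats            # shape (d,)
    # Patches are scored against the attended symbol.
    patch_weights = softmax(patch_feats @ attended_symbol)
    attended_appearance = patch_weights @ patch_feats       # shape (d,)
    return attended_symbol, attended_appearance


def embed(symbol_feats, patch_feats, W):
    """Fuse attended features and project them into the embedding space."""
    attended_symbol, attended_appearance = co_attend(symbol_feats, patch_feats)
    # Assumption: fusion is simple concatenation, projection is linear.
    fused = np.concatenate([attended_appearance, attended_symbol])
    z = W @ fused
    return z / (np.linalg.norm(z) + 1e-12)                  # unit length


def predict_message(query, bank, messages):
    """Return the message of the stored embedding most similar to the query."""
    sims = bank @ query          # cosine similarity, since all rows are unit length
    best = int(np.argmax(sims))
    return messages[best], float(sims[best])


# Toy data standing in for regions with known descriptive messages.
rng = np.random.default_rng(0)
d, k = 8, 4                                   # feature and embedding sizes
W = rng.standard_normal((k, 2 * d))           # stand-in for a trained projection
refs = [(rng.standard_normal((3, d)), rng.standard_normal((5, d)))
        for _ in range(2)]
messages = ["message A", "message B"]
bank = np.stack([embed(s, p, W) for s, p in refs])

# Query with the second reference example itself.
message, score = predict_message(embed(*refs[1], W), bank, messages)
```

Because the query here is one of the stored examples, the lookup returns that example's own message with a similarity of about 1. In a real system the projection would be learned from images with known descriptive messages, as the abstract states.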


Bibliographic details

Publication number: US11210572B2

Patent type:
Publication date: 2021-12-28

Original format: PDF
Applicant/assignee: SRI INTERNATIONAL

Application number: US201916717497
Inventors: AJAY DIVAKARAN; KARAN SIKKA; KARUNA AHUJA; ANIRBAN ROY

Filing date: 2019-12-17
Classification: G06K9/62; G06K9/46; G06K9/72; G06N3/08; G06K9/66
Country: US
Date indexed: 2022-08-24 23:04:16
