...
首页> 外文期刊>Intelligenza Artificiale >Integration of numeric and symbolic information for semantic image interpretation
【24h】

Integration of numeric and symbolic information for semantic image interpretation

机译:集成数字和符号信息以进行语义图像解释

获取原文
获取原文并翻译 | 示例
           

摘要

Semantic image interpretation (SII) is the process of generating meaningful descriptions of the content of images. Background knowledge (BK), in the form of logical theories, is extremely useful for SII. State-of-the-art algorithms for SII mainly adopt a bottom-up approach, which generates semantic interpretations of images starting from their low-level features. In these approaches BK is used only at a late stage for both enriching the semantic descriptions and improving image retrieval. In this paper, we show how BK plays an important role also during the early phase of SII. To this aim, we propose: (i) a reference framework where a semantic image description is a partial model of the BK. The elements of the partial model are grounded (linked) to a (set of) image segment(s). (ii) A loss function that evaluates how well this partial model fits the picture; (iii) a clustering-based optimization process that searches the partial model that better fits a picture. BK is used to prune branches of the search space that correspond to partial models which are inconsistent with BK. To evaluate our approach, we built a gold standard dataset of 203 pictures annotated with complex objects and their parts. We also evaluated our method on a reference dataset in Computer Vision, namely, the PASCAL-Part dataset. The results are positive. The evaluation assumes a perfect detection of parts. To understand the impact of a realistic (and noisy) part detection on our algorithm, we did a preliminary evaluation by implementing the entire SII pipeline. Part detection is performed by a recent deep learning architecture trained for detecting parts. From a qualitative analysis, it emerges that recognizing complex objects starting from parts in some cases gets better results than detecting complex objects directly.
机译:语义图像解释(SII)是生成有意义的图像内容描述的过程。逻辑理论形式的背景知识(BK)对SII极为有用。 SII的最新算法主要采用自下而上的方法,该方法从图像的低级特征开始生成图像的语义解释。在这些方法中,仅在后期才使用BK来丰富语义描述和改进图像检索。在本文中,我们展示了BK在SII早期也如何发挥重要作用。为此,我们提出:(i)一个参考框架,其中语义图像描述是BK的部分模型。局部模型的元素被接地(链接)到一个(一组)图像片段。 (ii)损失函数,用于评估该局部模型对图片的拟合程度; (iii)基于聚类的优化过程,该过程搜索更适合图片的部分模型。 BK用于修剪与BK不一致的部分模型对应的搜索空间分支。为了评估我们的方法,我们建立了黄金标准数据集,其中包含203张图片,其中标注了复杂的对象及其零件。我们还在计算机视觉中的参考数据集(即PASCAL-Part数据集)上评估了我们的方法。结果是肯定的。评估假设对零件进行了完美检测。为了理解现实的(有噪声的)零件检测对我们算法的影响,我们通过实现整个SII管道进行了初步评估。零件检测由经过训练可检测零件的最新深度学习体系结构执行。从定性分析中可以看出,在某些情况下,从零件开始识别复杂对象比直接检测复杂对象会获得更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号