首页> 美国卫生研究院文献>Computational Intelligence and Neuroscience >An Improved Convolutional Neural Network Algorithm and Its Application in Multilabel Image Labeling
【2h】

An Improved Convolutional Neural Network Algorithm and Its Application in Multilabel Image Labeling

机译:改进的卷积神经网络算法及其在多标签图像标注中的应用

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In today's society, image resources are everywhere, and the number of available images can be overwhelming. Determining how to rapidly and effectively query, retrieve, and organize image information has become a popular research topic, and automatic image annotation is the key to text-based image retrieval. If the semantic images with annotations are not balanced among the training samples, the low-frequency labeling accuracy can be poor. In this study, a dual-channel convolution neural network (DCCNN) was designed to improve the accuracy of automatic labeling. The model integrates two convolutional neural network (CNN) channels with different structures. One channel is used for training based on the low-frequency samples and increases the proportion of low-frequency samples in the model, and the other is used for training based on all training sets. In the labeling process, the outputs of the two channels are fused to obtain a labeling decision. We verified the proposed model on the Caltech-256, Pascal VOC 2007, and Pascal VOC 2012 standard datasets. On the Pascal VOC 2012 dataset, the proposed DCCNN model achieves an overall labeling accuracy of up to 93.4% after 100 training iterations: 8.9% higher than the CNN and 15% higher than the traditional method. A similar accuracy can be achieved by the CNN only after 2,500 training iterations. On the 50,000-image dataset from Caltech-256 and Pascal VOC 2012, the performance of the DCCNN is relatively stable; it achieves an average labeling accuracy above 93%. In contrast, the CNN reaches an accuracy of only 91% even after extended training. Furthermore, the proposed DCCNN achieves a labeling accuracy for low-frequency words approximately 10% higher than that of the CNN, which further verifies the reliability of the proposed model in this study.
机译:在当今社会中,图像资源无处不在,可用图像的数量可能非常庞大。确定如何快速有效地查询,检索和组织图像信息已成为一个流行的研究主题,而自动图像标注是基于文本的图像检索的关键。如果带有注释的语义图像在训练样本之间不平衡,则低频标记准确性可能会很差。在这项研究中,设计了双通道卷积神经网络(DCCNN)以提高自动标记的准确性。该模型集成了具有不同结构的两个卷积神经网络(CNN)通道。一个信道用于基于低频样本进行训练,并增加了模型中低频样本的比例,另一信道用于基于所有训练集进行训练。在标记过程中,两个通道的输出融合在一起以获得标记决策。我们在Caltech-256,Pascal VOC 2007和Pascal VOC 2012标准数据集上验证了提出的模型。在Pascal VOC 2012数据集上,提出的DCCNN模型经过100次训练迭代后,整体标签准确率达到了93.4%:比CNN高8.9%,比传统方法高15%。仅在经过2500次训练迭代后,CNN才能达到类似的精度。在Caltech-256和Pascal VOC 2012的50,000张图像数据集上,DCCNN的性能相对稳定;它的平均贴标精度达到93%以上。相比之下,即使经过长时间的训练,CNN的准确率也仅为91%。此外,提出的DCCNN对低频单词的标注准确度比CNN高约10%,这进一步验证了该模型在本研究中的可靠性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号