International Conference on Electrical and Electronics Engineering

A New Image Captioning Approach for Visually Impaired People


Abstract

Automatic caption generation in natural language to describe the visual content of an image has attracted increasing attention over the last decade due to its potential applications. Generating captions with proper linguistic properties is a challenging task, as it requires a level of image understanding that goes far beyond image classification and object detection. In this paper, we propose to use the Stanford CoreNLP model to generate captions after image features are learned with the VGG16 deep learning architecture. The visual attributes of the images, which convey richer content, are extracted with VGG16 and then fed into the Stanford model for caption generation. Experimental results on the MSCOCO dataset show that the proposed model consistently and significantly outperforms state-of-the-art approaches across different evaluation metrics.
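As an illustration of the two-stage pipeline the abstract describes (VGG16 features feeding a separate caption-generation stage), the following is a minimal sketch, not the authors' code: it assumes a Keras/TensorFlow environment, uses the fc2 layer of an ImageNet-pretrained VGG16 as the visual-attribute vector, and the extract_features helper is a hypothetical name introduced here for illustration.

```python
# Sketch of the VGG16 feature-extraction step (assumed setup, not the paper's code).
import numpy as np
from tensorflow.keras.applications.vgg16 import VGG16, preprocess_input
from tensorflow.keras.preprocessing import image
from tensorflow.keras.models import Model

# Load VGG16 pretrained on ImageNet and expose the 4096-d fc2 activations
# as the visual-attribute vector for each image.
base = VGG16(weights="imagenet")
feature_extractor = Model(inputs=base.input, outputs=base.get_layer("fc2").output)

def extract_features(img_path):
    """Return a 4096-d VGG16 feature vector for a single image file."""
    img = image.load_img(img_path, target_size=(224, 224))   # VGG16 input size
    x = image.img_to_array(img)
    x = preprocess_input(np.expand_dims(x, axis=0))           # VGG16 preprocessing
    return feature_extractor.predict(x)[0]

# features = extract_features("example.jpg")
# In the paper, such features are then passed to a Stanford CoreNLP-based
# module that produces the natural-language caption; that stage is not
# sketched here.
```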

