Comic MTL: optimized multi-task learning for comic book image analysis

Nhu-Van Nguyen; Rigaud Christophe; Burie Jean-Christophe

首页> 外文期刊>International Journal on Document Analysis and Recognition >Comic MTL: optimized multi-task learning for comic book image analysis

【24h】

Comic MTL: optimized multi-task learning for comic book image analysis

机译：漫画MTL：针对漫画图像分析而优化的多任务学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Comic book image analysis methods often propose multiple algorithms or models for multiple tasks like panel and character (body and face) detection, balloon segmentation, text recognition, etc. In this work, we aim to reduce the processing time for comic book image analysis by proposing one model that can learn multiple tasks called Comic MTL instead of using one model per task. In addition to detection and segmentation tasks, we integrate the relation analysis task for balloons and characters into the Comic MTL model. The experiments are carried out on DCM772 and eBDtheque public datasets that contain the annotations for panels, balloons, characters and also the associations between balloon and character. We show that the Comic MTL model can detect the associations between balloons and their speakers (comic characters) and handle other tasks like panel and character detection and also balloons segmentation with promising results.

机译：漫画图像分析方法通常针对多种任务（例如面板和角色（身体和面部）检测，气球分割，文本识别等）提出多种算法或模型。在这项工作中，我们旨在通过以下方式减少漫画图像分析的处理时间：提出一种可以学习多个任务的模型，称为Comic MTL，而不是每个任务使用一个模型。除了检测和分段任务，我们还将气球和角色的关系分析任务集成到Comic MTL模型中。实验是在DCM772和eBDtheque公开数据集上进行的，该数据集包含面板，气球，角色的注释以及气球和角色之间的关联。我们表明，Comic MTL模型可以检测气球及其说话者（漫画人物）之间的关联，并可以处理其他任务，例如面板和角色检测以及气球分割，而且效果可观。

著录项

来源
《International Journal on Document Analysis and Recognition》 |2019年第3期|265-284|共20页
作者
Nhu-Van Nguyen; Rigaud Christophe; Burie Jean-Christophe;
展开▼
作者单位

Univ La Rochelle SAIL Joint Lab Lab L3i F-17042 La Rochelle 1 France;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Comic book image analysis; Association balloon-character; Multi-task learning; CNN; Deep learning;

机译：漫画形象分析;协会气球字符;多任务学习;CNN;深度学习;

相似文献

外文文献
中文文献
专利

1. Images for Little Architects. Architecture and Architectural Drawing in Childrena??s Books and Comics: An Interesting Case Study [J] . Casonato Camilla Proceedings . 2017,第9期

机译：小建筑师的图像。儿童图书和漫画中的建筑和建筑制图：一个有趣的案例研究
2. Knowledge-driven understanding of images in comic books [J] . Michael Lesk Computing reviews . 2016,第2期

机译：知识驱动的漫画图像理解
3. Knowledge-driven understanding of images in comic books [J] . Rigaud Christophe, Guerin Clement, Karatzas Dimosthenis, International Journal on Document Analysis and Recognition . 2015,第3期

机译：知识驱动的漫画图像理解
4. Multi-task Model for Comic Book Image Analysis [C] . Nhu-Van Nguyen, Christophe Rigaud, Jean-Christophe Burie International conference on multimedia modeling . 2019

机译：漫画图像分析的多任务模型
5. Re-Imagining the Conquest in Comics: Adoption of Colonial Spanish American Narrative and Image in Contemporary Comic Books [D] . Jones, Braeden. 2020

机译：重新想象征服漫画：采用殖民地西班牙美国叙事和图像在当代漫画书中
6. Homepage to distribute the anatomy learning contents including Visible Korean products comics and books [O] . Beom Sun Chung, Min Suk Chung 2018

机译：分发解剖学学习内容的主页包括可见的韩国产品漫画和书籍
7. Revitalizing the Stagnating U.S. Comic Book Industry: A Historic Analysis and Creative Fusion of American and Japanese Comic Books [O] . Meade Alayna Naomi 2017

机译：振兴停滞的美国漫画产业：美日漫画的历史分析与创意融合

Comic MTL: optimized multi-task learning for comic book image analysis

摘要

著录项

相似文献

相关主题

期刊订阅