International Journal on Document Analysis and Recognition
Comic MTL: optimized multi-task learning for comic book image analysis

Abstract

Comic book image analysis methods often propose a separate algorithm or model for each task, such as panel detection, character (body and face) detection, balloon segmentation, and text recognition. In this work, we aim to reduce the processing time of comic book image analysis by proposing a single model, called Comic MTL, that learns multiple tasks instead of using one model per task. In addition to the detection and segmentation tasks, we integrate relation analysis between balloons and characters into the Comic MTL model. The experiments are carried out on the public DCM772 and eBDtheque datasets, which contain annotations for panels, balloons, and characters, as well as the associations between balloons and characters. We show that the Comic MTL model can detect the associations between balloons and their speakers (comic characters) while also handling panel detection, character detection, and balloon segmentation, with promising results.
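The idea of one model serving several tasks can be illustrated with a small sketch. The code below is not the Comic MTL architecture, which the abstract does not specify. It only shows the general multi-task pattern, under assumed names: a shared feature extractor runs once per image, several lightweight task heads reuse its output, and the per-task losses are merged into one training objective by a weighted sum. The functions `shared_backbone`, `make_head`, `multi_task_forward`, and `combined_loss`, the head weights, and the toy "image" are all hypothetical.

```python
from typing import Callable, Dict, List

Features = List[float]


def shared_backbone(image: List[float]) -> Features:
    """Hypothetical stand-in for a shared feature extractor.

    It is called once per image, and every task head reuses the result.
    """
    mean = sum(image) / len(image)
    return [mean, max(image), min(image)]


def make_head(weights: List[float]) -> Callable[[Features], float]:
    """Build a toy task head: a fixed linear map over the shared features."""
    def head(features: Features) -> float:
        return sum(w * f for w, f in zip(weights, features))
    return head


# One head per task named in the abstract; the weights are arbitrary placeholders.
HEADS: Dict[str, Callable[[Features], float]] = {
    "panel_detection": make_head([1.0, 0.0, 0.0]),
    "character_detection": make_head([0.0, 1.0, 0.0]),
    "balloon_segmentation": make_head([0.0, 0.0, 1.0]),
    "balloon_character_association": make_head([0.5, 0.5, 0.0]),
}


def multi_task_forward(image: List[float]) -> Dict[str, float]:
    """Run the backbone once, then apply every task head to the same features."""
    features = shared_backbone(image)
    return {task: head(features) for task, head in HEADS.items()}


def combined_loss(losses: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted sum of per-task losses (an assumed, commonly used combination)."""
    return sum(weights[task] * losses[task] for task in losses)


if __name__ == "__main__":
    outputs = multi_task_forward([0.0, 0.5, 1.0])
    for task, value in outputs.items():
        print(task, value)
```

The potential saving in processing time comes from `shared_backbone` running once rather than once per task. How the actual model balances its task losses is not stated in the abstract, so the weighted sum here is only an illustrative choice.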
