Exploring multi-tasking learning in document attribute classification

Mondal Tanmoy; Das Abhijit; Ming Zuheng

首页> 外文期刊>Pattern recognition letters >Exploring multi-tasking learning in document attribute classification

【24h】

Exploring multi-tasking learning in document attribute classification

机译：Exploring multi-tasking learning in document attribute classification

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

In this work, we adhere to explore a Multi-Tasking learning (MTL) based network to perform document attribute classification such as the font type, font size, font emphasis and scanning resolution classification of a document image. To accomplish these tasks, we operate on either segmented word level or on uniformed size patches randomly cropped out of the document. Furthermore, a hybrid convolution neural network (CNN) architecture "MTL+MI", which is based on the combination of MTL and Multi Instance (MI) of patch and word is used to accomplish joint learning for the classification of the same document attributes. The contribution of this paper are three fold: firstly, based on segmented word images and patches, we present a MTL based network for the classification of a full document image. Secondly, we propose a MTL and MI (using segmented words and patches) based combined CNN architecture ("MTL+MI") for the classification of same document attributes. Thirdly, based on the multi-tasking classifications of the words and/or patches, we propose an intelligent voting system which is based on the posterior probabilities of each words and/or patches to perform the classification of document's attributes of complete document image. (c) 2022 Published by Elsevier B.V.

著录项

来源
《Pattern recognition letters》 |2022年第5期|49-59|共11页
作者
Mondal Tanmoy; Das Abhijit; Ming Zuheng;
展开▼
作者单位

IMT Atlantique, Math & Elect Engn Dept, Brest, France;

Univ La Rochelle, L3i, La Rochelle, France;

BITS Pilani, Dept Comp Sci & Informat Syst, Hyderabad Campus, Hyderabad, India;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种英语
中图分类
关键词
Multi-tasks Learning; Multi-instance Learning; Weighted Multi-task Learning; Convolutional Neural Networks; ResNet; Font Size Recognition; Font Type Recognition; Font Emphasis Recognition; Scanning Resolution Recognition;

Exploring multi-tasking learning in document attribute classification

摘要

著录项

相关主题

期刊订阅