Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context

Abstract

Documents are often used for knowledge sharing and preservation in business and science, and the tables within them capture most of the critical data. Unfortunately, most documents are stored and distributed as PDFs or scanned images, which fail to preserve logical table structure. Recent vision-based deep learning approaches have been proposed to address this gap, but most still cannot achieve state-of-the-art results. We present Global Table Extractor (GTE), a vision-guided systematic framework for joint table detection and cell structure recognition that can be built on top of any object detection model. With GTE-Table, we introduce a new penalty based on the natural cell containment constraint of tables to train our table network, aided by cell location predictions. GTE-Cell is a new hierarchical cell detection network that leverages table styles. Further, we design a method to automatically label table and cell structure in existing documents, which cheaply creates a large corpus of training and test data. We use this method to enhance PubTabNet with cell labels and to create FinTabNet. Together these provide real-world, complex scientific and financial datasets with detailed table structure annotations for training and testing structure recognition. Our framework surpasses previous state-of-the-art results on the ICDAR 2013 and ICDAR 2019 table competitions in both table detection and cell structure recognition. Further experiments demonstrate a greater than 45% improvement in cell structure recognition over a vanilla RetinaNet object detection model on our new out-of-domain FinTabNet dataset.
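
The abstract does not give the form of the cell containment penalty, so the following is only an illustrative sketch of the underlying idea: cells predicted for a table should lie inside that table's bounding box. The box format, the function names, and the scoring rule (mean fraction of cell area falling outside the table) are all assumptions made for this example and are not taken from the paper.

```python
from typing import List, Tuple

# Hypothetical box format for this sketch: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def area(box: Box) -> float:
    """Area of a box; degenerate or inverted boxes count as zero."""
    x0, y0, x1, y1 = box
    return max(0.0, x1 - x0) * max(0.0, y1 - y0)


def intersection(a: Box, b: Box) -> Box:
    """Overlap region of two boxes (may be degenerate if they do not overlap)."""
    return (max(a[0], b[0]), max(a[1], b[1]), min(a[2], b[2]), min(a[3], b[3]))


def containment_penalty(table: Box, cells: List[Box]) -> float:
    """Illustrative penalty, NOT the paper's formulation.

    For every cell that overlaps the table, measure the fraction of the
    cell's area lying outside the table, then average those fractions.
    Cells with no overlap are ignored (they may belong to another table).
    """
    fractions = []
    for cell in cells:
        cell_area = area(cell)
        inside = area(intersection(table, cell))
        if cell_area == 0.0 or inside == 0.0:
            continue
        fractions.append(1.0 - inside / cell_area)
    if not fractions:
        return 0.0
    return sum(fractions) / len(fractions)


if __name__ == "__main__":
    table_box = (0.0, 0.0, 10.0, 10.0)
    cell_boxes = [
        (1.0, 1.0, 3.0, 3.0),      # fully inside the table
        (8.0, 8.0, 12.0, 10.0),    # half inside, half outside
        (20.0, 20.0, 22.0, 22.0),  # no overlap, ignored
    ]
    print(containment_penalty(table_box, cell_boxes))
```

A score of zero means every overlapping cell sits entirely within the table box. Larger values indicate that the table boundary cuts through predicted cells, which is the kind of inconsistency a containment constraint is meant to discourage.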

Bibliographic details

Source
IEEE Winter Conference on Applications of Computer Vision | 2021 | pp. 697-706 | 10 pages
Authors
Xinyi Zheng; Douglas Burdick; Lucian Popa; Xu Zhong; Nancy Xin Ru Wang
Original format
PDF
Keywords
Training; Deep learning; Visualization; Computer vision; Systematics; Design methodology; Conferences


