curated-table-structure-recognition

This repository provides a curated list of resources in the research domain of Table Structure Recognition (TSR), updated in my free time.

Methods based on heuristic rules is not included.

Keywords

"Table Structure Recognition" OR "Table Recognition"

Popular Benchmarks

PubTabNet Version 2: val / development (9115 samples)

Supervised training only using the training set of PubTabNet Version 2 (500777 samples)
Evaluation for PubTabNet:
- High TEDS
- High TEDS-S + High AP: to focus on the structure
- High TEDS-S only: insufficient

Paper	Group	Date	Publication	TEDS	TEDS-S	AP-50
`BGTR` Enhancing Table Structure Recognition via Bounding Box Guidance	SCUT	2024-12	ICPR-2024	96.57	97.63	-
`TableAttention` Multi-Modal Attention Based on 2D Structured Sequence for Table Recognition	CAS, UCAS, Zhongke Fanyu	2024-11	PRCV-2024	95.38, Simple-96.50, Complex-94.20	96.96, Simple-98.63, Complex-95.14	96.80
Enhancing Transformer-Based Table Structure Recognition for Long Tables	PKU	2024-11	PRCV-2024	96.77	98.82	-
`TFLOP` TFLOP: Table Structure Recognition Framework with Layout Pointer Mechanism	Upstage AI	2024-08	IJCAI-2024	98.00	98.30	-
`SEMv3` SEMv3: A Fast and Robust Approach to Table Separation Line Detection	USTC, iFLYTEK	2024-05	IJCAI-2024	97.30	97.50	-
`MuTabNet` Multi-Cell Decoder and Mutual Learning for Table Structure and Character Recognition	PFN	2024-04	ICDAR-2024	96.87, Simple-98.16, Complex-95.53	-	-
`OmniParser` OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition	Alibaba, HUST	2024-03	CVPR-2024	88.83	90.45	-
`UniTable` UniTable: Towards a Unified Framework for Table Structure Recognition via Self-Supervised Pretraining	Georgia Tech, ADP	2024-03	NeurIPS-Workshop-2024	96.50	97.89	98.43
`LinearProj` Self-Supervised Pre-Training for Table Structure Recognition Transformer	Georgia Tech, ADP	2024-02	AAAI-Workshop-2024	-	96.83, Simple-98.48, Complex-95.11	-
`CNN-transformer` High-Performance Transformers for Table Structure Recognition Need Early Convolutions	Georgia Tech, ADP	2023-11	NeurIPS-Workshop-2023	-	96.53, Simple-98.33, Complex-94.66	-
`GridFormer` GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction	Baidu, SCUT	2023-09	ACM-MM-2023	95.84	97.00	-
`DRCC` Divide Rows and Conquer Cells: Towards Structure Recognition for Large Tables	CAS, UCAS, Hikvision, CUC	2023-08	IJCAI-2023	97.80	98.90	-
An End-to-End Local Attention Based Model for Table Recognition	NII	2023-08	ICDAR-2023	96.77, Simple-98.07, Complex-95.42	-	-
`TSRFormer DQ-DETR` Robust Table Structure Recognition with Dynamic Queries Enhanced Detection Transformer	USTC, Microsoft, SJTU, Alibaba	2023-03	Pattern-Recognition.2023	-	97.50	-
`MTL-TabNet` An End-to-End Multi-Task Learning Model for Image-based Table Recognition	NII	2023-03	VISIGRAPP-2023	96.67, Simple-97.92, Complex-95.36	97.88, Simple-99.05, Complex-96.66	-
`VAST` Improving Table Structure Recognition with Visual-Alignment Sequential Coordinate Modeling	Huawei, PKU	2023-03	CVPR-2023	96.31	97.23	94.80
`SEMv2` SEMv2: Table Separation Line Detection Based on Conditional Convolution	USTC, iFLYTEK	2023-03	Pattern-Recognition.2024	-	97.50	-
`WSTabNet` Rethinking Image-based Table Recognition Using Weakly Supervised Methods	NII	2023-02	ICPRAM-2023	96.48, Simple-97.89, Complex-95.02	97.74, Simple-99.06, Complex-96.37	-
`TSRNet` Table Structure Recognition and Form Parsing by End-to-End Object Detection and Relation Parsing	CAS, UCAS	2022-12	Pattern-Recognition.2022	-	95.64	-
`SLANet` PP-StructureV2: A Stronger Document Analysis System	Baidu	2022-10	arXiv	95.89	97.01	-
`TRUST` TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers	Baidu, DUT	2022-08	arXiv	96.20	97.10	-
`TSRFormer` TSRFormer: Table Structure Recognition with Transformers	Microsoft, UCAS, USTC, SJTU	2022-08	ACM-MM-2022	-	97.50	-
`RobusTabNet` Robust Table Detection and Structure Recognition from Heterogeneous Document Images	USTC, Microsoft	2022-03	Pattern-Recognition.2023	-	97.00	-
`TableFormer` TableFormer: Table Structure Understanding with Transformers	IBM	2022-03	CVPR-2022	93.60, Simple-95.40, Complex-90.10	96.75, Simple-98.50, Complex-95.00	82.10
`SEM` Split, Embed and Merge: An accurate table structure recognizer	USTC, iFLYTEK	2021-07	Pattern-Recognition.2022	93.70, Simple-94.80, Complex-92.50	-	-
`LGPMA` LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment	Hikvision, ZJU	2021-05	ICDAR-2021	94.60	96.70	-
`TableMASTER` PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML	Ping An	2021-05	ICDAR-2021	96.18(~~96.84~~)	-	-

PubTabNet Version 2: ICDAR-2021-SLP final evaluation (9064 samples)

It is allowed to use additional third-party data or pre-trained models for performance improvement.
HTML tags that define the text style including bold, italic, strike through, superscript, and subscript should be included in the cell content.
Due to a problem with the final evaluation data set, bold tags <b> where not considered in the evaluation.

Paper	Group	Date	Publication	TEDS	TEDS-S	AP-50
`TFLOP` TFLOP: Table Structure Recognition Framework with Layout Pointer Mechanism	Upstage AI	2024-08	IJCAI-2024	96.66	98.38	-
`MuTabNet` Multi-Cell Decoder and Mutual Learning for Table Structure and Character Recognition	PFN	2024-04	ICDAR-2024	96.53, Simple-98.01, Complex-94.98	-	-
An End-to-End Local Attention Based Model for Table Recognition	NII	2023-07	ICDAR-2023	96.21, Simple-97.77, Complex-94.58	-	-
`MTL-TabNet` An End-to-End Multi-Task Learning Model for Image-based Table Recognition	NII	2023-03	VISIGRAPP-2023	96.17, Simple-97.60, Complex-94.68	-	-
`WSTabNet` Rethinking Image-based Table Recognition Using Weakly Supervised Methods	NII	2023-02	ICPRAM-2023	95.97, Simple-97.51, Complex-94.37	-	-
`CoT_SRN` Contextual transformer sequence-based recognition network for medical examination reports	SDNU	2022-12	Applied-Intelligence.2023	92.34	95.71	-

Team	Group	TEDS
Davar-Lab-OCR	Hikvision	96.36, Simple-97.88, Complex-94.78
VCGroup	Ping An	96.32, Simple-97.90, Complex-94.68
XM	USTC-NELSLIP	96.27, Simple-97.60, Complex-94.89
YG		96.11, Simple-97.38, Complex-94.79
DBJ		95.66, Simple-97.39, Complex-93.87
TAL	TAL	95.65, Simple-97.30, Complex-93.93
PaodingAI	Paoding	95.61, Simple-97.35, Complex-93.79
anyone		95.23, Simple-96.95, Complex-93.43
LTIAYN		94.84, Simple-97.18, Complex-92.40

FinTabNet Version 1.0.0

Supervised training only using the training set of FinTabNet Version 1.0.0 (91596 samples)
Evaluation for PubTabNet:
- High TEDS
- High TEDS-S only: insufficient

Paper	Group	Date	Publication	TEDS	TEDS-S	AP-50
`BGTR` Enhancing Table Structure Recognition via Bounding Box Guidance	SCUT	2024-12	ICPR-2024	-	98.89	-
`TableAttention` Multi-Modal Attention Based on 2D Structured Sequence for Table Recognition	CAS, UCAS, Zhongke Fanyu	2024-11	PRCV-2024	97.20, Simple-97.50, Complex-97.10	98.54, Simple-99.14, Complex-98.23	97.90
Enhancing Transformer-Based Table Structure Recognition for Long Tables	PKU	2024-11	PRCV-2024	96.82	99.04	-
`SPRINT` SPRINT: Script-agnostic Structure Recognition in Tables	IIT Bombay	2024-09	ICDAR-2024	-	98.03, Simple-98.35, Complex-97.74	-
`TFLOP` TFLOP: Table Structure Recognition Framework with Layout Pointer Mechanism	Upstage AI	2024-08	IJCAI-2024	99.45	99.56	-
`MuTabNet` Multi-Cell Decoder and Mutual Learning for Table Structure and Character Recognition	PFN	2024-04	ICDAR-2024	97.69	98.87	-
`OmniParser` OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition	Alibaba, HUST	2024-03	CVPR-2024	89.75	91.55	-
`UniTable` UniTable: Towards a Unified Framework for Table Structure Recognition via Self-Supervised Pretraining	Georgia Tech, ADP	2024-03	NeurIPS-Workshop-2024	-	98.89	-
`GridFormer` GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction	Baidu, SCUT	2023-09	ACM-MM-2023	-	98.63	-
An End-to-End Local Attention Based Model for Table Recognition	NII	2023-08	ICDAR-2023	95.74	98.85	-
`TSRFormer DQ-DETR` Robust Table Structure Recognition with Dynamic Queries Enhanced Detection Transformer	USTC, Microsoft, SJTU, Alibaba	2023-03	Pattern-Recognition.2023	-	98.40	-
`MTL-TabNet` An End-to-End Multi-Task Learning Model for Image-based Table Recognition	NII	2023-03	VISIGRAPP-2023	-	98.79, Simple-99.07, Complex-98.46	-
`VAST` Improving Table Structure Recognition with Visual-Alignment Sequential Coordinate Modeling	Huawei, PKU	2023-03	CVPR-2023	98.21	98.63	96.20
`WSTabNet` Rethinking Image-based Table Recognition Using Weakly Supervised Methods	NII	2023-02	ICPRAM-2023	95.32, Simple-95.24, Complex-95.41	98.72, Simple-99.06, Complex-98.33	-
`TableFormer` TableFormer: Table Structure Understanding with Transformers	IBM	2022-03	CVPR-2022	-	96.80, Simple-97.50, Complex-96.00	-

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data.md		data.md
methodology.md		methodology.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

curated-table-structure-recognition

Keywords

Table of Contents

Popular Benchmarks

PubTabNet Version 2: val / development (9115 samples)

PubTabNet Version 2: ICDAR-2021-SLP final evaluation (9064 samples)

FinTabNet Version 1.0.0

About

License

qyhou/curated-table-structure-recognition

Folders and files

Latest commit

History

Repository files navigation

curated-table-structure-recognition

Keywords

Table of Contents

Popular Benchmarks

PubTabNet Version 2: val / development (9115 samples)

PubTabNet Version 2: ICDAR-2021-SLP final evaluation (9064 samples)

FinTabNet Version 1.0.0

About

Topics

Resources

License

Stars

Watchers

Forks