bsmock
/

TATR-v1.1-All

table structure recognition

table extraction

Model card Files Files and versions Community

TATR-v1.1-All / README.md

bsmock's picture

Update README.md

30e6112 about 1 year ago

|

history blame contribute delete

1.74 kB

	---
	license: mit
	datasets:
	- bsmock/pubtables-1m
	- bsmock/fintabnet.c
	tags:
	- table structure recognition
	- table extraction
	---

	# Model Card for TATR-v1.1-All

	This repo contains the model weights for TATR (Table Transformer) v1.1, trained on the PubTables-1M and FinTabNet.c datasets, using the training details in the paper: ["Aligning benchmark datasets for table structure recognition"](https://arxiv.org/abs/2303.00716).

	These model weights are intended to be used with [the Microsoft implementation of Table Transformer (TATR)](https://github.com/microsoft/table-transformer).

	This model was trained to work best on tightly cropped table images (5 pixels or less).
	Images with significant padding included around the table were not included in the training, and so the model may not perform well on these at inference.
	Use a table detection model to detect and tightly crop the table prior to passing to this model.

	Model weights that can be loaded into the Hugging Face implementation of TATR are coming soon.

	FinTabNet.c will be officially released soon.
	Please see [our GitHub repo](https://github.com/microsoft/table-transformer) for a script to create FinTabNet.c from the [original FinTabNet dataset](https://developer.ibm.com/exchanges/data/all/fintabnet/).

	## Model Details

	### Model Description

	- Developed by: Brandon Smock and Rohith Pesala, while at Microsoft
	- License: MIT
	- Finetuned from model: DETR ResNet-18

	### Model Sources

	Please see the following for more details:

	- Repository: ["https://github.com/microsoft/table-transformer"](https://github.com/microsoft/table-transformer)
	- Paper: ["Aligning benchmark datasets for table structure recognition"](https://arxiv.org/abs/2303.00716)