|
--- |
|
license: mit |
|
datasets: |
|
- bsmock/pubtables-1m |
|
- bsmock/fintabnet.c |
|
tags: |
|
- table structure recognition |
|
- table extraction |
|
--- |
|
|
|
# Model Card for TATR-v1.1-All |
|
|
|
This repo contains the model weights for TATR (Table Transformer) v1.1, trained on the PubTables-1M and FinTabNet.c datasets, using the training details in the paper: ["Aligning benchmark datasets for table structure recognition"](https://arxiv.org/abs/2303.00716). |
|
|
|
These model weights are intended to be used with [the Microsoft implementation of Table Transformer (TATR)](https://github.com/microsoft/table-transformer). |
|
|
|
This model was trained to work best on tightly cropped table images (5 pixels or less). |
|
Images with significant padding included around the table were not included in the training, and so the model may not perform well on these at inference. |
|
Use a table detection model to detect and tightly crop the table prior to passing to this model. |
|
|
|
Model weights that can be loaded into the Hugging Face implementation of TATR are coming soon. |
|
|
|
FinTabNet.c will be officially released soon. |
|
Please see [our GitHub repo](https://github.com/microsoft/table-transformer) for a script to create FinTabNet.c from the [original FinTabNet dataset](https://developer.ibm.com/exchanges/data/all/fintabnet/). |
|
|
|
## Model Details |
|
|
|
### Model Description |
|
|
|
- **Developed by:** Brandon Smock and Rohith Pesala, while at Microsoft |
|
- **License:** MIT |
|
- **Finetuned from model:** DETR ResNet-18 |
|
|
|
### Model Sources |
|
|
|
Please see the following for more details: |
|
|
|
- **Repository:** ["https://github.com/microsoft/table-transformer"](https://github.com/microsoft/table-transformer) |
|
- **Paper:** ["Aligning benchmark datasets for table structure recognition"](https://arxiv.org/abs/2303.00716) |