naverpapago
/

garnet

Scene Text Removal

Model card Files Files and versions Community

garnet / README.md

Hyeonsu's picture

Update draft of model card

97d7ab8 almost 2 years ago

|

history blame contribute delete

2.14 kB

	---
	license: apache-2.0
	tags:
	- Scene Text Removal
	- Image to Image
	library_name: pytorch
	---

	### GaRNet

	This is text-removal model that introduced in the paper below and first released at [this page](https://github.com/naver/garnet). \
	[The Surprisingly Straightforward Scene Text Removal Method With Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis](https://arxiv.org/abs/2210.07489). \
	Hyeonsu Lee, Chankyu Choi \
	Naver Corp. \
	In ECCV 2022.

	### Model description

	GaRNet is a generator that create non-text image with given image and coresponding text box mask. It consists of convolution encoder and decoder. The encoder consists of residual block with attention module called Gated Attention.

	Gated Attention module has two Spatial attention branch. Each attention branch finds text stroke or its surrounding regions. The module adjusts the weight of these two domains by trainable parameters.

	The model was trained in PatchGAN manner with Region-of-Interest Generation. \
	The discriminator is consists of convolution encoder. Given an image, it determines whether each patch, which indicates text-box regions, is real or fake.
	All loss functions treat non-textbox regions as 'don't care'.

	### Intended uses & limitations

	This model can be used for areas that require the process of erasing text from an image, such as concealment private information, text editing.\
	You can use the raw model or pre-trained model.\
	Note that pre-trained model was trained in both Synthetic and SCUT_EnsText dataset. And the SCUT-EnsText dataset can only be used for non-commercial research purposes.

	### How to use

	You can use inference code in [this page](https://github.com/naver/garnet).

	### BibTeX entry and citation info

	```
	@inproceedings{lee2022surprisingly,
	title={The Surprisingly Straightforward Scene Text Removal Method with Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis},
	author={Lee, Hyeonsu and Choi, Chankyu},
	booktitle={European Conference on Computer Vision},
	pages={457--472},
	year={2022},
	organization={Springer}
	}
	```