MohamedRashad
/

arabic-small-nougat

vision-encoder-decoder

image-text-to-text

Inference Endpoints

Model card Files Files and versions Community

MohamedRashad commited on Nov 8, 2024

Commit

48741d4

·

verified ·

1 Parent(s): 3ae4410

Update README.md

Files changed (1) hide show

README.md +5 -2

README.md CHANGED Viewed

@@ -30,7 +30,7 @@ The model is based on the [facebook/nougat-small](https://huggingface.co/faceboo
 **Demo:** https://huggingface.co/spaces/MohamedRashad/Arabic-Nougat
 Or, use the code below to get started with the model locally.
-python
 from PIL import Image
 import torch
 from transformers import NougatProcessor, VisionEncoderDecoderModel
@@ -61,6 +61,8 @@ def predict(img_path):
     return page_sequence
 print(predict("path/to/page_image.jpg"))
 ## Bias, Risks, and Limitations
 1. **Text Hallucination:** The model may occasionally generate repeated or incorrect text due to the inherent complexities of OCR tasks.
@@ -92,7 +94,7 @@ By selecting the GPL 3.0 license, you promote the principles of open source and
 ### Citation
 If you find this model useful, please consider citing the original [facebook/nougat-small]((https://huggingface.co/facebook/nougat-small)) model and the datasets used for fine-tuning, including the [Khatt dataset](https://huggingface.co/datasets/Fakhraddin/khatt) and any details regarding the custom dataset.
-bibtex
 @misc{blecher2023nougat,
       title={Nougat: Neural Optical Understanding for Academic Documents},
       author={Lukas Blecher and Guillem Cucurull and Thomas Scialom and Robert Stojnic},
@@ -113,6 +115,7 @@ bibtex
       year={2023},
       howpublished={\url{https://huggingface.co/MohamedRashad/arabic-small-nougat}}
 }
 ### Disclaimer
 The arabic-small-nougat OCR is a tool provided "as is," and the developers make no guarantees regarding its suitability for specific tasks. Users are encouraged to thoroughly evaluate the model's output for their particular use cases and requirements.

 **Demo:** https://huggingface.co/spaces/MohamedRashad/Arabic-Nougat
 Or, use the code below to get started with the model locally.
+```python
 from PIL import Image
 import torch
 from transformers import NougatProcessor, VisionEncoderDecoderModel
     return page_sequence
 print(predict("path/to/page_image.jpg"))
+```
 ## Bias, Risks, and Limitations
 1. **Text Hallucination:** The model may occasionally generate repeated or incorrect text due to the inherent complexities of OCR tasks.
 ### Citation
 If you find this model useful, please consider citing the original [facebook/nougat-small]((https://huggingface.co/facebook/nougat-small)) model and the datasets used for fine-tuning, including the [Khatt dataset](https://huggingface.co/datasets/Fakhraddin/khatt) and any details regarding the custom dataset.
+```bibtex
 @misc{blecher2023nougat,
       title={Nougat: Neural Optical Understanding for Academic Documents},
       author={Lukas Blecher and Guillem Cucurull and Thomas Scialom and Robert Stojnic},
       year={2023},
       howpublished={\url{https://huggingface.co/MohamedRashad/arabic-small-nougat}}
 }
+```
 ### Disclaimer
 The arabic-small-nougat OCR is a tool provided "as is," and the developers make no guarantees regarding its suitability for specific tasks. Users are encouraged to thoroughly evaluate the model's output for their particular use cases and requirements.