Update README.md
Browse files
README.md
CHANGED
@@ -143,4 +143,27 @@ print(processor.decode(predictions[0], skip_special_tokens=True))
|
|
143 |
|
144 |
# Contribution
|
145 |
|
146 |
-
This model was originally contributed by Kenton Lee, Mandar Joshi et al. and added to the Hugging Face ecosystem by [Younes Belkada](https://huggingface.co/ybelkada).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
143 |
|
144 |
# Contribution
|
145 |
|
146 |
+
This model was originally contributed by Kenton Lee, Mandar Joshi et al. and added to the Hugging Face ecosystem by [Younes Belkada](https://huggingface.co/ybelkada).
|
147 |
+
|
148 |
+
# Citation
|
149 |
+
|
150 |
+
If you want to cite this work, please consider citing the original paper:
|
151 |
+
```
|
152 |
+
@misc{https://doi.org/10.48550/arxiv.2210.03347,
|
153 |
+
doi = {10.48550/ARXIV.2210.03347},
|
154 |
+
|
155 |
+
url = {https://arxiv.org/abs/2210.03347},
|
156 |
+
|
157 |
+
author = {Lee, Kenton and Joshi, Mandar and Turc, Iulia and Hu, Hexiang and Liu, Fangyu and Eisenschlos, Julian and Khandelwal, Urvashi and Shaw, Peter and Chang, Ming-Wei and Toutanova, Kristina},
|
158 |
+
|
159 |
+
keywords = {Computation and Language (cs.CL), Computer Vision and Pattern Recognition (cs.CV), FOS: Computer and information sciences, FOS: Computer and information sciences},
|
160 |
+
|
161 |
+
title = {Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding},
|
162 |
+
|
163 |
+
publisher = {arXiv},
|
164 |
+
|
165 |
+
year = {2022},
|
166 |
+
|
167 |
+
copyright = {Creative Commons Attribution 4.0 International}
|
168 |
+
}
|
169 |
+
```
|