If you use these models, please cite the following paper: | |
@article{turc2019, | |
title={Well-Read Students Learn Better: On the Importance of Pre-training Compact Models}, | |
author={Turc, Iulia and Chang, Ming-Wei and Lee, Kenton and Toutanova, Kristina}, | |
journal={arXiv preprint arXiv:1908.08962v2 }, | |
year={2019} | |
} |