tf-tpu/roberta-base-epochs-500-no-wd (Fill-Mask)
This organization tracks all the artifacts produced as part of our internal testing of the following example:
Training a masked language model end-to-end from scratch on TPUs
The accompanying blog post is available at https://huggingface.co/blog/tf_tpu