
hmByT5 Preliminary
community
AI & ML interests
ByT5, historic language models
hmbyt5-preliminary's activity
Post
1539
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud (TRC) program and training Encoder-only LMs with the TensorFlow Model Garden library.
👉 Link: https://github.com/stefan-it/model-garden-lms
An overview of some features:
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
👉 Model Hub Link: https://huggingface.co/model-garden-lms
If you find these resources useful, please give them a like!
Made from Bavarian Oberland with ❤️ and 🥨.
👉 Link: https://github.com/stefan-it/model-garden-lms
An overview of some features:
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
👉 Model Hub Link: https://huggingface.co/model-garden-lms
If you find these resources useful, please give them a like!
Made from Bavarian Oberland with ❤️ and 🥨.
Adding `safetensors` variant of this model
#1 opened 4 months ago
by
SFconvertbot


stefan-it
updated
14
models
over 1 year ago

hmbyt5-preliminary/byt5-small-multilingual-4g
Text2Text Generation
•
Updated
•
28
•
1

hmbyt5-preliminary/byt5-small-historic-multilingual-span20-flax
Text2Text Generation
•
Updated
•
44

hmbyt5-preliminary/flair-hipe-2022-hipe2020-de
Token Classification
•
Updated
•
4

hmbyt5-preliminary/flair-hipe-2022-topres19th-en
Token Classification
•
Updated
•
7

hmbyt5-preliminary/flair-hipe-2022-letemps-fr
Token Classification
•
Updated
•
4

hmbyt5-preliminary/flair-icdar-fr
Token Classification
•
Updated
•
8

hmbyt5-preliminary/flair-icdar-nl
Token Classification
•
Updated
•
2

hmbyt5-preliminary/flair-hipe-2022-newseye-sv
Token Classification
•
Updated
•
8

hmbyt5-preliminary/flair-hipe-2022-newseye-fi
Token Classification
•
Updated
•
6

hmbyt5-preliminary/flair-hipe-2022-newseye-fr
Token Classification
•
Updated
•
2

hmbyt5-preliminary/flair-hipe-2022-newseye-de
Token Classification
•
Updated
•
7

hmbyt5-preliminary/flair-hipe-2022-ajmc-fr
Token Classification
•
Updated
•
8

hmbyt5-preliminary/flair-hipe-2022-ajmc-en
Token Classification
•
Updated
•
7

hmbyt5-preliminary/flair-hipe-2022-ajmc-de
Token Classification
•
Updated
•
14

stefan-it
updated
2
models
over 1 year ago