
hmBERT 64k
non-profit
AI & ML interests
Pretraining Historical Multilingual Language Models
hmbert-64k's activity
Post
1539
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud (TRC) program and training Encoder-only LMs with the TensorFlow Model Garden library.
π Link: https://github.com/stefan-it/model-garden-lms
An overview of some features:
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
π Model Hub Link: https://huggingface.co/model-garden-lms
If you find these resources useful, please give them a like!
Made from Bavarian Oberland with β€οΈ and π₯¨.
π Link: https://github.com/stefan-it/model-garden-lms
An overview of some features:
- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS
I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!
π Model Hub Link: https://huggingface.co/model-garden-lms
If you find these resources useful, please give them a like!
Made from Bavarian Oberland with β€οΈ and π₯¨.

stefan-itΒ
updated
13
models
over 1 year ago

hmbert-64k/flair-hipe-2022-topres19th-en
Token Classification
β’
Updated
β’
2

hmbert-64k/flair-hipe-2022-newseye-sv
Token Classification
β’
Updated
β’
6

hmbert-64k/flair-hipe-2022-newseye-fr
Token Classification
β’
Updated
β’
7

hmbert-64k/flair-hipe-2022-newseye-fi
Token Classification
β’
Updated
β’
5

hmbert-64k/flair-hipe-2022-newseye-de
Token Classification
β’
Updated
β’
9

hmbert-64k/flair-hipe-2022-letemps-fr
Token Classification
β’
Updated
β’
6

hmbert-64k/flair-icdar-nl
Token Classification
β’
Updated
β’
3

hmbert-64k/flair-icdar-fr
Token Classification
β’
Updated
β’
6

hmbert-64k/flair-hipe-2022-hipe2020-fr
Token Classification
β’
Updated
β’
10

hmbert-64k/flair-hipe-2022-hipe2020-de
Token Classification
β’
Updated
β’
4

hmbert-64k/flair-hipe-2022-ajmc-fr
Token Classification
β’
Updated
β’
10

hmbert-64k/flair-hipe-2022-ajmc-en
Token Classification
β’
Updated
β’
6

hmbert-64k/flair-hipe-2022-ajmc-de
Token Classification
β’
Updated
β’
9

stefan-itΒ
updated
a
Space
over 1 year ago

stefan-itΒ
authored
2
papers
over 1 year ago

stefan-itΒ
authored
2
papers
almost 2 years ago