rdiehlmartinez's picture
Create README.md
da52fa4
metadata
language:
  - en

BPE Tokenizer Model trained on the BabyLM dataset with a vocab size of 32768.