Update README.md
Browse files
README.md
CHANGED
@@ -17,3 +17,4 @@ This model is part of the experiments in my master's thesis titled "Linguistic S
|
|
17 |
|
18 |
The model is trained on the BabyLM 10M dataset, with a RobertaTokenizer pretrained on the BabyLM 10M dataset with 16K tokens (https://huggingface.co/omarmomen/babylm_bpe_tokenizer_16k).
|
19 |
|
|
|
|
17 |
|
18 |
The model is trained on the BabyLM 10M dataset, with a RobertaTokenizer pretrained on the BabyLM 10M dataset with 16K tokens (https://huggingface.co/omarmomen/babylm_bpe_tokenizer_16k).
|
19 |
|
20 |
+
https://arxiv.org/abs/2403.09714
|