LennartKeller
/

longformer-gottbert-base-8192-aw512

Feature Extraction

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

LennartKeller commited on Feb 15, 2022

Commit

e32c6f8

·

1 Parent(s): bc04f3b

update readme

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -30,7 +30,7 @@ Longformer models enable processing long texts using a mixture of local attentio
 The [OSCAR](https://oscar-corpus.com) dataset is freely avaible corpus of filtered web texts from the Common Crawl in various languages. We used the 2017 version of the dataset.
 ## Training procedure
-The model was trained with masked language modeling for 3 epochs on a customly created 500m tokens subset of the german proportion of the oscar dataset.
 It was validated using 5% of the original subset.
 ### Training hyperparameters

 The [OSCAR](https://oscar-corpus.com) dataset is freely avaible corpus of filtered web texts from the Common Crawl in various languages. We used the 2017 version of the dataset.
 ## Training procedure
+The model was trained with masked language modeling for 3 epochs on a customly created 500 million tokens subset of the german proportion of the [OSCAR](https://oscar-corpus.com) dataset.
 It was validated using 5% of the original subset.
 ### Training hyperparameters