distilgpt2-pl / README.md

Update README.md

149d6a0 verified about 1 month ago

285 Bytes

metadata

datasets:
  - wikimedia/wikipedia
  - nthngdy/oscar-small
language:
  - pl
base_model:
  - distilbert/distilgpt2
license: apache-2.0

distilgpt2 with new tokenizer, trained from scratch with polish datasets.

Needs more training, however it's able to generate correct polish sentences.