---
license: apache-2.0
base_model: distilgpt2
tags:
  - generated_from_trainer
model-index:
  - name: tsel_distilgpt
    results: []
---

# tsel_distilgpt

This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset. It achieves the following result on the evaluation set (a minimal usage sketch follows below):

- Loss: 5.6157
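
Since the base model is distilgpt2, the checkpoint can be loaded as a causal language model for text generation. The sketch below is a minimal example only; the repository ID `dev-ninja/tsel_distilgpt` is an assumption and may need to be replaced with the actual hub path or a local checkpoint directory.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

# Assumed repository ID: replace with the actual hub path or a local checkpoint directory.
model_id = "dev-ninja/tsel_distilgpt"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# distilgpt2 is a causal language model, so the fine-tuned checkpoint is used for text generation.
generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
print(generator("Hello,", max_new_tokens=30)[0]["generated_text"])
```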

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (see the sketch after this list):

- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 10
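
As a rough guide only, the hyperparameters above map onto `transformers.TrainingArguments` as in the sketch below. The `output_dir` name and the per-epoch evaluation strategy are assumptions (the latter inferred from the per-epoch validation losses in the results table); the dataset, tokenization, and the `Trainer` call itself are omitted because the training data is not documented.

```python
from transformers import TrainingArguments

# Sketch of the listed hyperparameters as TrainingArguments (Transformers 4.31).
# The Adam betas/epsilon and the linear scheduler are also the Trainer defaults,
# so listing them explicitly is only for clarity.
training_args = TrainingArguments(
    output_dir="tsel_distilgpt",     # assumed output directory
    learning_rate=2e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    num_train_epochs=10,
    evaluation_strategy="epoch",     # inferred from the per-epoch validation losses
)
```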

### Training results

| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| No log        | 1.0   | 1    | 5.9501          |
| No log        | 2.0   | 2    | 5.8630          |
| No log        | 3.0   | 3    | 5.7924          |
| No log        | 4.0   | 4    | 5.7383          |
| No log        | 5.0   | 5    | 5.6969          |
| No log        | 6.0   | 6    | 5.6665          |
| No log        | 7.0   | 7    | 5.6445          |
| No log        | 8.0   | 8    | 5.6297          |
| No log        | 9.0   | 9    | 5.6202          |
| No log        | 10.0  | 10   | 5.6157          |

### Framework versions

- Transformers 4.31.0
- Pytorch 2.0.1+cu118
- Datasets 2.14.2
- Tokenizers 0.13.3