metadata

license: apache-2.0
base_model: distilbert/distilgpt2
tags:
  - generated_from_trainer
model-index:
  - name: DYG_DistillGPT2
    results: []

DYG_DistillGPT2

This model is a fine-tuned version of distilbert/distilgpt2 on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss
No log	1.0	8	0.6735
No log	2.0	16	0.6331
No log	3.0	24	0.6168
No log	4.0	32	0.5801
No log	5.0	40	0.5487
No log	6.0	48	0.5625
No log	7.0	56	0.5189
No log	8.0	64	0.4941
No log	9.0	72	0.4708
No log	10.0	80	0.4678