End of training

Files changed (12)

README.md CHANGED

@@ -3,6 +3,8 @@ license: apache-2.0
 base_model: distilgpt2
 tags:
 - generated_from_trainer
 model-index:
 - name: distilgpt2-finetuned-github_cybersecurity_READMEs
   results: []
@@ -15,7 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.5260
 ## Model description
@@ -34,21 +37,26 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3.0
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 2.4374        | 1.0   | 23960 | 2.6365          |
-| 2.3011        | 2.0   | 47920 | 2.5576          |
-| 2.2825        | 3.0   | 71880 | 2.5260          |
 ### Framework versions

 base_model: distilgpt2
 tags:
 - generated_from_trainer
+metrics:
+- accuracy
 model-index:
 - name: distilgpt2-finetuned-github_cybersecurity_READMEs
   results: []
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.4529
+- Accuracy: 0.0699
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 3e-05
+- train_batch_size: 32
+- eval_batch_size: 32
 - seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 1000
+- num_epochs: 5
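The new hyperparameters are related by simple arithmetic, which the following sketch works through. It assumes a single device, and it assumes the linear scheduler ramps the learning rate from zero over the warmup steps in the way `get_linear_schedule_with_warmup` in `transformers` does. The helper names `effective_batch_size` and `linear_warmup_lr` are hypothetical, and the total of 70 optimizer steps is taken from the last row of the results table rather than stated in the commit.

```python
# Values copied from the hyperparameter list in this diff.
LEARNING_RATE = 3e-05
TRAIN_BATCH_SIZE = 32
GRADIENT_ACCUMULATION_STEPS = 4
WARMUP_STEPS = 1000
# Assumption: the final step in the results table marks the end of training.
TOTAL_STEPS = 70


def effective_batch_size(per_device: int, accumulation: int, devices: int = 1) -> int:
    """Number of examples contributing to one optimizer step."""
    return per_device * accumulation * devices


def linear_warmup_lr(step: int, base_lr: float, warmup: int, total: int) -> float:
    """Linear warmup from 0 to base_lr, then linear decay to 0 (hypothetical helper)."""
    if step < warmup:
        return base_lr * step / max(1, warmup)
    return base_lr * max(0.0, (total - step) / max(1, total - warmup))


if __name__ == "__main__":
    print(effective_batch_size(TRAIN_BATCH_SIZE, GRADIENT_ACCUMULATION_STEPS))
    print(linear_warmup_lr(TOTAL_STEPS, LEARNING_RATE, WARMUP_STEPS, TOTAL_STEPS))
```

Under these assumptions the effective batch size is 128, matching `total_train_batch_size`, and the run would still be in warmup at step 70, with a learning rate of about 2.1e-06 rather than the configured 3e-05.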
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|
+| No log        | 0.97  | 14   | 5.6664          | 0.0648   |
+| No log        | 2.0   | 29   | 5.6299          | 0.0653   |
+| No log        | 2.97  | 43   | 5.5792          | 0.0660   |
+| No log        | 4.0   | 58   | 5.5133          | 0.0664   |
+| No log        | 4.83  | 70   | 5.4529          | 0.0699   |
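For a causal language model, evaluation loss is commonly read as a perplexity by exponentiating it. The sketch below does this for the two final losses in this diff, assuming they are mean per-token cross-entropy values in nats (the usual `Trainer` convention, though the commit does not say so). `perplexity` is a hypothetical helper.

```python
import math

# Final evaluation losses reported before and after this commit.
OLD_EVAL_LOSS = 2.5260
NEW_EVAL_LOSS = 5.4529


def perplexity(mean_cross_entropy: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_cross_entropy)


if __name__ == "__main__":
    for label, loss in (("old", OLD_EVAL_LOSS), ("new", NEW_EVAL_LOSS)):
        print(f"{label}: loss={loss:.4f} perplexity={perplexity(loss):.1f}")
```

On that reading the previous checkpoint sits near a perplexity of 12.5 and the new one near 233. The two figures are only comparable if both runs used the same evaluation set, which the commit does not state.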
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:34d4d98e61e3e181d52b603ee38384bc213f2314bbb10ef9128d546726147922
 size 327657928

 version https://git-lfs.github.com/spec/v1
+oid sha256:fcbd8d52304cab1c8974cd089e269f2f6a83616e576aecb272ab4ee24ff7a60e
 size 327657928
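
Each large file in this diff is stored as a Git LFS pointer: a `version` line, an `oid` holding a SHA-256 digest, and a `size` in bytes. A minimal sketch of parsing such a pointer and checking a downloaded file against it follows. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the local file path is something the reader would supply.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Check a local file's size and SHA-256 against a parsed pointer."""
    algo, _, expected = pointer["oid"].partition(":")
    if algo != "sha256" or path.stat().st_size != int(pointer["size"]):
        return False
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Hash in 1 MiB chunks so large model files are not read into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected


# New pointer for model.safetensors, copied from this diff.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:fcbd8d52304cab1c8974cd089e269f2f6a83616e576aecb272ab4ee24ff7a60e
size 327657928
"""

if __name__ == "__main__":
    print(parse_lfs_pointer(POINTER))
```

Because the old and new pointers report the same `size` but different digests, a size check alone would not distinguish the two checkpoints; the hash comparison is what tells them apart.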

runs/Apr13_18-16-53_DESKTOP-7EBBP1S/events.out.tfevents.1713003413.DESKTOP-7EBBP1S.223104.2 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:078eb80e06da547c32e52876691c3f2a3b24c93d984c353b8950cf6e589290c5
-size 16187

 version https://git-lfs.github.com/spec/v1
+oid sha256:736a0e9ee7307790f849dc48b50ace4d7ba1db247efba46e4796c6a66dfd8599
+size 37101

runs/Apr13_18-16-53_DESKTOP-7EBBP1S/events.out.tfevents.1713004552.DESKTOP-7EBBP1S.223104.3 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:b542c40cf2ab4de658080a01c1f43cce0e76c6ba540e736afdeb44b3725f23af
+size 411

runs/Apr13_19-14-31_DESKTOP-7EBBP1S/events.out.tfevents.1713006873.DESKTOP-7EBBP1S.225414.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:952e508f33752e442fd2c868ad7b64df7973393e0f09d1d87833d661caafd988
+size 5358

runs/Apr13_19-16-30_DESKTOP-7EBBP1S/events.out.tfevents.1713006993.DESKTOP-7EBBP1S.225414.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:53bcb31eba97ec19df2a079b37882c3cfb1ef778f513577cf806a2acfc58f429
+size 6973

runs/Apr13_19-16-30_DESKTOP-7EBBP1S/events.out.tfevents.1713007194.DESKTOP-7EBBP1S.225414.2 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:0014b163df471e0c47a006b7f27726672a41d559eaece6ef2b338b8e8ea7b7a1
+size 405

runs/Apr13_19-27-28_DESKTOP-7EBBP1S/events.out.tfevents.1713007649.DESKTOP-7EBBP1S.226230.2 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:de1c55e729854e3e7b1406266b309ae52bb79a56dac370170b4fef34da630f60
+size 6973

runs/Apr13_19-27-28_DESKTOP-7EBBP1S/events.out.tfevents.1713007732.DESKTOP-7EBBP1S.226230.3 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:45961019cc44ed68cae21ba82904658542fcf8aff9f8945fcee7146161105854
+size 405

runs/Apr13_19-41-38_DESKTOP-7EBBP1S/events.out.tfevents.1713008498.DESKTOP-7EBBP1S.227770.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:a2954de8c0567a79db494ce161be4e8c105b5e9cc070921e7f66ddca36320b22
+size 6973

runs/Apr13_19-41-38_DESKTOP-7EBBP1S/events.out.tfevents.1713008551.DESKTOP-7EBBP1S.227770.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:eedf60c40ef07f7930b1d36336324cfa6a5f5cb8661ffa92f5d332bac149a1ca
+size 405

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cb7ac9c7ac5c0a261d3a78d7d343e8188d0c6e93e7f4d53114231dce424b62a6
 size 4984

 version https://git-lfs.github.com/spec/v1
+oid sha256:e7cdaa9837824bb2f1bd1cf858f610f017082ebba79b1529db844784ec318bbd
 size 4984