ypl/bart_game_clean

Browse files

Files changed (4) hide show

README.md +23 -42
config.json +1 -1
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,4 +1,6 @@
 ---
 tags:
 - generated_from_trainer
 model-index:
@@ -11,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # bart_test_p2
-This model was trained from scratch on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0194
 ## Model description
@@ -42,46 +44,25 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 0.0273        | 0.08  | 500   | 0.0224          |
-| 0.0255        | 0.16  | 1000  | 0.0215          |
-| 0.0245        | 0.24  | 1500  | 0.0213          |
-| 0.0234        | 0.32  | 2000  | 0.0211          |
-| 0.025         | 0.39  | 2500  | 0.0207          |
-| 0.0243        | 0.47  | 3000  | 0.0208          |
-| 0.0236        | 0.55  | 3500  | 0.0206          |
-| 0.0246        | 0.63  | 4000  | 0.0204          |
-| 0.0235        | 0.71  | 4500  | 0.0202          |
-| 0.0231        | 0.79  | 5000  | 0.0203          |
-| 0.0221        | 0.87  | 5500  | 0.0201          |
-| 0.0239        | 0.95  | 6000  | 0.0199          |
-| 0.0209        | 1.03  | 6500  | 0.0200          |
-| 0.0193        | 1.1   | 7000  | 0.0198          |
-| 0.0207        | 1.18  | 7500  | 0.0199          |
-| 0.0189        | 1.26  | 8000  | 0.0201          |
-| 0.0193        | 1.34  | 8500  | 0.0200          |
-| 0.0186        | 1.42  | 9000  | 0.0197          |
-| 0.0199        | 1.5   | 9500  | 0.0197          |
-| 0.0207        | 1.58  | 10000 | 0.0195          |
-| 0.0199        | 1.66  | 10500 | 0.0196          |
-| 0.0188        | 1.74  | 11000 | 0.0195          |
-| 0.0194        | 1.81  | 11500 | 0.0194          |
-| 0.0201        | 1.89  | 12000 | 0.0195          |
-| 0.0181        | 1.97  | 12500 | 0.0194          |
-| 0.0177        | 2.05  | 13000 | 0.0194          |
-| 0.0161        | 2.13  | 13500 | 0.0196          |
-| 0.0172        | 2.21  | 14000 | 0.0195          |
-| 0.0184        | 2.29  | 14500 | 0.0195          |
-| 0.0168        | 2.37  | 15000 | 0.0195          |
-| 0.0176        | 2.44  | 15500 | 0.0194          |
-| 0.0177        | 2.52  | 16000 | 0.0194          |
-| 0.0158        | 2.6   | 16500 | 0.0194          |
-| 0.0177        | 2.68  | 17000 | 0.0193          |
-| 0.0179        | 2.76  | 17500 | 0.0193          |
-| 0.0167        | 2.84  | 18000 | 0.0194          |
-| 0.0177        | 2.92  | 18500 | 0.0193          |
-| 0.0171        | 3.0   | 19000 | 0.0194          |
 ### Framework versions

 ---
+license: apache-2.0
+base_model: facebook/bart-base
 tags:
 - generated_from_trainer
 model-index:
 # bart_test_p2
+This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0306
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 1.5336        | 0.18  | 500  | 0.1004          |
+| 0.1089        | 0.35  | 1000 | 0.0755          |
+| 0.0923        | 0.53  | 1500 | 0.0637          |
+| 0.0747        | 0.7   | 2000 | 0.0554          |
+| 0.0719        | 0.88  | 2500 | 0.0498          |
+| 0.0653        | 1.05  | 3000 | 0.0470          |
+| 0.0557        | 1.23  | 3500 | 0.0425          |
+| 0.0505        | 1.4   | 4000 | 0.0403          |
+| 0.05          | 1.58  | 4500 | 0.0378          |
+| 0.0477        | 1.75  | 5000 | 0.0362          |
+| 0.0451        | 1.93  | 5500 | 0.0343          |
+| 0.0443        | 2.1   | 6000 | 0.0327          |
+| 0.0372        | 2.28  | 6500 | 0.0326          |
+| 0.0384        | 2.45  | 7000 | 0.0316          |
+| 0.0371        | 2.63  | 7500 | 0.0311          |
+| 0.0363        | 2.8   | 8000 | 0.0306          |
+| 0.0345        | 2.98  | 8500 | 0.0306          |
 ### Framework versions

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "./bart_test_p2/backup_checkpoint-12000",
   "activation_dropout": 0.1,
   "activation_function": "gelu",
   "add_bias_logits": false,

 {
+  "_name_or_path": "facebook/bart-base",
   "activation_dropout": 0.1,
   "activation_function": "gelu",
   "add_bias_logits": false,

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bd2d4cc6d9060405892bbe3e531d4520c68bd9a3a2fbf019f9e89cb820504c2f
 size 557912620

 version https://git-lfs.github.com/spec/v1
+oid sha256:f2558d47436667ae1b8a797015805d813c8d89be3b9c41971ba494d8ff97ee0f
 size 557912620

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:da19c72832637aa9b262dfd4f56dcc1ff2faa4b4d65254eee1a7a257ba33f327
 size 4664

 version https://git-lfs.github.com/spec/v1
+oid sha256:73dae393b2d260a70f60ef338ffd292b5f213775449dfde701757ecec893be35
 size 4664