End of training

Browse files

Files changed (4) hide show

README.md +37 -17
model.safetensors +1 -1
runs/Nov09_08-51-26_6665d88e9b9d/events.out.tfevents.1699519891.6665d88e9b9d.586.0 +2 -2
runs/Nov09_08-51-26_6665d88e9b9d/events.out.tfevents.1699520956.6665d88e9b9d.586.1 +3 -0

README.md CHANGED Viewed

@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [prajjwal1/bert-small](https://huggingface.co/prajjwal1/bert-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9943
-- Accuracy: 0.5822
-- F1 Score: 0.5820
 ## Model description
@@ -38,28 +38,48 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1.0136026165598675e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Score |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
-| No log        | 1.0   | 186  | 1.1631          | 0.4636   | 0.4346   |
-| No log        | 2.0   | 372  | 1.0563          | 0.5553   | 0.5558   |
-| 1.1271        | 3.0   | 558  | 1.0238          | 0.5633   | 0.5629   |
-| 1.1271        | 4.0   | 744  | 0.9990          | 0.5795   | 0.5786   |
-| 1.1271        | 5.0   | 930  | 0.9943          | 0.5822   | 0.5820   |
-| 0.8392        | 6.0   | 1116 | 1.0389          | 0.5741   | 0.5692   |
-| 0.8392        | 7.0   | 1302 | 1.0114          | 0.5768   | 0.5759   |
-| 0.8392        | 8.0   | 1488 | 1.0277          | 0.5741   | 0.5702   |
-| 0.692         | 9.0   | 1674 | 1.0246          | 0.5822   | 0.5799   |
-| 0.692         | 10.0  | 1860 | 1.0241          | 0.5822   | 0.5797   |
 ### Framework versions

 This model is a fine-tuned version of [prajjwal1/bert-small](https://huggingface.co/prajjwal1/bert-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9941
+- Accuracy: 0.5903
+- F1 Score: 0.5865
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 64
+- eval_batch_size: 64
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 30
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Score |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
+| No log        | 1.0   | 24   | 1.2145          | 0.4933   | 0.4701   |
+| No log        | 2.0   | 48   | 1.0960          | 0.5391   | 0.5365   |
+| No log        | 3.0   | 72   | 1.0569          | 0.5768   | 0.5791   |
+| No log        | 4.0   | 96   | 1.0052          | 0.5714   | 0.5698   |
+| No log        | 5.0   | 120  | 0.9889          | 0.5714   | 0.5702   |
+| No log        | 6.0   | 144  | 0.9932          | 0.5795   | 0.5772   |
+| No log        | 7.0   | 168  | 0.9841          | 0.5714   | 0.5680   |
+| No log        | 8.0   | 192  | 0.9941          | 0.5903   | 0.5865   |
+| No log        | 9.0   | 216  | 0.9788          | 0.5903   | 0.5891   |
+| No log        | 10.0  | 240  | 1.0105          | 0.5660   | 0.5617   |
+| No log        | 11.0  | 264  | 1.0473          | 0.5526   | 0.5464   |
+| No log        | 12.0  | 288  | 1.0272          | 0.5714   | 0.5685   |
+| No log        | 13.0  | 312  | 1.0627          | 0.5499   | 0.5492   |
+| No log        | 14.0  | 336  | 1.0428          | 0.5795   | 0.5782   |
+| No log        | 15.0  | 360  | 1.0644          | 0.5633   | 0.5625   |
+| No log        | 16.0  | 384  | 1.1463          | 0.5364   | 0.5261   |
+| No log        | 17.0  | 408  | 1.1109          | 0.5714   | 0.5689   |
+| No log        | 18.0  | 432  | 1.1260          | 0.5741   | 0.5739   |
+| No log        | 19.0  | 456  | 1.1793          | 0.5580   | 0.5533   |
+| No log        | 20.0  | 480  | 1.1968          | 0.5580   | 0.5535   |
+| 0.6103        | 21.0  | 504  | 1.1961          | 0.5741   | 0.5722   |
+| 0.6103        | 22.0  | 528  | 1.2399          | 0.5553   | 0.5504   |
+| 0.6103        | 23.0  | 552  | 1.2642          | 0.5526   | 0.5473   |
+| 0.6103        | 24.0  | 576  | 1.2530          | 0.5660   | 0.5625   |
+| 0.6103        | 25.0  | 600  | 1.2637          | 0.5714   | 0.5687   |
+| 0.6103        | 26.0  | 624  | 1.3012          | 0.5526   | 0.5468   |
+| 0.6103        | 27.0  | 648  | 1.2932          | 0.5606   | 0.5579   |
+| 0.6103        | 28.0  | 672  | 1.2888          | 0.5687   | 0.5664   |
+| 0.6103        | 29.0  | 696  | 1.3087          | 0.5660   | 0.5634   |
+| 0.6103        | 30.0  | 720  | 1.3073          | 0.5714   | 0.5687   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dc8cf347d3d9123a0e164ba3f3c1a26834fc8300b29986ed5183e7e03dead86d
 size 115071152

 version https://git-lfs.github.com/spec/v1
+oid sha256:ad734bab375087fd03c678a63cd68d97084742b3a465827f0cd63fb5dab9946c
 size 115071152

runs/Nov09_08-51-26_6665d88e9b9d/events.out.tfevents.1699519891.6665d88e9b9d.586.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:776498fb57a759312b18d8721ca884984d5fc7d107b720e501b0f7e0d9dce198
-size 15328

 version https://git-lfs.github.com/spec/v1
+oid sha256:8b3fbd76b70180d1ceb52c7ee7c68ea384727c2ba554b5ef0a74be0ad5819ea8
+size 16057

runs/Nov09_08-51-26_6665d88e9b9d/events.out.tfevents.1699520956.6665d88e9b9d.586.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:45a2eb75aa1f0b1f7a28d1b1d1f92913ed29371e1c1b3bd93a8ac439ef5ff718
+size 463