josejointriple committed on
Commit 02fee68 · verified · 1 Parent(s): 5d8bdbb

jointriple/brand_classification_2_20240626_model_1

Files changed (3)
  1. README.md +13 -11
  2. model.safetensors +1 -1
  3. training_args.bin +2 -2
README.md CHANGED
@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 4.9735
- - Accuracy: 0.7396
+ - Loss: 5.0555
+ - Accuracy: 0.7284

  ## Model description

@@ -38,24 +38,26 @@ More information needed

  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 128
- - eval_batch_size: 128
+ - train_batch_size: 256
+ - eval_batch_size: 256
  - seed: 42
  - gradient_accumulation_steps: 4
- - total_train_batch_size: 512
+ - total_train_batch_size: 1024
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 5
+ - num_epochs: 7

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
- | 7.3089 | 1.0 | 592 | 6.6011 | 0.1957 |
- | 6.5028 | 2.0 | 1185 | 5.8580 | 0.5069 |
- | 5.9384 | 3.0 | 1778 | 5.3658 | 0.6636 |
- | 5.5484 | 4.0 | 2371 | 5.0733 | 0.7220 |
- | 5.2914 | 4.99 | 2960 | 4.9735 | 0.7396 |
+ | 7.0753 | 1.0 | 296 | 6.9521 | 0.0573 |
+ | 6.4689 | 2.0 | 593 | 6.3053 | 0.2966 |
+ | 6.0338 | 3.0 | 889 | 5.8345 | 0.5124 |
+ | 5.6788 | 4.0 | 1186 | 5.4963 | 0.6125 |
+ | 5.4413 | 5.0 | 1482 | 5.2485 | 0.6947 |
+ | 5.271 | 6.0 | 1779 | 5.1036 | 0.7218 |
+ | 5.2718 | 6.99 | 2072 | 5.0555 | 0.7284 |


  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2cd84ec27ed1b68fa4dac847eab3e1dc438a51b0c24f106a3807cb99853778cd
+ oid sha256:d98abd5f9e5ee134b10775868be7bdd923fe4d5cab6c90a4512d6a8f90f41e0b
  size 444369032
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:9021cc0d929ee4828a226a2f43d69af6d3f6aafd4a19569edb4fc01d91b6b1c2
- size 4920
+ oid sha256:05c5a36bdb092e119bcffd9c5856a3800ec61f39b9204079b5281a2b6a55892e
+ size 4856
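
Note on the README.md hyperparameters: the reported `total_train_batch_size` values are consistent with per-device batch size times `gradient_accumulation_steps`, and the step counts of the first epoch in each run point to roughly the same number of training examples. The sketch below checks that arithmetic. It assumes a single training device (the diff does not state the device count), and the helper names `effective_batch_size` and `approx_train_examples` are hypothetical, introduced only for illustration. The example count is an estimate, since the last batch of an epoch may be partial.

```python
def effective_batch_size(per_device_batch, grad_accum_steps, num_devices=1):
    """Examples consumed per optimizer step (num_devices=1 is an assumption)."""
    return per_device_batch * grad_accum_steps * num_devices


def approx_train_examples(steps_per_epoch, total_batch):
    """Rough training-set size implied by optimizer steps in one epoch."""
    return steps_per_epoch * total_batch


# Values taken from the README.md diff above.
old_total = effective_batch_size(128, 4)  # before this commit
new_total = effective_batch_size(256, 4)  # after this commit

print(old_total, new_total)                   # 512 1024
print(approx_train_examples(592, old_total))  # 303104
print(approx_train_examples(296, new_total))  # 303104
```

Under these assumptions, both runs see about 303k examples per epoch, so the new run takes half as many optimizer steps per epoch (296 instead of 592) at the same learning rate.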