josejointriple commited on
Commit
5864528
·
verified ·
1 Parent(s): 02fee68

jointriple/brand_classification_2_20240627_model_1

Browse files
Files changed (3) hide show
  1. README.md +13 -14
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 5.0555
21
- - Accuracy: 0.7284
22
 
23
  ## Model description
24
 
@@ -37,27 +37,26 @@ More information needed
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
- - learning_rate: 2e-05
41
- - train_batch_size: 256
42
- - eval_batch_size: 256
43
  - seed: 42
44
- - gradient_accumulation_steps: 4
45
- - total_train_batch_size: 1024
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
 
48
  - num_epochs: 7
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
54
- | 7.0753 | 1.0 | 296 | 6.9521 | 0.0573 |
55
- | 6.4689 | 2.0 | 593 | 6.3053 | 0.2966 |
56
- | 6.0338 | 3.0 | 889 | 5.8345 | 0.5124 |
57
- | 5.6788 | 4.0 | 1186 | 5.4963 | 0.6125 |
58
- | 5.4413 | 5.0 | 1482 | 5.2485 | 0.6947 |
59
- | 5.271 | 6.0 | 1779 | 5.1036 | 0.7218 |
60
- | 5.2718 | 6.99 | 2072 | 5.0555 | 0.7284 |
61
 
62
 
63
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.7750
21
+ - Accuracy: 0.9257
22
 
23
  ## Model description
24
 
 
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
+ - learning_rate: 5e-05
41
+ - train_batch_size: 512
42
+ - eval_batch_size: 512
43
  - seed: 42
 
 
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
+ - lr_scheduler_warmup_steps: 500
47
  - num_epochs: 7
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
53
+ | 6.4334 | 1.0 | 593 | 6.1799 | 0.3660 |
54
+ | 4.3096 | 2.0 | 1186 | 3.9473 | 0.8412 |
55
+ | 2.6773 | 3.0 | 1779 | 2.3788 | 0.9079 |
56
+ | 1.7389 | 4.0 | 2372 | 1.5072 | 0.9197 |
57
+ | 1.1692 | 5.0 | 2965 | 1.0537 | 0.9236 |
58
+ | 0.9072 | 6.0 | 3558 | 0.8410 | 0.9254 |
59
+ | 0.7699 | 7.0 | 4151 | 0.7750 | 0.9257 |
60
 
61
 
62
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d98abd5f9e5ee134b10775868be7bdd923fe4d5cab6c90a4512d6a8f90f41e0b
3
  size 444369032
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:16d7a19a8227ddd87e6e3b53f02dd08fd910ce3bc82c9c5a75589343514b08cb
3
  size 444369032
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:05c5a36bdb092e119bcffd9c5856a3800ec61f39b9204079b5281a2b6a55892e
3
  size 4856
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ead15358f313b14eaa0fa2c68ea7494eb00a2343e4786eb4d5f55e17a5c99701
3
  size 4856