NofarSachs commited on
Commit
d48e6be
1 Parent(s): 6310d5c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -13
README.md CHANGED
@@ -16,8 +16,7 @@ GemMarketing outperforms gemma-2b on specific marketing tasks. We are releasing
16
 
17
  ### Model Description
18
 
19
- GemMarketing is a powerful tool that can aid in generating high-quality marketing content and conducting research in the field of marketing.
20
- It's a great resource for anyone looking to stay ahead in the rapidly changing world of marketing.
21
 
22
  While the model is designed to encode marketing knowledge, this checkpoint is not yet adapted to deliver knowledge appropriately, safely, or within professional actionable constraints.
23
  We recommend against deploying GemMarketing in real-world practice settings.
@@ -71,17 +70,18 @@ This is an **early checkpoint** of the model that we are releasing to the commun
71
 
72
  #### Training Hyperparameters
73
 
74
- | Param | Value |
75
- |---------------|------------|
76
- | bf16 | true |
77
- | tf32 | true |
78
- | lr | 1e-4 |
79
- | optim | adamw |
80
- | epochs | 1 |
81
- | lr scheduler | constant |
82
- | warmup ratio | 0.03 |
83
- | max grad norm | 0.3 |
84
- | context len | 3072 |
 
85
 
86
 
87
 
 
16
 
17
  ### Model Description
18
 
19
+ GemMarketing is a powerful tool that can help generate high-quality marketing content and conduct research in the field of marketing. It is an excellent resource for staying ahead in the rapidly changing world of marketing.
 
20
 
21
  While the model is designed to encode marketing knowledge, this checkpoint is not yet adapted to deliver knowledge appropriately, safely, or within professional actionable constraints.
22
  We recommend against deploying GemMarketing in real-world practice settings.
 
70
 
71
  #### Training Hyperparameters
72
 
73
+ | Param | Value |
74
+ |---------------------|------------|
75
+ | bf16 | true |
76
+ | tf32 | true |
77
+ | lr | 1e-4 |
78
+ | optim | adamw |
79
+ | epochs | 1 |
80
+ | lr scheduler | constant |
81
+ | warmup ratio | 0.03 |
82
+ | max grad norm | 0.3 |
83
+ | context lengt | 3072 |
84
+ | attention | SPDA |
85
 
86
 
87