VERSIL91 commited on
Commit
744e0bd
·
verified ·
1 Parent(s): dd7e515

End of training

Browse files
Files changed (2) hide show
  1. README.md +5 -4
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -75,7 +75,7 @@ lora_target_modules:
75
  lr_scheduler: cosine
76
  max_memory:
77
  0: 70GiB
78
- max_steps: 5
79
  micro_batch_size: 2
80
  mlflow_experiment_name: /tmp/bb875996e902a1b4_train_data.json
81
  model_type: AutoModelForCausalLM
@@ -144,15 +144,16 @@ The following hyperparameters were used during training:
144
  - optimizer: Use OptimizerNames.ADAMW_BNB with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
145
  - lr_scheduler_type: cosine
146
  - lr_scheduler_warmup_steps: 10
147
- - training_steps: 5
148
 
149
  ### Training results
150
 
151
  | Training Loss | Epoch | Step | Validation Loss |
152
  |:-------------:|:------:|:----:|:---------------:|
153
  | 0.0 | 0.0159 | 1 | nan |
154
- | 0.0 | 0.0318 | 2 | nan |
155
- | 0.0 | 0.0636 | 4 | nan |
 
156
 
157
 
158
  ### Framework versions
 
75
  lr_scheduler: cosine
76
  max_memory:
77
  0: 70GiB
78
+ max_steps: 50
79
  micro_batch_size: 2
80
  mlflow_experiment_name: /tmp/bb875996e902a1b4_train_data.json
81
  model_type: AutoModelForCausalLM
 
144
  - optimizer: Use OptimizerNames.ADAMW_BNB with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
145
  - lr_scheduler_type: cosine
146
  - lr_scheduler_warmup_steps: 10
147
+ - training_steps: 50
148
 
149
  ### Training results
150
 
151
  | Training Loss | Epoch | Step | Validation Loss |
152
  |:-------------:|:------:|:----:|:---------------:|
153
  | 0.0 | 0.0159 | 1 | nan |
154
+ | 0.0 | 0.2066 | 13 | nan |
155
+ | 0.0 | 0.4131 | 26 | nan |
156
+ | 0.0 | 0.6197 | 39 | nan |
157
 
158
 
159
  ### Framework versions
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3ed02991bfb14856da116f62902d9b2b607b6956ef4be3e6652547d461916967
3
  size 37070634
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8cda401164255b2616f72c3e7fd30cf6cdb721c14c93ccb97e5d32785a145e65
3
  size 37070634