End of training

Files changed (2) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7674
 ## Model description
@@ -44,7 +44,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
-- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results

 This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7337
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
+- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results

runs/Jun09_03-51-53_1fe304df3563/events.out.tfevents.1717914956.1fe304df3563.139.4 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:6a24b052b8f229969f16b0e08f2127ec771b3ceb7eb567d12b9f3bc0ec7b3bce
+size 359