bala3040/paper_eval_gpt

Files changed (5) hide show

README.md CHANGED Viewed

@@ -16,12 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 0.4806
-- eval_runtime: 106.6233
-- eval_samples_per_second: 0.816
-- eval_steps_per_second: 0.816
-- epoch: 8.0
-- step: 160
 ## Model description
@@ -52,6 +47,22 @@ The following hyperparameters were used during training:
 - num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Framework versions
 - PEFT 0.10.0

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5294
 ## Model description
 - num_epochs: 10
 - mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 2.1594        | 1.0   | 5    | 1.8573          |
+| 1.6557        | 2.0   | 10   | 1.3995          |
+| 1.2039        | 3.0   | 15   | 1.0328          |
+| 0.8274        | 4.0   | 20   | 0.7538          |
+| 0.5816        | 5.0   | 25   | 0.6224          |
+| 0.4699        | 6.0   | 30   | 0.5730          |
+| 0.408         | 7.0   | 35   | 0.5498          |
+| 0.3595        | 8.0   | 40   | 0.5377          |
+| 0.3235        | 9.0   | 45   | 0.5319          |
+| 0.3042        | 10.0  | 50   | 0.5294          |
 ### Framework versions
 - PEFT 0.10.0

adapter_config.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "alpha_pattern": {},
   "auto_mapping": null,
-  "base_model_name_or_path": null,
   "bias": "none",
   "fan_in_fan_out": false,
   "inference_mode": true,

 {
   "alpha_pattern": {},
   "auto_mapping": null,
+  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
   "bias": "none",
   "fan_in_fan_out": false,
   "inference_mode": true,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8ab40b610fa9e12d59b2762e2595968e1aab8c86b73c7ade7e72ef2c35844b77
-size 8398144

 version https://git-lfs.github.com/spec/v1
+oid sha256:10a9947d86f2c6a34bf29d90249f8ff19f420e723f742704413e3231bedf051e
+size 8397056

runs/Apr08_09-16-59_91c30f7cc014/events.out.tfevents.1712567819.91c30f7cc014.698.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:c60a43f13d5f34857cdf1f451de6585f0cbbd56c036ef466d2292876c599426f
+size 10289

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1c5ee3f4a19e085531a13eb386222dc266a1637c9c536efe153641fb217c03c1
 size 4856

 version https://git-lfs.github.com/spec/v1
+oid sha256:4cf8d48594cbcef15c2a218f4b26be8b7e8f28dcacde6925f2c94fbc3d05d247
 size 4856