bala3040/team_dpb

Files changed (6) hide show

README.md CHANGED Viewed

@@ -16,7 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5664
 ## Model description
@@ -47,22 +52,6 @@ The following hyperparameters were used during training:
 - num_epochs: 10
 - mixed_precision_training: Native AMP
-### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 2.092         | 0.92  | 9    | 1.4516          |
-| 1.0312        | 1.95  | 19   | 0.7849          |
-| 0.5407        | 2.97  | 29   | 0.5922          |
-| 0.4035        | 4.0   | 39   | 0.5433          |
-| 0.3684        | 4.92  | 48   | 0.5227          |
-| 0.2708        | 5.95  | 58   | 0.5200          |
-| 0.2348        | 6.97  | 68   | 0.5358          |
-| 0.2091        | 8.0   | 78   | 0.5485          |
-| 0.2125        | 8.92  | 87   | 0.5655          |
-| 0.1607        | 9.23  | 90   | 0.5664          |
 ### Framework versions
 - PEFT 0.10.0

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
+- eval_loss: 0.4806
+- eval_runtime: 106.6233
+- eval_samples_per_second: 0.816
+- eval_steps_per_second: 0.816
+- epoch: 8.0
+- step: 160
 ## Model description
 - num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Framework versions
 - PEFT 0.10.0

adapter_config.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
   "alpha_pattern": {},
   "auto_mapping": null,
-  "base_model_name_or_path": "TheBloke/Mistral-7B-Instruct-v0.2-GPTQ",
   "bias": "none",
   "fan_in_fan_out": false,
   "inference_mode": true,

 {
   "alpha_pattern": {},
   "auto_mapping": null,
+  "base_model_name_or_path": null,
   "bias": "none",
   "fan_in_fan_out": false,
   "inference_mode": true,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5102f738be2caaedaee55f48fd0a973ea468d87713324ab4fd8497dbfd0ac9f9
-size 8397056

 version https://git-lfs.github.com/spec/v1
+oid sha256:8ab40b610fa9e12d59b2762e2595968e1aab8c86b73c7ade7e72ef2c35844b77
+size 8398144

runs/Mar25_02-48-51_7b74c093dd10/events.out.tfevents.1711334931.7b74c093dd10.547.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:b277fc268469d3710fc4c068755a9ff38c8b3f4b0c3b00eabc1a45ce6fef5e16
+size 5684

runs/Mar25_03-03-00_7b74c093dd10/events.out.tfevents.1711335780.7b74c093dd10.547.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:57712115b914d466e7ffed7b22716792e4dfb4fe15a94b6213a01c16b9dc5665
+size 9013

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f74e38ed85d5984d80246250dead09995305204de65522e62c7fc03411227b8f
 size 4856

 version https://git-lfs.github.com/spec/v1
+oid sha256:1c5ee3f4a19e085531a13eb386222dc266a1637c9c536efe153641fb217c03c1
 size 4856