jacobthebanana committed
Commit 9e3f1bf · verified · 1 Parent(s): 919f014

Model save

Files changed (1)
  1. README.md +2 -4
README.md CHANGED
@@ -1,6 +1,4 @@
  ---
- base_model: Qwen/Qwen2.5-0.5B
- datasets: simplescaling/s1K-1.1
  library_name: transformers
  model_name: deepseek_lora_48
  tags:
@@ -12,7 +10,7 @@ licence: license

  # Model Card for deepseek_lora_48

- This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B](https://huggingface.co/Qwen/Qwen2.5-0.5B) on the [simplescaling/s1K-1.1](https://huggingface.co/datasets/simplescaling/s1K-1.1) dataset.
+ This model is a fine-tuned version of [None](https://huggingface.co/None).
  It has been trained using [TRL](https://github.com/huggingface/trl).

  ## Quick start
@@ -28,7 +26,7 @@ print(output["generated_text"])

  ## Training procedure

- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/jacobthebanana/20250215-s1k-lora-sft/runs/b6cx95ys)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/jacobthebanana/20250223-s1k-lora-sft/runs/v8sae5to)


  This model was trained with SFT.