jacobthebanana
/

deepseek_lora_48

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

jacobthebanana commited on 3 days ago

Commit

9e3f1bf

·

verified ·

1 Parent(s): 919f014

Model save

Files changed (1) hide show

README.md +2 -4

README.md CHANGED Viewed

@@ -1,6 +1,4 @@
 ---
-base_model: Qwen/Qwen2.5-0.5B
-datasets: simplescaling/s1K-1.1
 library_name: transformers
 model_name: deepseek_lora_48
 tags:
@@ -12,7 +10,7 @@ licence: license
 # Model Card for deepseek_lora_48
-This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B](https://huggingface.co/Qwen/Qwen2.5-0.5B) on the [simplescaling/s1K-1.1](https://huggingface.co/datasets/simplescaling/s1K-1.1) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
@@ -28,7 +26,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/jacobthebanana/20250215-s1k-lora-sft/runs/b6cx95ys)
 This model was trained with SFT.

 ---
 library_name: transformers
 model_name: deepseek_lora_48
 tags:
 # Model Card for deepseek_lora_48
+This model is a fine-tuned version of [None](https://huggingface.co/None).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
 ## Training procedure
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/jacobthebanana/20250223-s1k-lora-sft/runs/v8sae5to)
 This model was trained with SFT.