Training in progress, epoch 1
Browse files- README.md +4 -5
- model.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -1,17 +1,16 @@
|
|
1 |
---
|
2 |
base_model: Qwen/Qwen2.5-Math-1.5B
|
|
|
3 |
library_name: transformers
|
4 |
-
model_name: Qwen-2.5-1.5B-Simple-RL
|
5 |
tags:
|
6 |
- generated_from_trainer
|
7 |
-
-
|
8 |
-
- grpo
|
9 |
licence: license
|
10 |
---
|
11 |
|
12 |
-
# Model Card for
|
13 |
|
14 |
-
This model is a fine-tuned version of [Qwen/Qwen2.5-Math-1.5B](https://huggingface.co/Qwen/Qwen2.5-Math-1.5B).
|
15 |
It has been trained using [TRL](https://github.com/huggingface/trl).
|
16 |
|
17 |
## Quick start
|
|
|
1 |
---
|
2 |
base_model: Qwen/Qwen2.5-Math-1.5B
|
3 |
+
datasets: DigitalLearningGmbH/MATH-lighteval
|
4 |
library_name: transformers
|
|
|
5 |
tags:
|
6 |
- generated_from_trainer
|
7 |
+
- open-r1
|
|
|
8 |
licence: license
|
9 |
---
|
10 |
|
11 |
+
# Model Card for None
|
12 |
|
13 |
+
This model is a fine-tuned version of [Qwen/Qwen2.5-Math-1.5B](https://huggingface.co/Qwen/Qwen2.5-Math-1.5B) on the [DigitalLearningGmbH/MATH-lighteval](https://huggingface.co/datasets/DigitalLearningGmbH/MATH-lighteval) dataset.
|
14 |
It has been trained using [TRL](https://github.com/huggingface/trl).
|
15 |
|
16 |
## Quick start
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 3554214752
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4a01c896242d61fe45bb1afbbff9740bb7e99465651b79fd084b3d7397172f41
|
3 |
size 3554214752
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 7672
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bf2c0994671a500a1bc1dd8bf35aade41f6ddaa6555273ddd8276bd553d63963
|
3 |
size 7672
|