yfliao committed on
Commit 5be35f9 · verified · 1 Parent(s): 8b2cf35

Training in progress, epoch 1

Files changed (3)
  1. README.md +4 -5
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -1,17 +1,16 @@
  ---
  base_model: Qwen/Qwen2.5-Math-1.5B
+ datasets: DigitalLearningGmbH/MATH-lighteval
  library_name: transformers
- model_name: Qwen-2.5-1.5B-Simple-RL
  tags:
  - generated_from_trainer
- - trl
- - grpo
+ - open-r1
  licence: license
  ---

- # Model Card for Qwen-2.5-1.5B-Simple-RL
+ # Model Card for None

- This model is a fine-tuned version of [Qwen/Qwen2.5-Math-1.5B](https://huggingface.co/Qwen/Qwen2.5-Math-1.5B).
+ This model is a fine-tuned version of [Qwen/Qwen2.5-Math-1.5B](https://huggingface.co/Qwen/Qwen2.5-Math-1.5B) on the [DigitalLearningGmbH/MATH-lighteval](https://huggingface.co/datasets/DigitalLearningGmbH/MATH-lighteval) dataset.
  It has been trained using [TRL](https://github.com/huggingface/trl).

  ## Quick start
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:a552b7c22d1518545d432ceb5309a10041adbfb98eab6609ac68f06cb039f854
+ oid sha256:4a01c896242d61fe45bb1afbbff9740bb7e99465651b79fd084b3d7397172f41
  size 3554214752
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:7aff461dd40134c7f432c4d6150d3e85c0aa6bb7bf3bf80175b7df8d9c46b7ba
+ oid sha256:bf2c0994671a500a1bc1dd8bf35aade41f6ddaa6555273ddd8276bd553d63963
  size 7672