Step Recap 24B Instruct

This model was fine-tuned from Mistral Small 24B Instruct using GRPO (with QLoRA) to follow a step, recap, answer thinking pattern.
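
GRPO training scores sampled completions with one or more reward functions. The reward functions actually used for this model are not described here, so the following is only a hypothetical sketch of a format check that such a setup might include. The function name `format_reward`, the 0/1 scoring, and the strict alternation rule are all assumptions; the tag names are taken from the example output below.

```python
import re

# Matches one complete <step>, <recap>, or <answer> block (tag names assumed
# from the example output in this card).
_TAG = re.compile(r"<(step|recap|answer)>(.*?)</\1>", re.DOTALL)


def format_reward(completion: str) -> float:
    """Hypothetical format check: 1.0 if the completion is a sequence of
    step/recap pairs followed by a single final answer, else 0.0."""
    names = [name for name, _ in _TAG.findall(completion)]

    # Reject any text that sits outside the recognised blocks.
    if _TAG.sub("", completion).strip():
        return 0.0

    # Need at least one step, one recap, and a closing answer.
    if len(names) < 3 or names[-1] != "answer":
        return 0.0

    body = names[:-1]
    if len(body) % 2 != 0:
        return 0.0

    # Blocks before the answer must alternate: step, recap, step, recap, ...
    expected = ["step" if i % 2 == 0 else "recap" for i in range(len(body))]
    return 1.0 if body == expected else 0.0
```

A check like this rewards only the layout, not the correctness of the answer; a real setup would likely pair it with a separate correctness reward.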

Example: Which is bigger? 9.9 or 9.11?

<step>
To determine which number is bigger between 9.11 and 9.9, we can compare them directly.

</step>

<recap>
I've identified the numbers to compare: 9.11 and 9.9. The next step is to compare these numbers to determine which is larger.

</recap>

<step>
Comparing the two numbers:
- 9.11 has a tenths digit of 1 and a hundredths digit of 1.
- 9.9 has a tenths digit of 9 and no hundredths digit (which is equivalent to 0 hundredths).

Since 9 (tenths digit of 9.9) is greater than 1 (tenths digit of 9.11), 9.9 is larger than 9.11.

</step>

<recap>
I have compared the numbers 9.11 and 9.9 and determined that 9.9 is the larger number. There are no further steps needed to solve this problem.

</recap>

<answer>9.9</answer>
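
Because the output is wrapped in tags, downstream code will usually want to separate the reasoning from the final answer. The following is a minimal sketch of one way to do that with the standard library, assuming the response uses exactly the `<step>`, `<recap>`, and `<answer>` tags shown above. `ParsedResponse` and `parse_response` are illustrative names, not part of any released tooling for this model.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional

# One complete tagged block; DOTALL lets a block span multiple lines.
_BLOCK = re.compile(r"<(step|recap|answer)>(.*?)</\1>", re.DOTALL)


@dataclass
class ParsedResponse:
    steps: List[str] = field(default_factory=list)
    recaps: List[str] = field(default_factory=list)
    answer: Optional[str] = None


def parse_response(text: str) -> ParsedResponse:
    """Collect the step, recap, and answer blocks from a model response."""
    parsed = ParsedResponse()
    for name, body in _BLOCK.findall(text):
        body = body.strip()
        if name == "step":
            parsed.steps.append(body)
        elif name == "recap":
            parsed.recaps.append(body)
        else:
            # If several <answer> blocks appear, the last one is kept.
            parsed.answer = body
    return parsed


if __name__ == "__main__":
    demo = "<step>\nCompare tenths digits.\n</step>\n<recap>\n9 > 1.\n</recap>\n<answer>9.9</answer>"
    result = parse_response(demo)
    print(result.answer)
```

If the response is truncated before the `<answer>` tag closes, `answer` stays `None`, which callers can treat as an incomplete generation.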

Model size: 23.6B params (Safetensors). Tensor type: BF16.

Model tree for qingy2024/Step-Recap-24B-Instruct: 1 quantized model.