mehmetkeremturkcan
committed on
Update README.md
README.md
CHANGED
@@ -1,5 +1,7 @@
 ---
-base_model:
+base_model:
+- mehmetkeremturkcan/SmollerLM-63M-Instruct-Base
+- HuggingFaceTB/SmolLM2-135M-Instruct
 datasets: HuggingFaceTB/smol-smoltalk
 library_name: transformers
 model_name: SmollerLM-48M-Instruct-ft-sft
@@ -12,7 +14,9 @@ licence: license
 
 # Model Card for SmollerLM-48M-Instruct-ft-sft
 
-
+A post-pruning SFT-finetuned model based on [HuggingFaceTB/SmolLM2-135M-Instruct](https://huggingface.co/mehmetkeremturkcan/HuggingFaceTB/SmolLM2-135M-Instruct).
+
+This model is a fine-tuned version of [mehmetkeremturkcan/SmollerLM-63M-Instruct-Base](https://huggingface.co/mehmetkeremturkcan/SmollerLM-63M-Instruct-Base) (improperly labeled) on the [HuggingFaceTB/smol-smoltalk](https://huggingface.co/datasets/HuggingFaceTB/smol-smoltalk) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 
 ## Quick start
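
The first hunk turns `base_model` into a two-item list in the README front matter. As a rough way to check what that metadata looks like once read back, here is a minimal sketch in Python. It uses a small hand-rolled reader, `parse_front_matter_fragment`, which is a hypothetical helper written only for this illustration. It is not the Hub's parser, and it only covers the flat `key: value` and `key:` plus `- item` shapes visible in this diff. Only the front-matter lines shown in the hunk are included; the rest of the block is not visible here.

```python
def parse_front_matter_fragment(text):
    """Hypothetical helper: read flat `key: value` pairs and `key:` + `- item` lists."""
    result = {}
    current_key = None  # key whose list items are currently being collected
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line == "---":
            continue  # skip blank lines and the front-matter delimiter
        if line.startswith("- ") and current_key is not None:
            result[current_key].append(line[2:].strip())
            continue
        key, _, value = line.partition(":")
        key, value = key.strip(), value.strip()
        if value:
            result[key] = value
            current_key = None
        else:
            # A key with no inline value starts a (possibly empty) list.
            result[key] = []
            current_key = key
    return result


# Front-matter lines as they appear on the new side of the first hunk.
NEW_FRONT_MATTER = """\
---
base_model:
- mehmetkeremturkcan/SmollerLM-63M-Instruct-Base
- HuggingFaceTB/SmolLM2-135M-Instruct
datasets: HuggingFaceTB/smol-smoltalk
library_name: transformers
model_name: SmollerLM-48M-Instruct-ft-sft
"""

metadata = parse_front_matter_fragment(NEW_FRONT_MATTER)
print(metadata["base_model"])
```

In a real workflow a full YAML parser would be the safer choice. The hand-rolled reader is used here only to keep the sketch dependency-free.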