Quardo
/

TR-Qwen2-7B-003-241201

Text Generation

Model card Files Files and versions Community

TR-Qwen2-7B-003-241201 / README.md

Quardo's picture

Update README.md

22dbed8 verified about 1 month ago

|

1.13 kB

	---
	datasets:
	- Quardo/wikipedia-turkish-qa
	language:
	- tr
	base_model:
	- Qwen/Qwen2-7B-Instruct
	pipeline_tag: text-generation
	---

	# TR-Qwen2-7B-003-241201

	The third generation of TR-Qwen2, fine-tuned with the largest token count so far.

	- [7B-001](https://huggingface.co/Quardo/TR-Qwen2-7B-001-241129): Fine-tuned on 12 million tokens.
	- [1.5B-002](https://huggingface.co/Quardo/TR-Qwen2-1.5B-002-241130): Fine-tuned on 2 million tokens.
	- 7B-003 (this version): Fine-tuned on 103 million tokens, or effectively 25.75 million tokens across 4 epochs (compared to just 1 epoch for the earlier versions).

	The model has been fine-tuned with this system prompt only: `Yardımsever bir yapay zeka asistanısın.`
	Still the fine-tuning service is the same: [Together.ai](https://together.ai)

	---

	Current Status: Importing the model from [Together.ai](https://together.ai)

	---

	Note: This version may perform poorly with mathematical tasks, as its fine-tuning is based on Wikipedia-based data. Expect stronger cultural understanding instead (unlike 002) with actual capabilities to talk normally (unlike 001).