Quardo
/

TR-Qwen2-7B-003-241201

Text Generation

Model card Files Files and versions Community

Quardo commited on Nov 30, 2024

Commit

c871bab

·

verified ·

1 Parent(s): 60de1ca

Create README.md

Files changed (1) hide show

README.md +20 -0

README.md ADDED Viewed

	@@ -0,0 +1,20 @@

+---
+datasets:
+- Quardo/wikipedia-turkish-qa
+language:
+- tr
+base_model:
+- Qwen/Qwen2-7B-Instruct
+pipeline_tag: text-generation
+---
+# TR-Qwen2-7B-003-241130
+The third generation of TR-Qwen2, fine-tuned with the largest token count so far.
+- **001**: Fine-tuned on 12 million tokens.
+- **002**: Fine-tuned on 2 million tokens.
+- **003** (this version): Fine-tuned on 103 million tokens, or effectively 25.75 million tokens across 4 epochs (compared to just 1 epoch for the earlier versions).
+**Current Status:** Fine-tuning in progress on [Together.ai](https://together.ai).
+**Note:** This version may perform poorly with mathematical tasks, as its fine-tuning is based on Wikipedia-based data. Expect stronger cultural understanding instead (unlike 002) with actual capabilities to talk normally (unlike 001).