LongTalk A Very Long Chain-of-Thought Dataset for Reasoning Model Post-Training
kenhktsui/longtalk-cot-v0.1 Viewer • Updated Dec 30, 2024 • 61.2k • 128 • 13
kenhktsui/qwen2.5-7b-instruct-thinking-sft-merged-gguf Updated Dec 30, 2024 • 71 • 1
kenhktsui/qwen2.5-7b-instruct-thinking-sft-merged Text Generation • Updated Dec 30, 2024 • 19
kenhktsui/llama3.1-8b-instruct-thinking-sft-merged-gguf Updated Dec 30, 2024 • 82 • 1

FastText Model for Pretraining Data Curation
kenhktsui/llm-data-textbook-quality-fasttext-classifier-v2 Text Classification • Updated Nov 28, 2024 • 539 • 27
kenhktsui/fineweb-edu-fasttext-classifier Text Classification • Updated Jun 6, 2024 • 1.34k • 3
kenhktsui/code-natural-language-fasttext-classifier Text Classification • Updated Oct 30, 2024 • 72 • 1
kenhktsui/math-fasttext-classifier Text Classification • Updated 8 days ago • 53 • 1

kenhktsui/Qwen-0.5B-GRPO-gsm8k-count-wait-cap-cross-correct Text Generation • Updated 27 days ago • 26