Pinkstack/SuperThoughts-CoT-14B-16k-o1-QwQ-GGUF

Pinkstack committed 20 days ago

Commit 44825f1 · verified · 1 Parent(s): 7546939

Update README.md

Files changed (1)

README.md +6 -2

@@ -133,8 +133,11 @@ Please check the examples we provided: https://huggingface.co/Pinkstack/SuperTho
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/QDHJhI0EVT_L9AHY_g3Br.png)
 Beats qwen/qwq at MATH & MuSR (MuSR being a reasoning benchmark)
 Evaluation:
-![eval](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/Dk-HD4wrS54r0lYlOF1Bz.png)
-Please note, the low IFEVAL results is probably due to it always reasoning, it does have issues with instruction following.
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/csbdGKzGcDVMPRqMCoH8D.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/HR9WtjBhE4h6wrq88FLAf.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/GLt4ct4yAVMvYEpoYO5o6.png)
 Unlike previous models we've uploaded, this one is the best one we've published! Answers in two steps: Reasoning -> Final answer like o1 mini and other similar reasoning ai models.
 # 🧀 Which quant is right for you? (all tested!)
@@ -145,6 +148,7 @@ Unlike previous models we've uploaded, this one is the best one we've published!
 # [Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/Pinkstack__SuperThoughts-CoT-14B-16k-o1-QwQ-details)!
 Summarized results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/contents/viewer/default/train?q=Pinkstack%2FSuperThoughts-CoT-14B-16k-o1-QwQ&sort[column]=Average%20%E2%AC%86%EF%B8%8F&sort[direction]=desc)!
+Please note, the low IFEVAL results is probably due to it always reasoning, it does have issues with instruction following.
 |      Metric       |Value (%)|
 |-------------------|--------:|