feihu.hf committed · Commit 7c0a8dc · 1 parent: 2ebdfe8

update readme

Files changed:
- README.md (+8 -1)
- figures/benchmark.jpg (+0 -0)
README.md CHANGED
@@ -19,6 +19,11 @@ tags:
 
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.
 
+<p align="center">
+    <img width="100%" src="figures/benchmark.jpg">
+</p>
+
+
**This repo contains the QwQ 32B model**, which has the following features:
- Type: Causal Language Models
- Training Stage: Pretraining & Post-training (Supervised Finetuning and Reinforcement Learning)

@@ -31,6 +36,8 @@ QwQ is the reasoning model of the Qwen series. Compared with conventional instru
 
**Note:** For the best experience, please review the [usage guidelines](#usage-guidelines) before deploying QwQ models.
 
+You can try our [demo](https://huggingface.co/spaces/Qwen/QwQ-32B-Demo) or access QwQ models via [QwenChat](https://chat.qwen.ai).
+
For more details, please refer to our [blog](https://qwenlm.github.io/blog/qwq-32b/), [GitHub](https://github.com/QwenLM/Qwen2.5), and [Documentation](https://qwen.readthedocs.io/en/latest/).
 
## Requirements

@@ -126,7 +133,7 @@ If you find our work helpful, feel free to give us a cite.
 
```
@misc{qwq32b,
-    title = {
+    title = {QwQ-32B: The Power of Scaling RL},
    url = {https://qwenlm.github.io/blog/qwq-32b/},
    author = {Qwen Team},
    month = {March},
figures/benchmark.jpg ADDED
(new image file: the benchmark figure embedded in the README above)
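
The commit itself only touches prose, the benchmark figure, and the citation, so no runnable snippet appears in the diff. For orientation, here is a minimal quickstart sketch for the model this repo ships; it assumes the public `Qwen/QwQ-32B` checkpoint id, a recent `transformers` release with chat-template support, and an illustrative prompt and token budget. See the repo's usage guidelines for the recommended sampling settings.

```python
# Minimal quickstart sketch, assuming the public "Qwen/QwQ-32B" checkpoint and a
# recent transformers release; prompt and token budget are illustrative only.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/QwQ-32B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # let transformers pick bf16/fp16 where supported
    device_map="auto",    # shard across available GPUs (requires accelerate)
)

# Build a chat prompt; add_generation_prompt appends the assistant turn so the
# model starts its reasoning immediately.
messages = [{"role": "user", "content": "How many r's are in the word \"strawberry\"?"}]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer([text], return_tensors="pt").to(model.device)

# Reasoning models emit long chains of thought, so allow a generous budget.
output_ids = model.generate(**inputs, max_new_tokens=4096)
response = tokenizer.decode(
    output_ids[0][inputs.input_ids.shape[-1]:], skip_special_tokens=True
)
print(response)
```

A 32B-parameter model in bf16 needs on the order of 64 GB of accelerator memory before the KV cache, which is why the README points deployment questions to the usage guidelines and the linked documentation.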