Daemontatox committed
Update README.md
README.md CHANGED
```diff
@@ -7,25 +7,27 @@ tags:
 - unsloth
 - qwen2
 - trl
+- Chain-of-thought
+- Reasoning
 license: apache-2.0
 language:
 - en
-metrics:
-- accuracy
 new_version: Daemontatox/CogitoZ
 library_name: transformers
+datasets:
+- PJMixers/Math-Multiturn-100K-ShareGPT
 ---
 ![image](./image.webp)
-# CogitoZ -
+# CogitoZ - 32B
 
 ## Model Overview
 
-CogitoZ -
+CogitoZ - 32B is a state-of-the-art large language model fine-tuned to excel in advanced reasoning and real-time decision-making tasks. This enhanced version was trained using [Unsloth](https://github.com/unslothai/unsloth), achieving a 2x faster training process. Leveraging Hugging Face's TRL (Transformers Reinforcement Learning) library, CogitoZ combines efficiency with exceptional reasoning performance.
 
 - **Developed by**: Daemontatox
 - **License**: Apache 2.0
 - **Base Model**: [Qwen/QwQ-32B-Preview](https://huggingface.co/Qwen/QwQ-32B-Preview)
-- **Finetuned
+- **Finetuned To**: [Daemontatox/CogitoZ](https://huggingface.co/Daemontatox/CogitoZ)
 
 [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
@@ -95,10 +97,4 @@ The fine-tuning process utilized reasoning-specific datasets, including:
 **Safe Deployment** **->** Not recommended for generating harmful or unethical content.
 
 ## Acknowledgments
-This model was developed with contributions from Daemontatox and the Unsloth team, utilizing state-of-the-art techniques in fine-tuning and optimization.
-
-For more information or collaboration inquiries, please contact:
-
-Author: Daemontatox
-GitHub: Daemontatox GitHub Profile
-Unsloth: Unsloth GitHub
+This model was developed with contributions from Daemontatox and the Unsloth team, utilizing state-of-the-art techniques in fine-tuning and optimization.
```