File size: 4,464 Bytes
239825a
 
 
 
 
 
 
 
 
54544eb
239825a
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
---
library_name: transformers
license: apache-2.0
datasets:
- Abhaykoul/Dhanishtha-R1
- open-thoughts/OpenThoughts-114k
language:
- en
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
---
# Dhanishtha Overview

Dhanishtha is a cutting-edge reasoning AI model developed by **HelpingAI**, designed for deep introspection and structured logical analysis. Unlike traditional models that generate immediate responses, Dhanishtha employs a unique **deep-thinking process** process—an internal deliberation phase that enhances reasoning depth before presenting refined answers.

## Model Capabilities
Dhanishtha operates in **Dhanishtha Mode**, inspired by the **Dhanishtha Nakshatra**, known for wisdom, rhythm, and intellectual depth. The model engages in a multi-step thought process before providing responses, ensuring high accuracy and coherence.

### Key Features:
- **Structured Internal Reasoning:** Engages in self-dialogue within `<think></think>` tags, iterating through ideas and refining its thought process before responding.
- **Progressive Thought Refinement:** Evaluates multiple perspectives, making logical connections and ensuring a well-rounded answer.
- **Emotionally Intelligent Conversational Style:** Responses are expressive, engaging, and tailored for natural human interaction.
- **Optimized for Critical Thinking & Problem-Solving:** Excels in analytical reasoning, debate, and deep philosophical discussions.
- **Context Awareness:** Maintains logical coherence in extended interactions, avoiding contradictions and ensuring smooth thought progression.

## Training & Architecture
- **Model Size:** Optimized for high-performance reasoning with balanced efficiency.
- **Training Approach:** Fine-tuned using advanced structured learning techniques to enhance deliberative thinking and introspective processing.
- **Data Sources:** Trained on a diverse dataset covering philosophy, critical reasoning, and problem-solving scenarios to develop a deep intellectual foundation.

## Performance & Benchmarks
Dhanishtha outperforms conventional models in structured reasoning and contextual depth. The model has been rigorously evaluated across various metrics, demonstrating significant improvements in:
- **Logical Coherence & Argumentation:** Enhanced ability to follow complex discussions and construct persuasive arguments.
- **Depth of Analysis:** Excels in breaking down intricate topics into clear, structured responses.
- **Adaptive Conversational Flow:** Seamlessly shifts between casual and analytical tones based on user input.

## Deployment & Use Cases
Dhanishtha is designed for:
- **High-precision academic and philosophical discussions**
- **Deep problem-solving and strategic reasoning**
- **Engaging and thought-provoking conversations**
- **Use in AI-driven research and advanced dialogue systems**

## Benchmarks
We report Pass@1 accuracy averaged over 16 samples for each problem.

| Model                         | AIME 2024 | MATH 500 | AMC 2023 | Minerva Math | OlympiadBench | Avg. |
|-------------------------------|-----------|----------|----------|--------------|----------------|------|
| 2.5-7B-Instruct               | 13.3      | 79.8     | 50.6     | 34.6         | 40.7           | 43.8 |
| rStar-Math-7B                 | 26.7      | 78.4     | 47.5     | -            | 47.1           | -    |
| Eurus-2.7B-PRIME              | 26.7      | 79.2     | 57.8     | 38.6         | 42.1           | 48.9 |
| Qwen2.5-7B-SimpleRL           | 26.7      | 82.4     | 62.5     | 39.7         | 43.3           | 50.9 |
| DeepSeek-R1-Distill-Qwen-1.5B | 28.8      | 82.8     | 62.9     | 26.5         | 43.3           | 48.9 |
| Still-1.5B                    | 32.5      | 84.4     | 66.7     | 29.0         | 45.4           | 51.6 |
| DeepScaleR-1.5B-Preview       | 43.1      | 87.8     | 73.6     | 30.2         | 50.0           | 57.0 |
| O1-Preview                    | 40.0      | 81.4     | -        | -            | -              | -    |
| **Dhanishta**                    | 38.2      | 85.1     | 70.3     | 30.5         | 42.0           | 53.2 |
| **Dhanishta-Large**                    | -      | -    | -     | -         | -           | - |


## Credits & License
Dhanishtha is developed and maintained by **HelpingAI**, pushing the boundaries of AI-driven introspection and structured reasoning. The model is open-source and community-driven, encouraging contributions and collaborative innovation.