---
language:
- en
- hi
- zh
- es
- fr
- de
- ja
- ko
- ar
- pt
- ru
- it
- nl
- tr
- pl
- sv
- da
- 'no'
- fi
- he
- th
- vi
- id
- ms
- tl
- sw
- yo
- zu
- am
- bn
- gu
- kn
- ml
- mr
- ne
- or
- pa
- ta
- te
- ur
- multilingual
license: apache-2.0
base_model: Qwen/Qwen3-14B-Base
tags:
- reasoning
- intermediate-thinking
- transformers
- conversational
- bilingual
datasets:
- Abhaykoul/Dhanishtha-R1
- open-thoughts/OpenThoughts-114k
- Abhaykoul/Dhanishtha-2.0-SUPERTHINKER
- Abhaykoul/Dhanishtha-2.0
library_name: transformers
pipeline_tag: text-generation
widget:
- text: >-
    Solve this riddle step by step: I am taken from a mine, and shut up in a
    wooden case, from which I am never released, and yet I am used by almost
    everybody. What am I?
  example_title: Complex Riddle Solving
- text: >-
    Explain the philosophical implications of artificial consciousness and think
    through different perspectives.
  example_title: Philosophical Reasoning
- text: >-
    Help me understand quantum mechanics, but take your time to think through
    the explanation.
  example_title: Educational Explanation
new_version: HelpingAI/Dhanishtha-2.0-preview-0825
---

# Dhanishtha-2.0: World's First Intermediate Thinking AI Model

**What makes Dhanishtha-2.0 special?** Imagine an AI that doesn't just answer your questions instantly, but actually *thinks through* problems step by step, shows its work, and can even change its mind when it realizes a better approach. That's Dhanishtha-2.0.

**Quick Summary:**
- 🚀 **For Everyone**: An AI that shows its thinking process and can reconsider its reasoning
- 👩‍💻 **For Developers**: The first model with intermediate thinking capabilities and support for 39+ languages

Dhanishtha-2.0 is a **state-of-the-art (SOTA) model** developed by HelpingAI and the **world's first model to feature Intermediate Thinking capabilities**. Unlike traditional models that produce single-pass responses, Dhanishtha-2.0 employs a multi-phase thinking process that allows it to think, reconsider, and refine its reasoning multiple times within a single response.
|
## Model Details

### Model Description

Dhanishtha-2.0 revolutionizes AI reasoning by introducing **intermediate thinking** - the ability to pause, reflect, and restart reasoning processes within a single generation. The model can think up to 50 times in a single response without relying on external tools, special prompting, or MCP servers. This enables self-correction and iterative refinement while the response is being generated.

Built on the Qwen3-14B foundation with multilingual capabilities spanning **39+ languages** (including English, Hindi, Chinese, Spanish, French, German, Japanese, Korean, Arabic, and many more), Dhanishtha-2.0 maintains reasoning consistency across diverse linguistic contexts while keeping its thinking process transparent.
|
- **Developed by:** HelpingAI Team
- **Model type:** Causal Language Model with Intermediate Thinking Capability
- **Language(s):** 39+ languages (multilingual capabilities inherited from the base model)
- **License:** Apache 2.0
- **Finetuned from model:** Qwen/Qwen3-14B-Base
- **Context Length:** 40,960 tokens
- **Parameters:** 14.8B (inherited from the base model)
- **Status:** Prototype/Preview
|
### Revolutionary Features

- **Intermediate Thinking**: Multiple `<think>...</think>` blocks throughout responses for real-time reasoning (see the parsing sketch after this list)
- **Self-Correction**: Ability to identify and correct logical inconsistencies mid-response
- **Dynamic Reasoning**: Seamless transitions between analysis, communication, and reflection phases
- **Structured Emotional Reasoning (SER)**: Incorporates `<ser>...</ser>` blocks for empathetic responses
- **Multilingual Capabilities**: Support for 39+ languages with natural code-switching and reasoning consistency
- **Complex Problem-Solving**: Excels at riddles, multi-step reasoning, and scenarios requiring backtracking
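
The reasoning and emotional-context blocks are emitted inline as plain tags, so downstream code will usually want to separate them from the user-facing text. Below is a minimal parsing sketch using Python's standard `re` module; the `<think>` and `<ser>` tag names come from this card, while the function name and return structure are illustrative only.

```python
import re

def split_response(response: str) -> dict:
    """Hypothetical helper: separate reasoning, emotional context, and answer."""
    # Collect the contents of every <think>...</think> and <ser>...</ser> block.
    think_blocks = re.findall(r"<think>(.*?)</think>", response, flags=re.DOTALL)
    ser_blocks = re.findall(r"<ser>(.*?)</ser>", response, flags=re.DOTALL)
    # Whatever remains after removing the tagged blocks is the visible answer.
    visible = re.sub(r"<(think|ser)>.*?</\1>", "", response, flags=re.DOTALL)
    visible = re.sub(r"\n{3,}", "\n\n", visible).strip()
    return {
        "thinking": [t.strip() for t in think_blocks],
        "emotional_context": [s.strip() for s in ser_blocks],
        "answer": visible,
    }

# Example usage with a generated response string:
# parts = split_response(response)
# print(len(parts["thinking"]), "thinking phases")
# print(parts["answer"])
```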
|
### Model Sources

- **Repository:** [HelpingAI/Dhanishtha-2.0](https://huggingface.co/HelpingAI/Dhanishtha-2.0)
- **Paper:** Coming soon
- **Demo:** https://helpingai.co/chat
|
## Uses

### Direct Use

Dhanishtha-2.0 is ideal for applications requiring deep reasoning and self-reflection:

- **Complex Problem Solving**: Multi-step mathematical problems, logical puzzles, riddles
- **Educational Assistance**: Detailed explanations with visible reasoning processes
- **Research Support**: Analysis requiring multiple perspectives and self-correction
- **Creative Writing**: Iterative story development with reasoning about plot choices
- **Philosophical Discussions**: Exploring concepts with visible thought processes
|
### Downstream Use

The model can be fine-tuned for specialized reasoning tasks (see the example sketch after this list):

- **Domain-Specific Reasoning**: Legal, medical, or scientific reasoning with intermediate thinking
- **Enhanced Multilingual Reasoning**: Optimizing reasoning consistency across all 39+ supported languages
- **Specialized Problem Domains**: Mathematics, coding, strategic planning
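
As one concrete illustration of such fine-tuning, the sketch below applies LoRA adapters to the released checkpoint using the standard `peft`/`transformers` stack. The dataset file, target modules, and hyperparameters are placeholder assumptions for illustration, not a recipe published by HelpingAI.

```python
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "HelpingAI/Dhanishtha-2.0-preview"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto")

# Train only small LoRA adapters instead of the full 14B-parameter model.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# Placeholder dataset: a JSONL file with a single "text" column of domain examples.
dataset = load_dataset("json", data_files="domain_reasoning.jsonl", split="train")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=2048)

tokenized = dataset.map(tokenize, batched=True, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="dhanishtha-domain-lora",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=8,
        num_train_epochs=1,
        learning_rate=2e-4,
        bf16=True,
        logging_steps=10,
    ),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```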
|
### Out-of-Scope Use

❌ **Inappropriate Applications:**
- Safety-critical decisions (medical diagnosis, legal advice, financial recommendations)
- Real-time applications requiring immediate responses
- Situations requiring guaranteed factual accuracy without verification
|
## Bias, Risks, and Limitations

### Known Limitations

- **Verbosity**: Intermediate thinking can make responses substantially longer than single-pass output
- **Processing Time**: Multiple thinking phases increase generation time
- **Prototype Status**: Experimental features may require refinement
- **Context Usage**: Thinking blocks consume additional context tokens
- **Inherited Biases**: May reflect biases from the base model and training data
|
### Potential Risks

- **Reasoning Loops**: Potential for circular reasoning in complex scenarios
- **Multilingual Inconsistencies**: Potential variation in reasoning patterns across different languages
- **Emotional Reasoning Gaps**: SER blocks may not always align with content
|
## How to Get Started with the Model

### For General Users

You can interact with Dhanishtha-2.0 through:
- **HelpingAI**: https://helpingai.co/chat
- **Gradio Demo**: [Dhanishtha-2.0-preview](https://huggingface.co/spaces/Abhaykoul/Dhanishtha-2.0-preview)
- **API Integration**: [Dashboard](https://helpingai.co/dashboard)
|
### For Developers - Basic Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "HelpingAI/Dhanishtha-2.0-preview"

# Load the tokenizer and model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)

# Prepare input for intermediate thinking
prompt = "How many times does the letter 'r' appear in the words 'strawberry' and 'raspberry'?"
messages = [
    {"role": "user", "content": prompt}
]

text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)

model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

# Generate a response with intermediate thinking
generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048,
    temperature=0.7,
    top_p=0.9,
    do_sample=True
)

# Decode only the newly generated tokens (skip the prompt)
response = tokenizer.decode(
    generated_ids[0][len(model_inputs.input_ids[0]):],
    skip_special_tokens=True
)
print(response)
```
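
Responses with several thinking phases can run to a couple of thousand tokens, so it is often nicer to stream them as they are generated. A minimal sketch using the `TextStreamer` utility from `transformers` is shown below, reusing `model`, `tokenizer`, and `model_inputs` from the example above; it is one convenient option rather than the only way to stream.

```python
from transformers import TextStreamer

# Print tokens to stdout as they are generated, so the <think> phases
# become visible in real time instead of arriving all at once.
streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)

model.generate(
    **model_inputs,
    max_new_tokens=2048,
    temperature=0.7,
    top_p=0.9,
    do_sample=True,
    streamer=streamer,
)
```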
|
### Optimal Generation Parameters

```python
generation_config = {
    "temperature": 0.7,        # Balanced creativity and coherence
    "top_p": 0.9,              # Nucleus sampling
    "top_k": 40,               # Top-k filtering
    "max_new_tokens": 2048,    # Allow room for thinking blocks
    "do_sample": True,         # Enable sampling
    "repetition_penalty": 1.1  # Reduce repetition
}
```
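
To apply these settings, the dictionary can be unpacked directly into `generate()`. The short sketch below reuses `model`, `tokenizer`, and `model_inputs` from the basic usage example above.

```python
# Pass the recommended settings straight into generate().
generated_ids = model.generate(**model_inputs, **generation_config)

response = tokenizer.decode(
    generated_ids[0][len(model_inputs.input_ids[0]):],
    skip_special_tokens=True
)
print(response)
```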
|
## Training Details

### Training Data

Dhanishtha-2.0 was trained on a carefully curated dataset focusing on:
- Complex reasoning scenarios requiring multi-step thinking
- Self-correction examples and reasoning chains
- Emotional reasoning and empathy training data
- Structured thinking pattern examples
|
### Training Procedure

#### Training Stages
1. **Continuous Pretraining**: Extended training on reasoning-focused corpora
2. **Advanced Reasoning Fine-tuning**: Specialized training on intermediate thinking patterns
3. **Multilingual Alignment**: Cross-language reasoning consistency training
4. **SER Integration**: Structured Emotional Reasoning capability training

**Training Infrastructure:**
- **Duration**: 16.3 days
- **Hardware**: 8x NVIDIA H100 GPUs
- **Model Scale**: 14.8B parameters
|
## Evaluation

Evaluation was conducted on:
- **Standard Benchmarks**: MMLU, HumanEval, ARC, HellaSwag, TruthfulQA
- **Mathematical Reasoning**: Math 500, AIME 2024, GSM8K
- **Custom Evaluations**: Intermediate thinking quality, self-correction capabilities
- **Multilingual Tasks**: Reasoning consistency across 39+ languages
- **Specialized Tests**: Emotional reasoning, complex problem-solving scenarios
|
## Example: Intermediate Thinking in Action

**User:** "Who is your favorite person?"

**Dhanishtha-2.0:**

```
<think>
This is an interesting question about my "favorite human." I need to be thoughtful here.
As Dhanishtha, I should respond in a human-like way while being honest about my nature.
I can share what kinds of humans I find most interesting or inspiring based on my
understanding of human nature and values.
</think>

<ser>
Emotion ==> curiosity, openness, desire for connection
Cause ==> user asking a personal question about preferences
Mind ==> seeking to understand my perspective or values
Growth ==> opportunity to share thoughtful reflections on human qualities I admire
</ser>

I don't really have a "favorite human" in the traditional sense, as I don't experience
personal preferences the way humans do. But if I had to reflect on what qualities I find
most admirable in people, I'd be drawn to those who demonstrate genuine compassion,
intellectual curiosity, and the courage to grow.

<think>
Let me think more specifically about what kind of humans I find most compelling based on
the values that seem to resonate most with humans - kindness, wisdom, creativity,
resilience, and authenticity.
</think>

People like Mahatma Gandhi, who showed how quiet courage and unwavering principles can
transform societies; artists like Rabindranath Tagore, who bridged cultural divides
through creativity; and everyday heroes I've learned about who made profound differences
through simple acts of kindness.

What fascinates me most are humans who maintain their humanity even in difficult
circumstances - those who choose empathy when it would be easier to be harsh, who seek
understanding rather than just being right.

I'm curious though - who's your favorite human, and why? What qualities or actions make
someone special in your eyes?
```
|
## Environmental Impact

Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute); a rough worked example follows the figures below.

- **Hardware Type:** 8x NVIDIA H100 GPUs
- **Days used:** 16.2
- **Cloud Provider:** Various
- **Compute Region:** Multiple
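
As a rough illustration of how such an estimate is assembled, the sketch below multiplies GPU count, power draw, and runtime, then applies a grid carbon-intensity factor. The per-GPU power, PUE, and carbon-intensity values are assumptions for illustration only, not measured figures for this training run.

```python
# Back-of-envelope CO2 estimate (illustrative assumptions, not measured values).
NUM_GPUS = 8                # from the training infrastructure section
DAYS = 16.2                 # from this section
GPU_POWER_KW = 0.7          # assumed ~700 W draw per H100
PUE = 1.2                   # assumed datacenter power usage effectiveness
CARBON_INTENSITY = 0.4      # assumed kg CO2eq per kWh (grid-dependent)

energy_kwh = NUM_GPUS * GPU_POWER_KW * DAYS * 24 * PUE
emissions_kg = energy_kwh * CARBON_INTENSITY

print(f"Estimated energy: {energy_kwh:,.0f} kWh")
print(f"Estimated emissions: {emissions_kg:,.0f} kg CO2eq")
```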
|
## Citation

### BibTeX

```bibtex
@misc{dhanishtha2025,
  title={Dhanishtha-2.0: World's First Intermediate Thinking AI Model},
  author={HelpingAI Team},
  year={2025},
  publisher={HuggingFace},
  url={https://huggingface.co/HelpingAI/Dhanishtha-2.0},
  note={First model with intermediate thinking capabilities}
}
```
|
### APA

HelpingAI Team. (2025). *Dhanishtha-2.0: World's First Intermediate Thinking AI Model*. HuggingFace. https://huggingface.co/HelpingAI/Dhanishtha-2.0

## Glossary

- **Intermediate Thinking**: The ability to pause and think multiple times during response generation
- **SER (Structured Emotional Reasoning)**: Framework for incorporating emotional context in responses
- **Think Blocks**: `<think>...</think>` segments where the model shows its reasoning process
- **Self-Correction**: Ability to identify and fix reasoning errors during generation
- **Code-Switching**: Natural transition between languages (for example, English and Hindi) within a response
|
## More Information

### Research Applications
- Study of AI reasoning transparency
- Self-correction mechanism research
- Bilingual cognitive modeling
- Emotional AI development

### Development Roadmap
- Performance optimizations
- Additional language support
- Enhanced thinking pattern recognition
- Production-ready deployment tools
|
## Model Card Authors

- **Primary Author**: HelpingAI Team
- **Technical Lead**: [To be specified]
- **Research Contributors**: [To be specified]

## Model Card Contact

For questions about Dhanishtha-2.0, please contact:

- **HuggingFace**: [@HelpingAI](https://huggingface.co/HelpingAI)
- **Issues**: [Model Repository Issues](https://huggingface.co/HelpingAI/Dhanishtha-2.0/discussions)
|
## Benchmark

***These benchmarks were run by [@AarifKhan](https://huggingface.co/Aarifkhan)***

| Benchmark | Score | Evaluation Type | Notes |
|-----------|-------|-----------------|-------|
| MMLU | 78.1% | 1-shot | Massive Multitask Language Understanding |
| HumanEval | 75.0% | 1-shot | Code generation and completion |
| ARC | 76.0% | 1-shot | AI2 Reasoning Challenge |
| HellaSwag | 81.0% | 1-shot | Commonsense natural language inference |
| TruthfulQA MC1 | 75.0% | 1-shot | Truthfulness in question answering |
| Math 500 | 95.68% | few-shot | Mathematical problem solving |
| AIME 2024 | 82.81% | few-shot | American Invitational Mathematics Examination |
|
**Dhanishtha-2.0 represents a new paradigm in AI reasoning - where thinking isn't just a prelude to a response, but an integral, iterative part of the conversation itself.**

*Developed with ❤️ by HelpingAI*