Instructions to use razielAI/Duchifat-2.3-Instruct with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use razielAI/Duchifat-2.3-Instruct with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="razielAI/Duchifat-2.3-Instruct", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("razielAI/Duchifat-2.3-Instruct", trust_remote_code=True, dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use razielAI/Duchifat-2.3-Instruct with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "razielAI/Duchifat-2.3-Instruct"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "razielAI/Duchifat-2.3-Instruct",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/razielAI/Duchifat-2.3-Instruct

SGLang

How to use razielAI/Duchifat-2.3-Instruct with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "razielAI/Duchifat-2.3-Instruct" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "razielAI/Duchifat-2.3-Instruct",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "razielAI/Duchifat-2.3-Instruct" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "razielAI/Duchifat-2.3-Instruct",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use razielAI/Duchifat-2.3-Instruct with Docker Model Runner:
```
docker model run hf.co/razielAI/Duchifat-2.3-Instruct
```

Duchifat-2.3-Instruct

File size: 6,615 Bytes

---
library_name: transformers
tags:
- chemistry
- biology
- finance
- legal
- music
- code
- art
- climate
- medical
- agent
- text-generation-inference
- duchifat-2
- hebrew
- AI
- conversational
- chatty
license: apache-2.0
language:
- he
- en
base_model:
- Raziel1234/Duchifat-2
pipeline_tag: text-generation
---
# 🕊️ Duchifat-2.3-Instruct: The Paradigm Shift in Hebrew AI

**Duchifat-2.3-Instruct** is a state-of-the-art, instruction-tuned Large Language Model developed by **TopAI**. As the flagship of the Duchifat series, this model represents a fundamental breakthrough in how Hebrew is processed, reasoned, and generated in the LLM era.

## 💎 The "Language-Native" Architecture

The core innovation of **Duchifat-2.3** lies in its **Language-Native Reasoning** engine. While most models suffer from a "Translation Gap"—reasoning in English and translating to Hebrew—Duchifat-2.3 was architected to bridge this divide.

### 🧠 Native Cognitive Processing
By optimizing the model's internal weights and tokenizer for Hebrew-specific structures, we have achieved a system that:
- **Internalizes Hebrew Logic:** The model's "Chain of Thought" is executed natively in Hebrew, preserving the unique semantic and syntactic nuances of the language.
- **Eliminates Syntactic Artifacts:** Unlike translated models, Duchifat-2.3 produces text that flows naturally, avoiding the stiff and robotic feel of English-to-Hebrew conversion.
- **Enhanced Token Efficiency:** The specialized architecture allows for a more dense and accurate representation of Hebrew text, leading to faster inference and better context retention.

---

## 🚀 Advanced Instruction Tuning & Alignment

Duchifat-2.3-Instruct has undergone a sophisticated Supervised Fine-Tuning (SFT) process designed to transform a raw base model into a highly capable, mission-aligned assistant.

### 🛡️ Ethical Generalization & Safety
One of the model's most impressive feats is its ability to generalize safety protocols. It doesn't just rely on a static list of blocked words; it understands the **intent and context** of human interaction.
- **Zero-Shot Moderation:** The model can identify and appropriately handle offensive content, slurs, and harmful prompts it has never encountered during training.
- **Value-Locked Alignment:** The "TopAI" safety standards are deeply embedded, ensuring the model remains helpful, harmless, and honest across all domains.

### 🤖 Multi-Domain Mastery
The model is tuned to excel in diverse environments:
- **Technical & Scientific Research:** Deep understanding of AI architecture, software development, and complex data analysis.
- **Creative & Cultural Context:** Native fluency in Israeli idioms, professional drafting, and nuanced storytelling.
- **Logical Reasoning:** High performance in solving complex puzzles and following multi-stage instructions.

---

## 🎨 The Duchifat Persona: A Digital Partner

We believe that interaction is as important as information. Duchifat-2.3-Instruct carries a unique, refined persona:
- **Quirky & Engaging:** It balances professional rigor with an approachable, brand-aligned voice.
- **Adaptive Tone:** Seamlessly shifts between formal technical documentation and casual, helpful conversation.
- **Identity-Aware:** The model "knows" who it is and remains consistent in its role as a specialized AI assistant.

---

## 🏗️ Technical Specifications

- **Developer:** TopAI
- **Architecture:** Causal Decoder-Only Transformer.
- **Primary Objective:** Hebrew-Native Instruction Following.
- **Secondary Capability:** Full English Fluency and Cross-Lingual reasoning.
- **Optimization:** Optimized for high-precision inference and minimal catastrophic forgetting.

---

## 📊 Benchmark Results

The following evaluation was performed using `lm-evaluation-harness` (0-shot) to assess the model's core reasoning and common-sense capabilities.

| Task | Metric | Value | Significance |
| :--- | :--- | :--- | :--- |
| **PIQA** | Accuracy | **53.65%** | Above Random Guessing |
| **WinoGrande** | Accuracy | **52.25%** | Above Random Guessing |
| **ARC-Easy** | Accuracy (Norm) | **27.86%** | Baseline Performance |
| **HellaSwag** | Accuracy | **25.94%** | Baseline Performance |

**Analysis:**
Duchifat-2.3-Instruct shows its strongest performance in binary-choice logic tasks (**PIQA** and **WinoGrande**), consistently outperforming random chance. While multi-choice benchmarks like ARC and HellaSwag remain at baseline levels, this is a common trade-off for models aggressively fine-tuned for conversational alignment and Hebrew-native reasoning.

## Use

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# הגדרות - טעינה מה-Hub
MODEL_ID = "razielAI/Duchifat-2.3-Instruct"
device = "cuda" if torch.cuda.is_available() else "cpu"

# טעינה
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    trust_remote_code=True,
    torch_dtype=torch.bfloat16 if torch.cuda.is_available() else torch.float32
).to(device)

def chat():
    print("✨ Duchifat-2 Online (TopAI) | Type 'exit' to quit")
    while True:
        user_input = input("\n👤 User: ")
        if user_input.lower() in ["exit", "quit", "יציאה"]: break

        # בניית הפרומפט עם הטוקנים המיוחדים
        prompt = f"<|instruction|>\n{user_input}\n<|assistant|>\n"
        inputs = tokenizer(prompt, return_tensors="pt").to(device)

        # יצירה
        with torch.no_grad():
            output_tokens = model.generate(
                **inputs,
                max_new_tokens=256,
                temperature=0.7,
                do_sample=True,
                pad_token_id=tokenizer.eos_token_id,
                eos_token_id=tokenizer.encode("<|eos|>", add_special_tokens=False)[0]
            )

        # פיענוח והצגת התשובה בלבד
        decoded = tokenizer.decode(output_tokens[0], skip_special_tokens=False)
        response = decoded.split("<|assistant|>")[-1].replace("<|eos|>", "").strip()

        print(f"🤖 Duchifat-2: {response}")

if __name__ == "__main__":
    chat()
```

## 🌍 Impact and Mission

Duchifat-2.3-Instruct is more than a model; it is a statement on the future of specialized AI. By proving that a dedicated, language-native approach can outperform general-purpose "translation" models, **TopAI** is setting a new standard for the Israeli and global tech ecosystem.

---
**Developed with technical excellence and linguistic precision by TopAI.**