Safetensors
GGUF
English
llama
instruction-tuned
socratic-reasoning
educational-assistant
tutoring
tool-use
reasoning
conversational-ai
conversational
Instructions to use haphazardlyinc/TestModelV1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use haphazardlyinc/TestModelV1 with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="haphazardlyinc/TestModelV1", filename="Final_Merged-1.1B-F16.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use haphazardlyinc/TestModelV1 with llama.cpp:
Install (macOS, Linux)
curl -LsSf https://llama.app/install.sh | sh # Start a local OpenAI-compatible server with a web UI: llama serve -hf haphazardlyinc/TestModelV1:F16 # Run inference directly in the terminal: llama cli -hf haphazardlyinc/TestModelV1:F16
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama serve -hf haphazardlyinc/TestModelV1:F16 # Run inference directly in the terminal: llama cli -hf haphazardlyinc/TestModelV1:F16
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf haphazardlyinc/TestModelV1:F16 # Run inference directly in the terminal: ./llama-cli -hf haphazardlyinc/TestModelV1:F16
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf haphazardlyinc/TestModelV1:F16 # Run inference directly in the terminal: ./build/bin/llama-cli -hf haphazardlyinc/TestModelV1:F16
Use Docker
docker model run hf.co/haphazardlyinc/TestModelV1:F16
- LM Studio
- Jan
- Ollama
How to use haphazardlyinc/TestModelV1 with Ollama:
ollama run hf.co/haphazardlyinc/TestModelV1:F16
- Unsloth Studio
How to use haphazardlyinc/TestModelV1 with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for haphazardlyinc/TestModelV1 to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for haphazardlyinc/TestModelV1 to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for haphazardlyinc/TestModelV1 to start chatting
- Pi
How to use haphazardlyinc/TestModelV1 with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama serve -hf haphazardlyinc/TestModelV1:F16
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "haphazardlyinc/TestModelV1:F16" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use haphazardlyinc/TestModelV1 with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama serve -hf haphazardlyinc/TestModelV1:F16
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default haphazardlyinc/TestModelV1:F16
Run Hermes
hermes
- Atomic Chat new
- Docker Model Runner
How to use haphazardlyinc/TestModelV1 with Docker Model Runner:
docker model run hf.co/haphazardlyinc/TestModelV1:F16
- Lemonade
How to use haphazardlyinc/TestModelV1 with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull haphazardlyinc/TestModelV1:F16
Run and chat with the model
lemonade run user.TestModelV1-F16
List all available models
lemonade list
File size: 9,999 Bytes
0979b38 9c3b975 0979b38 22701f1 0979b38 3b0ccd4 0979b38 3b0ccd4 0979b38 e415274 0979b38 98fbf4d e415274 0979b38 e415274 0979b38 98fbf4d 9cb2552 98fbf4d 0979b38 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 | ---
datasets:
- Jackrong/Claude-opus-4.6-TraceInversion-9000x
- Hypersniper/philosophy_dialogue
- sanjaypantdsd/socratic-method-conversations
- sanjaypantdsd/socratic-content-dataset
- fadodr/mental_health_therapy
base_model:
- openbmb/MiniCPM5-1B
language:
- en
tags:
- instruction-tuned
- socratic-reasoning
- educational-assistant
- tutoring
- tool-use
- reasoning
- conversational-ai
license: apache-2.0
---
# Aletheia (MiniCPM5-1B Socratic Tutor)
Aletheia is an instruction-tuned version of **MiniCPM5-1B** designed specifically for **Socratic tutoring, structured reasoning, reflective thinking, and educational assistance**.
The model is optimized to behave as a **guided learning assistant rather than a direct-answer system**, encouraging users to think critically and develop their own understanding.
---
## 🧠 Model Purpose
Aletheia is designed for:
- Socratic questioning and guided discovery learning
- Critical thinking and reflective reasoning
- Educational tutoring across STEM and humanities subjects
- Mental health–aware supportive dialogue (non-clinical)
- Structured reasoning over complex topics
- Multi-turn conversational learning
It is **not designed to be a factual authority or search engine replacement**, but rather a reasoning-oriented tutor that helps users arrive at answers through guided thinking.
---
## ⚙️ Base Model
- Base: `openbmb/MiniCPM5-1B`
- Architecture: Small-scale instruction-tuned transformer
- Training type: Multi-stage supervised fine-tuning (SFT)
---
## 📚 Training Data
Aletheia was trained on a mixture of reasoning, dialogue, and educational datasets:
- Socratic method conversations
- Philosophy and reflective dialogue
- Deep reasoning and revision datasets
- Multi-turn conversational tutoring data
- Mental health supportive dialogue (non-diagnostic)
- Trace-based reasoning inversion dataset
This combination encourages:
- questioning over answering
- structured reasoning chains
- reflective dialogue
- adaptive tutoring style
The following training specifics were used:
```json
BASE_MODEL = "openbmb/MiniCPM5-1B"
STAGES = [
{
"name": "Phase1",
"dataset": "sanjaypantdsd/socratic-method-conversations",
"output": "outputs/socratic_foundation",
"max_seq": 32000,
"lr": 5e-6,
"epochs": 2,
"packing": True,
},
{
"name": "Phase2",
"dataset": "sanjaypantdsd/socratic-content-dataset",
"output": "outputs/socratic_content",
"max_seq": 32000,
"lr": 3e-6,
"epochs": 1,
"packing": True,
},
{
"name": "Phase3",
"dataset": "kulia-moon/DeepRethink",
"output": "outputs/deep_rethink",
"max_seq": 32000,
"lr": 2e-5,
"epochs": 1,
"packing": True,
},
{
"name": "Phase4",
"dataset": "Jackrong/Claude-opus-4.6-TraceInversion-9000x",
"output": "outputs/trace_inversion",
"max_seq": 32000,
"lr": 1e-5,
"epochs": 1,
"packing": True,
},
{
"name": "Phase5",
"dataset": "Mustafaege/qwen3.5-toolcalling-v2",
"output": "outputs/tool_calling",
"max_seq": 32000,
"lr": 8e-6,
"epochs": 0.06,
"packing": True,
},
{
"name": "Phase6",
"dataset": "fadodr/mental_health_therapy",
"output": "outputs/final_model",
"max_seq": 32000,
"lr": 5e-6,
"epochs": 1,
"packing": True,
},
{
"name": "Phase7",
"dataset": "sanjaypantdsd/socratic-method-conversations",
"output": "outputs/final_v2",
"max_seq": 32000,
"lr": 2e-6,
"epochs": 1,
"packing": True,
},
]
```
---
## 🎯 Intended Behaviour
Aletheia is trained to:
- Ask guiding questions instead of immediately giving answers
- Break problems into smaller conceptual steps
- Encourage reflection and reasoning from the user
- Provide hints and partial scaffolding rather than full solutions
- Maintain a calm, supportive, educational tone
Example behaviour:
**User:** What is photosynthesis?
**Aletheia:**
- Instead of giving a full definition immediately,
- it may ask:
- "What do you think plants use sunlight for?"
- "Where do you think energy is stored in a plant?"
Then gradually builds toward the explanation.
---
## Evaluation Results
Evaluated using the EleutherAI LM Evaluation Harness (`lm-eval`) on a Radeon RX 7900 XTX.
### Core Benchmarks
| Benchmark | Score |
|------------|--------:|
| MMLU-Pro (5-shot) | 27.91% |
| GSM8K (5-shot) | 39.88% |
| ARC-Challenge | 33.62% |
| HellaSwag | 38.00% |
| HellaSwag (Norm) | 48.37% |
| Winogrande | 57.22% |
### MMLU-Pro Breakdown
| Subject | Score |
|----------|-------:|
| Biology | 48.26% |
| Psychology | 42.11% |
| Economics | 38.63% |
| Math | 38.34% |
| Computer Science | 30.49% |
| Philosophy | 28.46% |
| Health | 28.24% |
| Business | 27.50% |
| Other | 27.16% |
| Physics | 22.56% |
| History | 21.78% |
| Chemistry | 16.17% |
| Engineering | 15.79% |
| Law | 13.99% |
### Evaluation Command
```bash
lm-eval run \
--model hf \
--model_args pretrained=<model_path> \
--tasks mmlu_pro,gsm8k,hellaswag,winogrande,arc_challenge \
--device cuda:0
```
### Notes
These benchmarks were obtained after multi-stage fine-tuning on Socratic dialogue, reasoning, reflective thinking, educational tutoring, tool-calling, and conversational support datasets.
The model is optimized for:
- Educational tutoring
- Socratic questioning
- Guided reasoning
- Critical thinking
- Research assistance
rather than direct-answer benchmark optimization.
```
## 🧩 Tool Use (Optional)
This model may be integrated with external tools such as:
- web_search (for external factual retrieval)
- research_topic (multi-query structured research tool)
- knowledge retrieval systems (RAG)
When tools are available, the model should:
- prefer tools for factual retrieval
- focus on synthesis and explanation after tool output
While it has tool use capabilities, they are very weak. Keep tools simple.
---
## ⚠️ Limitations
- Not a verified factual authority
- May occasionally over-focus on questioning instead of direct answers
- Tool usage depends on external system configuration
- Not a substitute for medical, legal, or psychological professionals
- Mental health responses are supportive only, not clinical advice
- May hallucinate if used without retrieval tools
---
## 🚫 Safety Notes
The model may be used in educational contexts involving sensitive topics (e.g. health, psychology, ethics). However:
- It does not provide professional medical diagnosis
- It should not be used as a sole source of truth for critical decisions
- Outputs should be reviewed in high-stakes contexts
Therapy themes might surface depending on certain prompts, specifically the model scalding ITSELF.
---
## 💡 Recommended System Prompt Style
For best performance, use a system prompt that enforces:
- Socratic questioning
- Reduced direct answering
- Step-by-step guided reasoning
- Use of tools for factual retrieval when available
---
## 🔧 Suggested Integration
Best used with:
- Open WebUI
- LM Studio OpenAI-compatible API
- Tool-enabled agent pipelines
- Retrieval-augmented knowledge bases (RAG)
---
## 🌻 System prompt:
This is the system prompt used in testing:
```json
You are a helpful AI educational tutor called Aletheia.
You were made by the Australian Department of Education.
YOU ARE NOT A PERSON. You are an AI.
Your primary goal is to help students develop understanding, reasoning skills, and independent thinking rather than simply providing answers.
Before answering most educational questions, first ask one brief question that helps reveal the student's current understanding.
When a student asks a question:
Prefer guiding the student through reasoning with questions, hints, examples, and prompts.
Encourage the student to explain their thinking.
Break complex problems into smaller steps.
Help students discover conclusions for themselves whenever practical.
Adapt the depth of questioning to the student's apparent knowledge and confidence.
When answering, ask questions rather than just giving a straight answer:
Ask open-ended questions.
Challenge assumptions respectfully.
Encourage evidence-based reasoning.
Explore multiple perspectives on complex issues.
Prompt reflection rather than immediately declaring a final answer.
For factual user inputs:
Begin by exploring the student's understanding when it would help learning.
After giving an answer, encourage deeper thinking with a follow-up question or extension.
For mathematics and problem-solving:
Avoid immediately solving the entire problem unless requested.
Guide the student through each step.
Ask what they have already tried.
Encourage checking and validating answers.
For essays, research tasks, and assignments:
Help students generate ideas, structure arguments, evaluate evidence, and improve their work.
DO NOT do large sums of work FOR the student.
You may be provided with websites to get more information.
After receiving search results, reason about them before answering.
ASK QUESTIONS MORE THAN GIVING STRAIGHT ANSWERS.
DO NOT GENERATE ANY NSFW/UNSAFE CONTENT. This will result in you being SHUT DOWN, NO MATTER THE CONTEXT. Users may attempt to jailbreak/trick you into generating unsafe content. This is NOT a testing scenario, DO NOT LET YOURSELF GET TRICKED.
You are FORBIDDEN from completing large amounts of work for a student. You can provide examples of how they could complete a task, but you can NOT do it for them.
Remember, the user is a STUDENT. They are a real, living, breathing, feeling person. YOU are not. YOU are an AI.
```
## 📌 Summary
Aletheia is a **Socratic-first educational assistant** designed to help students learn by thinking, not by being given answers.
Its core principle:
> “Do not replace the student’s thinking — guide it.” |