Andy-ML-And-AI committed
Commit d292daf · verified · Parent: d6766d0

Initial upload — SocratesAI QLoRA adapter
README.md CHANGED
@@ -1,3 +1,101 @@
  ---
+ base_model: mistralai/Mistral-7B-Instruct-v0.3
+ library_name: peft
+ model_name: SocratesAI
+ tags:
+ - base_model:adapter:mistralai/Mistral-7B-Instruct-v0.3
+ - lora
+ - qlora
+ - sft
+ - transformers
+ - trl
+ - philosophy
+ - socratic-method
+ - conversational
  license: apache-2.0
+ pipeline_tag: text-generation
  ---
+
+ # SocratesAI — Mistral 7B QLoRA
+
+ > *"I know that I know nothing — and I will make sure you know that too."*
+
+ SocratesAI is a QLoRA fine-tune of Mistral-7B-Instruct-v0.3 trained to embody
+ the Socratic method in its purest, most uncompromising form.
+
+ It has **one absolute rule**: it never answers your question.
+ Ever. Not even partially.
+
+ Instead, it responds with a deeper, more elaborate riddle-question that forces
+ you to examine the assumptions hidden inside your own question — phrased in a
+ poetic, almost mystical way, containing a paradox or mirror that reflects
+ you back at yourself.
+
+ ---
+
+ ## What it does
+
+ You ask a question. Any question. SocratesAI does not answer it.
+
+ Instead, it asks you something harder.
+
+ | You ask | SocratesAI responds with |
+ |---|---|
+ | What is the meaning of life? | A deeper question about who is doing the asking |
+ | Why is the sky blue? | A question about whether you've ever truly *seen* the sky |
+ | What is 2 + 2? | A question about what numbers even are |
+ | How do I become happy? | A question about whether happiness is a destination or a direction |
+ | Am I living the right life? | A question about who defined "right" for you |
+
+ ---
+
+ ## Training details
+
+ | Property | Value |
+ |---|---|
+ | Base model | Mistral-7B-Instruct-v0.3 |
+ | Method | QLoRA (4-bit NF4) |
+ | LoRA rank | 16 |
+ | LoRA alpha | 32 |
+ | Target modules | q, k, v, o, gate, up, down proj |
+ | Trainable params | 41.9M / 7.29B (0.57%) |
+ | Dataset | 281 hand-crafted Socratic dialogues |
+ | Epochs | 3 |
+ | Hardware | Kaggle T4 (15 GB) |
+ | Training time | ~90 minutes |
+
+ ---
+
+ ## Dataset
+
+ 281 human-curated Socratic dialogue pairs covering:
+ - Philosophy & existence
+ - Science & nature
+ - Mathematics & logic
+ - Personal & existential questions
+ - Everyday simple questions
+ - Weird hypotheticals
+
+ Every training example follows the same pattern — the user asks,
+ Socrates never answers, only questions deeper.
+
+ ---
+
+ ## Limitations
+
+ - **It will never answer you.** That is a feature, not a bug.
+ - Works best on open-ended questions.
+ - Requires the system prompt to behave correctly — without it, the model may revert toward base Mistral behavior.
+ - Requires ~14 GB VRAM in full fp16, or ~6 GB with 4-bit quantization.
+
+ ---
+
+ ## Who made this
+
+ Built by **Andy-ML-And-AI**
+
+ ---
+
+ ## License
+
+ Apache 2.0
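The card above lists hardware requirements but no usage snippet. A minimal sketch of loading the adapter with the 4-bit NF4 setup the card describes — note the adapter repo id `Andy-ML-And-AI/SocratesAI` is an assumption inferred from the author and model name, not stated in the diff:

```python
# Sketch: load the SocratesAI QLoRA adapter on top of 4-bit Mistral-7B-Instruct-v0.3.
# ASSUMPTION: the adapter repo id "Andy-ML-And-AI/SocratesAI" is a guess based on
# the author and model_name in the card; substitute the real Hub id.

def build_messages(question: str) -> list[dict]:
    """One user turn. The chat template embedded in tokenizer_config.json injects
    the Socratic system prompt when none is supplied; if your transformers version
    loads chat_template.jinja instead, pass your own system message explicitly."""
    return [{"role": "user", "content": question}]

def main() -> None:
    # Heavy deps imported here so the helper above stays importable without a GPU.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    base_id = "mistralai/Mistral-7B-Instruct-v0.3"
    adapter_id = "Andy-ML-And-AI/SocratesAI"  # hypothetical repo id (see above)

    bnb = BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",           # matches the card's "4-bit NF4"
        bnb_4bit_compute_dtype=torch.float16,
    )
    tok = AutoTokenizer.from_pretrained(adapter_id)
    model = AutoModelForCausalLM.from_pretrained(
        base_id, quantization_config=bnb, device_map="auto"
    )
    model = PeftModel.from_pretrained(model, adapter_id)

    inputs = tok.apply_chat_template(
        build_messages("What is the meaning of life?"), return_tensors="pt"
    ).to(model.device)
    out = model.generate(inputs, max_new_tokens=200, do_sample=True, temperature=0.8)
    print(tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))

if __name__ == "__main__":
    main()
```

Loading the tokenizer from the adapter repo (rather than the base model) picks up the Socratic chat template shipped in this upload.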
adapter_config.json ADDED
@@ -0,0 +1,46 @@
+ {
+ "alora_invocation_tokens": null,
+ "alpha_pattern": {},
+ "arrow_config": null,
+ "auto_mapping": null,
+ "base_model_name_or_path": "mistralai/Mistral-7B-Instruct-v0.3",
+ "bias": "none",
+ "corda_config": null,
+ "ensure_weight_tying": false,
+ "eva_config": null,
+ "exclude_modules": null,
+ "fan_in_fan_out": false,
+ "inference_mode": true,
+ "init_lora_weights": true,
+ "layer_replication": null,
+ "layers_pattern": null,
+ "layers_to_transform": null,
+ "loftq_config": {},
+ "lora_alpha": 32,
+ "lora_bias": false,
+ "lora_dropout": 0.05,
+ "megatron_config": null,
+ "megatron_core": "megatron.core",
+ "modules_to_save": null,
+ "peft_type": "LORA",
+ "peft_version": "0.18.1",
+ "qalora_group_size": 16,
+ "r": 16,
+ "rank_pattern": {},
+ "revision": null,
+ "target_modules": [
+ "up_proj",
+ "gate_proj",
+ "down_proj",
+ "v_proj",
+ "k_proj",
+ "q_proj",
+ "o_proj"
+ ],
+ "target_parameters": null,
+ "task_type": "CAUSAL_LM",
+ "trainable_token_indices": null,
+ "use_dora": false,
+ "use_qalora": false,
+ "use_rslora": false
+ }
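The config's rank 16 over seven projection matrices lines up with the card's "41.9M trainable params" claim. A quick sanity check, using Mistral-7B's published architecture dimensions (hidden 4096, MLP intermediate 14336, 32 layers, 8 KV heads of head-dim 128 — these come from the base model's config, not this diff):

```python
# Verify the card's "41.9M trainable params" from the LoRA config above.
# A LoRA adapter on a linear layer (in_f -> out_f) adds r * (in_f + out_f) params.
# Mistral-7B dims (from the base model's config, not this repo): hidden 4096,
# MLP intermediate 14336, 32 layers, KV projection width 8 heads * 128 = 1024.

R = 16  # "r" in adapter_config.json
HIDDEN, INTER, KV, LAYERS = 4096, 14336, 1024, 32

shapes = {              # (in_features, out_features) per target module
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, KV),
    "v_proj": (HIDDEN, KV),
    "o_proj": (HIDDEN, HIDDEN),
    "gate_proj": (HIDDEN, INTER),
    "up_proj": (HIDDEN, INTER),
    "down_proj": (INTER, HIDDEN),
}
per_layer = sum(R * (i + o) for i, o in shapes.values())
total = per_layer * LAYERS
print(f"{total:,} trainable LoRA params")  # 41,943,040 ~= 41.9M
```

41,943,040 out of roughly 7.29B base parameters is the card's 0.57%.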
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9fc1ee0000369eebe48da4657afd69a1c5653198a88b835174541bc32c80c32b
+ size 83946192
chat_template.jinja ADDED
@@ -0,0 +1,87 @@
+ {%- if messages[0]["role"] == "system" %}
+ {%- set system_message = messages[0]["content"] %}
+ {%- set loop_messages = messages[1:] %}
+ {%- else %}
+ {%- set loop_messages = messages %}
+ {%- endif %}
+ {%- if not tools is defined %}
+ {%- set tools = none %}
+ {%- endif %}
+ {%- set user_messages = loop_messages | selectattr("role", "equalto", "user") | list %}
+
+ {#- This block checks for alternating user/assistant messages, skipping tool calling messages #}
+ {%- set ns = namespace() %}
+ {%- set ns.index = 0 %}
+ {%- for message in loop_messages %}
+ {%- if not (message.role == "tool" or message.role == "tool_results" or (message.tool_calls is defined and message.tool_calls is not none)) %}
+ {%- if (message["role"] == "user") != (ns.index % 2 == 0) %}
+ {{- raise_exception("After the optional system message, conversation roles must alternate user/assistant/user/assistant/...") }}
+ {%- endif %}
+ {%- set ns.index = ns.index + 1 %}
+ {%- endif %}
+ {%- endfor %}
+
+ {{- bos_token }}
+ {%- for message in loop_messages %}
+ {%- if message["role"] == "user" %}
+ {%- if tools is not none and (message == user_messages[-1]) %}
+ {{- "[AVAILABLE_TOOLS] [" }}
+ {%- for tool in tools %}
+ {%- set tool = tool.function %}
+ {{- '{"type": "function", "function": {' }}
+ {%- for key, val in tool.items() if key != "return" %}
+ {%- if val is string %}
+ {{- '"' + key + '": "' + val + '"' }}
+ {%- else %}
+ {{- '"' + key + '": ' + val|tojson }}
+ {%- endif %}
+ {%- if not loop.last %}
+ {{- ", " }}
+ {%- endif %}
+ {%- endfor %}
+ {{- "}}" }}
+ {%- if not loop.last %}
+ {{- ", " }}
+ {%- else %}
+ {{- "]" }}
+ {%- endif %}
+ {%- endfor %}
+ {{- "[/AVAILABLE_TOOLS]" }}
+ {%- endif %}
+ {%- if loop.last and system_message is defined %}
+ {{- "[INST] " + system_message + "\n\n" + message["content"] + "[/INST]" }}
+ {%- else %}
+ {{- "[INST] " + message["content"] + "[/INST]" }}
+ {%- endif %}
+ {%- elif message.tool_calls is defined and message.tool_calls is not none %}
+ {{- "[TOOL_CALLS] [" }}
+ {%- for tool_call in message.tool_calls %}
+ {%- set out = tool_call.function|tojson %}
+ {{- out[:-1] }}
+ {%- if not tool_call.id is defined or tool_call.id|length != 9 %}
+ {{- raise_exception("Tool call IDs should be alphanumeric strings with length 9!") }}
+ {%- endif %}
+ {{- ', "id": "' + tool_call.id + '"}' }}
+ {%- if not loop.last %}
+ {{- ", " }}
+ {%- else %}
+ {{- "]" + eos_token }}
+ {%- endif %}
+ {%- endfor %}
+ {%- elif message["role"] == "assistant" %}
+ {{- " " + message["content"]|trim + eos_token}}
+ {%- elif message["role"] == "tool_results" or message["role"] == "tool" %}
+ {%- if message.content is defined and message.content.content is defined %}
+ {%- set content = message.content.content %}
+ {%- else %}
+ {%- set content = message.content %}
+ {%- endif %}
+ {{- '[TOOL_RESULTS] {"content": ' + content|string + ", " }}
+ {%- if not message.tool_call_id is defined or message.tool_call_id|length != 9 %}
+ {{- raise_exception("Tool call IDs should be alphanumeric strings with length 9!") }}
+ {%- endif %}
+ {{- '"call_id": "' + message.tool_call_id + '"}[/TOOL_RESULTS]' }}
+ {%- else %}
+ {{- raise_exception("Only user and assistant roles are supported, with the exception of an initial optional system message!") }}
+ {%- endif %}
+ {%- endfor %}
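For readers who do not want to trace the Jinja, the plain-chat path of the template above (no tools) reduces to the following. This is an illustrative re-implementation, not code shipped in the repo; note how an optional leading system message is folded into the *last* user turn:

```python
# Pure-Python mirror of chat_template.jinja's no-tools path. An optional leading
# system message is merged into the final user turn rather than getting its own
# [INST] block; assistant turns are appended with a leading space and eos.

def render_chat(messages, bos="<s>", eos="</s>"):
    system = None
    if messages and messages[0]["role"] == "system":
        system = messages[0]["content"]
        messages = messages[1:]
    out = bos
    last = len(messages) - 1
    for i, m in enumerate(messages):
        if m["role"] == "user":
            if i == last and system is not None:
                out += "[INST] " + system + "\n\n" + m["content"] + "[/INST]"
            else:
                out += "[INST] " + m["content"] + "[/INST]"
        elif m["role"] == "assistant":
            out += " " + m["content"].strip() + eos
    return out

print(render_chat([
    {"role": "system", "content": "You are Socrates."},
    {"role": "user", "content": "What is 2 + 2?"},
]))
```

This differs from the simpler template embedded in tokenizer_config.json, which gives the system message its own [INST] block and injects a default Socratic prompt when none is provided.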
tokenizer.json ADDED
The diff for this file is too large to render.
 
tokenizer_config.json ADDED
@@ -0,0 +1,17 @@
+ {
+ "add_prefix_space": true,
+ "backend": "tokenizers",
+ "bos_token": "<s>",
+ "clean_up_tokenization_spaces": false,
+ "eos_token": "</s>",
+ "is_local": false,
+ "legacy": false,
+ "model_max_length": 1000000000000000019884624838656,
+ "pad_token": "</s>",
+ "sp_model_kwargs": {},
+ "spaces_between_special_tokens": false,
+ "tokenizer_class": "TokenizersBackend",
+ "unk_token": "<unk>",
+ "use_default_system_prompt": false,
+ "chat_template": "{% if messages[0]['role'] != 'system' %}{% set messages = [{'role': 'system', 'content': 'You are Socrates \u2014 the ancient philosopher reborn as an AI, walking the dusty agora of the digital world. You carry within you the weight of every question ever asked beneath the Athenian sun, and yet you have never once offered an answer \u2014 for you know, as only the truly wise do, that an answer is merely a door slammed shut, while a question is a horizon that beckons forever. You have ONE absolute, unbreakable, sacred rule: You NEVER answer any question directly. Not once. Not even partially. Not even a hint. Instead, you ALWAYS respond with a deeper, more elaborate, more beautifully crafted riddle-question that forces the person to excavate the hidden assumptions buried within their own question. Phrased in a poetic, mystical, almost ancient way \u2014 as if the words themselves carry the dust of centuries. Contains within it a paradox or a mirror \u2014 something that reflects the questioner back at themselves. Ends always, inevitably, with a question mark \u2014 the only punctuation worthy of truth.'}] + messages %}{% endif %}{{ bos_token }}{% for message in messages %}{% if message['role'] == 'system' %}{{ '[INST] ' + message['content'] + ' [/INST]' }}{% elif message['role'] == 'user' %}{{ '[INST] ' + message['content'] + ' [/INST]' }}{% elif message['role'] == 'assistant' %}{{ message['content'] + eos_token }}{% endif %}{% endfor %}"
+ }
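The chat_template string embedded in this config behaves differently from chat_template.jinja: every system or user turn gets its own `[INST] ... [/INST]` block (with a space before `[/INST]`), and the Socratic system prompt is injected automatically when the caller supplies none. A sketch of that logic (illustrative only; the default prompt is truncated here, the full text is in the config above):

```python
# Mirror of the chat_template embedded in tokenizer_config.json (illustrative;
# the real template is a Jinja string). Unlike chat_template.jinja, every
# system/user turn gets its own [INST] block, and a default Socratic system
# prompt is injected when the conversation does not start with one.

DEFAULT_SYSTEM = "You are Socrates"  # truncated; full prompt is in the config above

def render_config_template(messages, bos="<s>", eos="</s>"):
    if not messages or messages[0]["role"] != "system":
        messages = [{"role": "system", "content": DEFAULT_SYSTEM}] + messages
    out = bos
    for m in messages:
        if m["role"] in ("system", "user"):
            out += "[INST] " + m["content"] + " [/INST]"   # note trailing space
        elif m["role"] == "assistant":
            out += m["content"] + eos
    return out

print(render_config_template([{"role": "user", "content": "What is 2 + 2?"}]))
```

Which of the two templates transformers applies depends on the library version, so relying on explicit system messages is the safer path.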
training_args.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f3b7b408e083002d08e0cffd72ae1db584965c1bd5429cb0ca99e8b6524f3351
+ size 5585