How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="andreiski/dialectic-8b")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM

tokenizer = AutoTokenizer.from_pretrained("andreiski/dialectic-8b")
model = AutoModelForMultimodalLM.from_pretrained("andreiski/dialectic-8b")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
Quick Links

Dialectic 8B (safetensors, bf16)

Fine-tuned Qwen3-8B for the Dialectic project. This is the full-precision (bf16) safetensors copy of the same weights served locally as GGUF at andreiski/dialectic-8b-gguf.

It exists so Apple-Silicon users can convert on-device with MLX:

mlx_lm.convert --hf-path andreiski/dialectic-8b -q --q-bits 8 --mlx-path ./dialectic-8b-mlx-8bit

Same weights as the GGUF Q8_0 / Q4_K_M files — use this for the MLX backend, the GGUF repo for the Ollama backend.

Downloads last month
14
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for andreiski/dialectic-8b

Finetuned
Qwen/Qwen3-8B
Finetuned
(1682)
this model
Quantizations
2 models