---
language:
- zh
- en
- fr
- de
- ja
- ko
- it
- fi
license: apache-2.0
tags:
- qwen3
- mlx
pipeline_tag: text-generation
base_model: OpenBuddy/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT
library_name: mlx
---
# WaveCut/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT_MLX-4bit

This model [WaveCut/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT_MLX-4bit](https://huggingface.co/WaveCut/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT_MLX-4bit) was converted to MLX format from [OpenBuddy/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT](https://huggingface.co/OpenBuddy/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT) using mlx-lm version **0.25.2**.
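For reference, a conversion like this one can be reproduced with the `mlx_lm.convert` command that ships with mlx-lm. A minimal sketch, assuming the default 4-bit quantization settings (the output directory name is illustrative):

```bash
# Quantize the source Hugging Face checkpoint to 4-bit MLX weights.
# --hf-path is the source repo; -q enables quantization (4-bit by default).
mlx_lm.convert \
    --hf-path OpenBuddy/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT \
    -q \
    --mlx-path OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT_MLX-4bit
```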
## Use with mlx
```bash
pip install mlx-lm
```
```python
from mlx_lm import load, generate

model, tokenizer = load("WaveCut/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT_MLX-4bit")

prompt = "hello"

if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)
```
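The model can also be run directly from the command line with the `mlx_lm.generate` CLI. A minimal sketch (the prompt text is illustrative):

```bash
# Generate a completion from the 4-bit model without writing any Python.
mlx_lm.generate \
    --model WaveCut/OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT_MLX-4bit \
    --prompt "hello"
```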