Instructions to use LibraxisAI/c4ai-command-a-03-2025-q5-mlx with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use LibraxisAI/c4ai-command-a-03-2025-q5-mlx with MLX:
# Make sure mlx-lm is installed # pip install --upgrade mlx-lm # Generate text with mlx-lm from mlx_lm import load, generate model, tokenizer = load("LibraxisAI/c4ai-command-a-03-2025-q5-mlx") prompt = "Write a story about Einstein" messages = [{"role": "user", "content": prompt}] prompt = tokenizer.apply_chat_template( messages, add_generation_prompt=True ) text = generate(model, tokenizer, prompt=prompt, verbose=True) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
- Pi new
How to use LibraxisAI/c4ai-command-a-03-2025-q5-mlx with Pi:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "LibraxisAI/c4ai-command-a-03-2025-q5-mlx"
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "mlx-lm": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "LibraxisAI/c4ai-command-a-03-2025-q5-mlx" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use LibraxisAI/c4ai-command-a-03-2025-q5-mlx with Hermes Agent:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "LibraxisAI/c4ai-command-a-03-2025-q5-mlx"
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default LibraxisAI/c4ai-command-a-03-2025-q5-mlx
Run Hermes
hermes
- MLX LM
How to use LibraxisAI/c4ai-command-a-03-2025-q5-mlx with MLX LM:
Generate or start a chat session
# Install MLX LM uv tool install mlx-lm # Interactive chat REPL mlx_lm.chat --model "LibraxisAI/c4ai-command-a-03-2025-q5-mlx"
Run an OpenAI-compatible server
# Install MLX LM uv tool install mlx-lm # Start the server mlx_lm.server --model "LibraxisAI/c4ai-command-a-03-2025-q5-mlx" # Calling the OpenAI-compatible server with curl curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "LibraxisAI/c4ai-command-a-03-2025-q5-mlx", "messages": [ {"role": "user", "content": "Hello"} ] }'
c4ai-command-a-03-2025-q5-mlx
c4ai-command-a-03-2025-q5-mlx is an MLX Q5 checkpoint derived from CohereLabs/c4ai-command-a-03-2025, intended for local text generation on Apple Silicon.
Intended use
- Local text generation and chat-style prompting on Apple Silicon
- MLX-LM experimentation with the declared upstream model family
- Offline or operator-controlled inference workflows
Out of scope
- Safety-critical decisions without domain expert review
- Claims of benchmark superiority not backed by published evaluation data
- Non-MLX runtime guarantees; this card documents the shipped HF checkpoint, not every possible serving stack
Training and conversion metadata
| Parameter | Value |
|---|---|
| Repository | LibraxisAI/c4ai-command-a-03-2025-q5-mlx |
| Base model | CohereLabs/c4ai-command-a-03-2025 |
| Task | text-generation |
| Library | mlx |
| Format | MLX / Apple Silicon checkpoint |
| Quantization | Q5 |
| Architecture | Cohere2ForCausalLM |
| Model files | 15 |
| Config model_type | cohere2 |
This card only reports metadata present in the Hugging Face repository, existing card frontmatter, or public config files. Missing benchmark, dataset, or training-run details are left explicit rather than reconstructed.
Tested inference path
Inference for this checkpoint has been tested with
LibraxisAI/mlx-batch-server.
This is the recommended tested path for operator-controlled local inference on Apple Silicon.
| Aspect | Status |
|---|---|
| Tested runtime | LibraxisAI/mlx-batch-server |
| Target hardware | Apple Silicon |
| Inference mode | Local / self-hosted |
| Hugging Face Hosted Inference | Disabled for this repository (inference: false) |
This does not claim compatibility with every possible serving stack. It documents the path that has been exercised for this published checkpoint.
Usage
CLI
pip install mlx-lm
mlx_lm.generate \
--model LibraxisAI/c4ai-command-a-03-2025-q5-mlx \
--prompt "Summarize the key signals in this document and list the next action items." \
--max-tokens 400
Python
from mlx_lm import load, generate
model, tokenizer = load("LibraxisAI/c4ai-command-a-03-2025-q5-mlx")
prompt = "Summarize the key signals in this document and list the next action items."
response = generate(model, tokenizer, prompt=prompt, max_tokens=400)
print(response)
Multi-turn with the chat template
This checkpoint follows the tokenizer/chat-template contract inherited from CohereLabs/c4ai-command-a-03-2025 when the
template is present in the repository:
from mlx_lm import load, generate
model, tokenizer = load("LibraxisAI/c4ai-command-a-03-2025-q5-mlx")
messages = [
{"role": "user", "content": "Summarize the key signals in this document and list the next action items."},
]
prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True, tokenize=False)
response = generate(model, tokenizer, prompt=prompt, max_tokens=400)
print(response)
Example output
No public sample output is currently declared for this checkpoint.
Quantization notes
| Aspect | Original/base checkpoint | This checkpoint |
|---|---|---|
| Lineage | CohereLabs/c4ai-command-a-03-2025 |
LibraxisAI/c4ai-command-a-03-2025-q5-mlx |
| Runtime target | Upstream runtime format | MLX on Apple Silicon |
| Quantization | Base precision or upstream-declared format | Q5 |
| Published quality delta | Not declared in public metadata | Not declared in public metadata |
Limitations
- No public benchmarks for this checkpoint are declared in the model metadata.
- No public benchmark claims are made by this card unless listed in the frontmatter.
- Validate outputs on your own domain data before relying on this checkpoint.
- Memory use and speed depend heavily on the exact Apple Silicon generation, unified-memory size, and prompt length.
License
cc-by-nc-4.0. Check the upstream/base model license as well when a base model is declared.
Citation
@misc{libraxisai-c4ai-command-a-03-2025-q5-mlx,
title = {c4ai-command-a-03-2025-q5-mlx},
author = {LibraxisAI},
year = {2026},
howpublished = {\url{https://huggingface.co/LibraxisAI/c4ai-command-a-03-2025-q5-mlx}},
note = {MLX checkpoint published by LibraxisAI}
}
𝚅𝚒𝚋𝚎𝚌𝚛𝚊𝚏𝚝𝚎𝚍. with AI Agents by VetCoders (c)2024-2026 LibraxisAI
- Downloads last month
- 5
5-bit
Model tree for LibraxisAI/c4ai-command-a-03-2025-q5-mlx
Base model
CohereLabs/c4ai-command-a-03-2025