You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

c4ai-command-a-03-2025-q5-mlx

c4ai-command-a-03-2025-q5-mlx is an MLX Q5 checkpoint derived from CohereLabs/c4ai-command-a-03-2025, intended for local text generation on Apple Silicon.

Intended use

  • Local text generation and chat-style prompting on Apple Silicon
  • MLX-LM experimentation with the declared upstream model family
  • Offline or operator-controlled inference workflows

Out of scope

  • Safety-critical decisions without domain expert review
  • Claims of benchmark superiority not backed by published evaluation data
  • Non-MLX runtime guarantees; this card documents the shipped HF checkpoint, not every possible serving stack

Training and conversion metadata

Parameter Value
Repository LibraxisAI/c4ai-command-a-03-2025-q5-mlx
Base model CohereLabs/c4ai-command-a-03-2025
Task text-generation
Library mlx
Format MLX / Apple Silicon checkpoint
Quantization Q5
Architecture Cohere2ForCausalLM
Model files 15
Config model_type cohere2

This card only reports metadata present in the Hugging Face repository, existing card frontmatter, or public config files. Missing benchmark, dataset, or training-run details are left explicit rather than reconstructed.

Tested inference path

Inference for this checkpoint has been tested with LibraxisAI/mlx-batch-server.
This is the recommended tested path for operator-controlled local inference on Apple Silicon.

Aspect Status
Tested runtime LibraxisAI/mlx-batch-server
Target hardware Apple Silicon
Inference mode Local / self-hosted
Hugging Face Hosted Inference Disabled for this repository (inference: false)

This does not claim compatibility with every possible serving stack. It documents the path that has been exercised for this published checkpoint.

Usage

CLI

pip install mlx-lm

mlx_lm.generate \
  --model LibraxisAI/c4ai-command-a-03-2025-q5-mlx \
  --prompt "Summarize the key signals in this document and list the next action items." \
  --max-tokens 400

Python

from mlx_lm import load, generate

model, tokenizer = load("LibraxisAI/c4ai-command-a-03-2025-q5-mlx")

prompt = "Summarize the key signals in this document and list the next action items."
response = generate(model, tokenizer, prompt=prompt, max_tokens=400)
print(response)

Multi-turn with the chat template

This checkpoint follows the tokenizer/chat-template contract inherited from CohereLabs/c4ai-command-a-03-2025 when the template is present in the repository:

from mlx_lm import load, generate

model, tokenizer = load("LibraxisAI/c4ai-command-a-03-2025-q5-mlx")

messages = [
    {"role": "user", "content": "Summarize the key signals in this document and list the next action items."},
]
prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True, tokenize=False)
response = generate(model, tokenizer, prompt=prompt, max_tokens=400)
print(response)

Example output

No public sample output is currently declared for this checkpoint.

Quantization notes

Aspect Original/base checkpoint This checkpoint
Lineage CohereLabs/c4ai-command-a-03-2025 LibraxisAI/c4ai-command-a-03-2025-q5-mlx
Runtime target Upstream runtime format MLX on Apple Silicon
Quantization Base precision or upstream-declared format Q5
Published quality delta Not declared in public metadata Not declared in public metadata

Limitations

  • No public benchmarks for this checkpoint are declared in the model metadata.
  • No public benchmark claims are made by this card unless listed in the frontmatter.
  • Validate outputs on your own domain data before relying on this checkpoint.
  • Memory use and speed depend heavily on the exact Apple Silicon generation, unified-memory size, and prompt length.

License

cc-by-nc-4.0. Check the upstream/base model license as well when a base model is declared.

Citation

@misc{libraxisai-c4ai-command-a-03-2025-q5-mlx,
  title = {c4ai-command-a-03-2025-q5-mlx},
  author = {LibraxisAI},
  year = {2026},
  howpublished = {\url{https://huggingface.co/LibraxisAI/c4ai-command-a-03-2025-q5-mlx}},
  note = {MLX checkpoint published by LibraxisAI}
}

𝚅𝚒𝚋𝚎𝚌𝚛𝚊𝚏𝚝𝚎𝚍. with AI Agents by VetCoders (c)2024-2026 LibraxisAI

Downloads last month
5
MLX
Hardware compatibility
Log In to add your hardware

5-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for LibraxisAI/c4ai-command-a-03-2025-q5-mlx

Quantized
(29)
this model