Halley AI

company

Verified

https://halleyai.ai/

AI & ML interests

Text Generation & Chat Assistants; Model Compression & Quantization (Q4/Q6/Q8, gs32); Inference & Serving (on-prem, low-latency); RAG / Retrieval; Agents & Tool Use; Distillation / LoRA / Fine-tuning

halley-ai 's models 9

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-5bit-gs32

Text Generation • 80B • Updated Sep 19, 2025 • 23 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-6bit-gs64

Text Generation • 80B • Updated Sep 19, 2025 • 17 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-4bit-gs64

Text Generation • 80B • Updated Sep 19, 2025 • 29 • 1

halley-ai/gpt-oss-120b-MLX-bf16

Text Generation • 117B • Updated Sep 8, 2025 • 225 • 3

halley-ai/gpt-oss-120b-MLX-8bit-gs32

Text Generation • 117B • Updated Sep 8, 2025 • 40 • 1

halley-ai/gpt-oss-120b-MLX-6bit-gs64

Text Generation • 117B • Updated Sep 8, 2025 • 75 • 1

halley-ai/gpt-oss-20b-MLX-5bit-gs32

Text Generation • 21B • Updated Sep 8, 2025 • 39 • 1

halley-ai/gpt-oss-20b-MLX-6bit-gs32

Text Generation • 21B • Updated Aug 18, 2025 • 52 • 1

halley-ai/gpt-oss-20b-MLX-4bit-gs32

Text Generation • 21B • Updated Aug 18, 2025 • 137 • 3