sebastavar committed on
Commit 9404600 · verified · 1 Parent(s): 7cbd15f

Update README.md

Files changed (1): README.md (+3 −4)
```diff
--- a/README.md
+++ b/README.md
@@ -5,6 +5,7 @@ colorFrom: purple
 colorTo: pink
 sdk: gradio
 pinned: false
+sdk_version: 5.42.0
 ---
 
 # Halley AI on Hugging Face
@@ -17,12 +18,10 @@ High-quality, Apple-Silicon–optimized **MLX** builds, tools, and evals — foc
 
 ## 🚀 Featured models
 
-| Repo | Bits / GS | Footprint | Notes |
+| Repo | Bits/GS | Footprint | Notes |
 |---|---:|---:|---|
 | **HalleyAI/gpt-oss-20b-MLX-4bit-gs32** | Q4 / 32 | ~13.1 GB | Best speed on 32 GB; near-baseline quality (+1.81% PPL vs 8-bit) |
 | **HalleyAI/gpt-oss-20b-MLX-6bit-gs32** | Q6 / 32 | ~18.4 GB | Near-Q8 fidelity (-0.51% PPL vs 8-bit) |
 | **Reference (8-bit)** | Q8 / 32 | — | Use upstream: `lmstudio-community/gpt-oss-20b-MLX-8bit` |
 
-> **Format:** MLX (not GGUF). For Linux/Windows or non-MLX stacks, use a GGUF build with llama.cpp.
-
-
+> **Format:** MLX (not GGUF). For Linux/Windows or non-MLX stacks, use a GGUF build with llama.cpp.
```