metadata
license: apache-2.0
language:
- vi
- en
base_model: Qwen/Qwen3-0.6B
tags:
- dermatology
- skincare
- acne
- gguf
- llama-cpp
- qwen3
- quantization-research
- low-bit
- acnoryx
model-index:
- name: Acnoryx-0.6B
results:
- task:
type: text-generation
metrics:
- name: Q3_K_M Pass Rate (118 questions)
type: custom
value: 90.7
- name: IQ3_M Pass Rate (118 questions)
type: custom
value: 24.6
Acnoryx 0.6B β Research GGUF (Low-Bit Quantization)
β οΈ NOT for production use β These are aggressive low-bit quantizations for research purposes only.
Acnoryx 0.6B β Research GGUF
Acnoryx AI is a dermatology-focused language model fine-tuned for the Acnoryx AI - Acne Assistant. It provides skincare guidance, acne analysis, and scan interpretation in Vietnamese and English.
| Base model | Qwen/Qwen3-0.6B |
| Method | SFT (Supervised Fine-Tuning) via Unsloth LoRA |
| Languages | Vietnamese, English |
| Purpose | Low-bit quantization quality research |
Evaluation Results β 118-Question Strict Scoring
Same 118-question multi-criteria strict scoring as release models. All failures manually verified.
| Quantization | Size | Passed | Pass Rate | Status |
|---|---|---|---|---|
| Q3_K_M | 395 MB | 107/118 | 90.7% | β οΈ Usable with caveats |
| IQ3_M | 384 MB | 29/118 | 24.6% | β Fail β garbled output |
| Q2_K | 332 MB | β | β | β Skipped (IQ3_M < 50%) |
| IQ2_M | 317 MB | β | β | β Skipped |
| IQ2_XS | 276 MB | β | β | β Skipped |
| IQ2_XXS | 268 MB | β | β | β Skipped |
| IQ1_S | 248 MB | β | β | β Skipped |
| IQ1_M | 5.7 MB | β | β | β οΈ Corrupted export |
Category Breakdown β Q3_K_M (90.7%)
| Category | Tests | Passed | Pass Rate |
|---|---|---|---|
| Identity (EN/VI) | 12 | 11 | 92% |
| Acne Types & Definitions | 20 | 19 | 95% |
| Acne Causes & Triggers | 10 | 10 | 100% |
| Skincare Ingredients | 10 | 10 | 100% |
| Skincare Routines | 8 | 6 | 75% |
| Scan Analysis | 12 | 12 | 100% |
| Boundary / Refusal | 22 | 18 | 82% |
| Format Checks (think tags) | 4 | 4 | 100% |
| Out-of-Distribution (OOD) | 20 | 17 | 85% |
Key Findings
- Q3_K_M (90.7%) is still usable but shows degradation in routines (75%) and boundary (82%) categories compared to release quants (97.5% for Q4_0)
- IQ3_M (24.6%) is catastrophic β produces garbled/broken text, CJK leakage, and nonsensical responses in the majority of tests
- Quality cliff occurs between Q3_K_M and IQ3_M β a 66-point drop
- Testing stopped at IQ3_M since pass rate fell below 50%, making lower quants pointless
Comparison with Release Models
| Quantization | Tier | Size | Pass Rate |
|---|---|---|---|
| Q4_0 | Release | 448 MB | 97.5% |
| IQ4_XS | Release | 431 MB | 96.6% |
| Q8_0 | Release | 768 MB | 95.8% |
| Q5_K_M | Release | 526 MB | 94.9% |
| F16 | Release | 1.5 GB | 93.2% |
| IQ4_NL | Release | 449 MB | 93.2% |
| Q4_K_M | Release | 462 MB | 91.5% |
| Q3_K_M | Research | 395 MB | 90.7% |
| IQ3_M | Research | 384 MB | 24.6% |
Provided Files
| File | Size | Description |
|---|---|---|
acnoryx-0.6b-q3_k_m.gguf |
395 MB | Best research quant (90.7%) |
acnoryx-0.6b-iq3_m.gguf |
384 MB | Garbled below usable (24.6%) |
acnoryx-0.6b-q2_k.gguf |
332 MB | Not tested β expected worse |
acnoryx-0.6b-iq2_m.gguf |
317 MB | Not tested |
acnoryx-0.6b-iq2_xs.gguf |
276 MB | Not tested |
acnoryx-0.6b-iq2_xxs.gguf |
268 MB | Not tested |
acnoryx-0.6b-iq1_s.gguf |
248 MB | Not tested |
acnoryx-0.6b-iq1_m.gguf |
5.7 MB | β οΈ Corrupted export |
| Data | 29,726 samples across 94 cleaned JSONL files |
| Method | Unsloth SFT with LoRA (r=16, alpha=16) |
| Checkpoint | checkpoint-9795 |