--- license: apache-2.0 base_model: stepfun-ai/Step-3.5-Flash base_model_relation: quantized quantized_by: turboderp tags: - exl3 --- EXL3 quants of [Step-3.5-Flash](https://huggingface.co/stepfun-ai/Step-3.5-Flash) ⚠️ Requires ExLlamaV3 v0.0.23 (or v0.0.22 `dev` branch) Base bitrates: [2.00 bits per weight](https://huggingface.co/turboderp/Step-3.5-Flash-exl3/tree/2.00bpw) [3.00 bits per weight](https://huggingface.co/turboderp/Step-3.5-Flash-exl3/tree/3.00bpw) [4.00 bits per weight](https://huggingface.co/turboderp/Step-3.5-Flash-exl3/tree/4.00bpw) Optimized: [2.08 bits per weight](https://huggingface.co/turboderp/Step-3.5-Flash-exl3/tree/2.08bpw) [3.05 bits per weight](https://huggingface.co/turboderp/Step-3.5-Flash-exl3/tree/3.05bpw) *(more coming soon)* . | Ppl¹ | KL-div ---------|--------|--------- 2.00 bpw | 2.629 | 0.653 2.08 bpw | 2.154 | 0.466 3.00 bpw | 1.521 | 0.142 3.05 bpw | 1.478 | 0.118 4.00 bpw | 1.379 | 0.053 Original | 1.336 | ¹ (10 rows of wikitext2)