Gemma-3-4B GGUF Quantized Models
Technical Details
- Quantization Tool: llama.cpp
- Version: version: 5862 (704bb7a7)
Model Information
- Base Model: Gunulhona/Gemma-3-4B
- Quantized by: matrixportal
Available Files
| ๐ Download | ๐ข Type | ๐ Description |
|---|---|---|
| Download | Q4 0 | Standard 4-bit (fast on ARM) |
| Download | Q4 K M | 4-bit balanced (recommended default) |
| Download | Q5 K M | 5-bit best (recommended HQ option) |
๐ก Q4 K M provides the best balance for most use cases
- Downloads last month
- 20
Hardware compatibility
Log In to add your hardware
4-bit
5-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support