matrixportalx
/

Gemma-3-4B-GGUF

Model card Files Files and versions

Gemma-3-4B GGUF Quantized Models

Technical Details

Quantization Tool: llama.cpp
Version: version: 5862 (704bb7a7)

Model Information

Base Model: Gunulhona/Gemma-3-4B
Quantized by: matrixportal

Available Files

🚀 Download	🔢 Type	📝 Description
Download	Q4 0	Standard 4-bit (fast on ARM)
Download	Q4 K M	4-bit balanced (recommended default)
Download	Q5 K M	5-bit best (recommended HQ option)

💡 Q4 K M provides the best balance for most use cases

Downloads last month: 20

GGUF

Model size

4B params

Architecture

gemma3

Hardware compatibility

Log In to add your hardware

4-bit

5-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for matrixportalx/Gemma-3-4B-GGUF

VIDraft/Gemma-3-R1984-4B

ZySec-AI/gemma-3-4b-document-writer

google/gemma-3-4b-it-qat-int4-unquantized

google/gemma-3-4b-it-qat-q4_0-unquantized

google/medgemma-4b-it

huihui-ai/gemma-3-4b-it-abliterated

neo4j/text-to-cypher-Gemma-3-4B-Instruct-2025.04.0

Merge model

this model