translategemma-4b-it-q5_k_m.gguf

This repo contains GGUF weights for google/translategemma-4b-it.

Quantization Details

  • Method: llama-quantize
  • Llama.cpp Version: 7770 (fe44d3557)
  • Original Model Precision: BF16
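
For reference, GGUF weights like these are typically produced in two steps with llama.cpp's own tooling: convert the original checkpoint to a BF16 GGUF, then quantize it. A sketch, assuming the base model has already been downloaded to a local directory (all paths are illustrative):

```shell
# Convert the locally downloaded BF16 checkpoint to a GGUF file
# (convert_hf_to_gguf.py ships with llama.cpp and expects a local model directory).
python convert_hf_to_gguf.py ./translategemma-4b-it \
    --outfile translategemma-4b-it-bf16.gguf --outtype bf16

# Quantize the BF16 GGUF down to Q5_K_M.
./llama-quantize translategemma-4b-it-bf16.gguf \
    translategemma-4b-it-q5_k_m.gguf Q5_K_M
```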

Files Provided

| File | Quant Method | Size | Description |
|------|--------------|------|-------------|
| translategemma-4b-it-q5_k_m.gguf | Q5_K_M | 3.2 GB | High quality, recommended for most uses. |
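
If you have the Hugging Face CLI installed, the quantized file can be fetched directly from this repo (repo id taken from this page):

```shell
# Download only the Q5_K_M file into the current directory.
huggingface-cli download mixer3d/translategemma-4b-it-gguf \
    translategemma-4b-it-q5_k_m.gguf --local-dir .
```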

Usage

You can run this model with llama.cpp:

./llama-server -m translategemma-4b-it-q5_k_m.gguf --no-mmap -ngl 99 --port 8080 -c 8192 -fa 1 --jinja
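
Once llama-server is running, it exposes an OpenAI-compatible HTTP API on the configured port. A minimal chat request against the command above (the prompt is only an example):

```shell
# Send a chat completion request to the local llama-server instance.
curl http://localhost:8080/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
          "messages": [
            {"role": "user", "content": "Translate to German: The weather is nice today."}
          ],
          "temperature": 0.2
        }'
```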
Model Details

  • Architecture: gemma3
  • Parameters: 4B
  • Quantization: 5-bit (Q5_K_M)

Model tree

  • Base model: google/translategemma-4b-it
  • This repo: mixer3d/translategemma-4b-it-gguf