ankitjakhar
/

medgemma-4bit-quantized

Model card Files Files and versions

MedGemma 1.5 4B INT4 Benchmark Project

This repository demonstrates:

NF4 quantization
Memory optimization
Benchmark comparison
GGUF compatibility
LiteRT conversion

Results

Metric	FP16	INT4
VRAM	~8GB	~3GB
Quality	Baseline	Near-identical

Disclaimer

Research purposes only.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ankitjakhar/medgemma-4bit-quantized

Base model

google/medgemma-1.5-4b-it

Finetuned

(75)

this model