MedGemma 1.5 4B INT4 Benchmark Project

This repository demonstrates:

  • NF4 quantization
  • Memory optimization
  • Benchmark comparison
  • GGUF compatibility
  • LiteRT conversion
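
As a rough illustration of the NF4 quantization listed above, the sketch below does blockwise absmax quantization against the 16-level NF4 codebook in plain Python. It is not this repository's code. The function names (quantize_block, dequantize_block, quantize, dequantize) are hypothetical, the block size of 64 is an assumption, and the codebook values are the commonly published NF4 levels reproduced for illustration. Production libraries additionally pack two 4-bit codes per byte and run on the GPU.

```python
# Commonly published NF4 levels: 16 values in [-1, 1] (illustrative copy).
NF4_LEVELS = [
    -1.0, -0.6961928009986877, -0.5250730514526367, -0.39491748809814453,
    -0.28444138169288635, -0.18477343022823334, -0.09105003625154495, 0.0,
    0.07958029955625534, 0.16093020141124725, 0.24611230194568634,
    0.33791524171829224, 0.44070982933044434, 0.5626170039176941,
    0.7229568362236023, 1.0,
]


def quantize_block(block):
    """Quantize one block of floats to (absmax, list of 4-bit codes)."""
    # Scale by the largest magnitude so every value lands in [-1, 1].
    absmax = max(abs(x) for x in block) or 1.0  # avoid dividing by zero
    codes = []
    for x in block:
        v = x / absmax
        # Pick the index of the nearest codebook level.
        codes.append(min(range(16), key=lambda i: abs(NF4_LEVELS[i] - v)))
    return absmax, codes


def dequantize_block(absmax, codes):
    """Map 4-bit codes back to approximate float values."""
    return [NF4_LEVELS[c] * absmax for c in codes]


def quantize(weights, block_size=64):
    """Split a flat list of weights into blocks and quantize each one."""
    return [
        quantize_block(weights[i:i + block_size])
        for i in range(0, len(weights), block_size)
    ]


def dequantize(blocks):
    """Reassemble a flat list of approximate weights from quantized blocks."""
    out = []
    for absmax, codes in blocks:
        out.extend(dequantize_block(absmax, codes))
    return out


if __name__ == "__main__":
    weights = [0.5, -1.0, 0.0, 0.25]
    blocks = quantize(weights, block_size=4)
    print(blocks)
    print(dequantize(blocks))
```

Each block stores one scale (absmax) plus one 4-bit code per weight, which is where the memory saving over FP16 comes from.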

Results

Metric     FP16        INT4
VRAM       ~8 GB       ~3 GB
Quality    Baseline    Near-identical
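
The VRAM figures are roughly what a back-of-the-envelope weight-size estimate would suggest. The sketch below is only that estimate, not a measurement. It assumes a nominal 4e9 parameters (the real count may differ), and that 4-bit storage keeps one 32-bit scale per 64-weight block, which adds 0.5 bits per weight. The helper name weight_memory_gb is hypothetical.

```python
def weight_memory_gb(n_params, bits_per_param):
    """Approximate weight storage in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


N_PARAMS = 4e9             # assumed nominal "4B" parameter count
BLOCK_SIZE = 64            # assumed quantization block size
SCALE_BITS = 32            # assumed one FP32 scale per block

fp16_gb = weight_memory_gb(N_PARAMS, 16)
int4_bits = 4 + SCALE_BITS / BLOCK_SIZE   # 4.5 bits per weight
int4_gb = weight_memory_gb(N_PARAMS, int4_bits)

if __name__ == "__main__":
    print(f"FP16 weights: {fp16_gb:.2f} GB")
    print(f"INT4 weights: {int4_gb:.2f} GB")
```

Under these assumptions the weights alone come to about 8 GB in FP16 and about 2.25 GB in 4-bit form. The gap up to the reported ~3 GB would plausibly be layers left unquantized, activations, and runtime overhead, though that breakdown is not stated in the source.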

Disclaimer

For research purposes only.
