You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Gemma 4 E4B Resilient Vision

This model is a compressed version of Gemma 4 optimized for the Resilient AI Challenge 2026 Image-to-Text category.

Optimization & Inference

  • Inference Engine: vLLM (v0.20.2)
  • Compression: Native vLLM default optimization and memory management.
  • Precision: 4-bit (via vLLM handshake).

Evaluation Setup

The model is served via vLLM with challenge-mandated parameters:

  • Temperature: 1.0
  • Top-P: 0.95
  • Top-K: 64

Evaluation


# python bench_vision.py 
================================================================================
GEMMA 4 E4B β€” vLLM HANDSHAKE EVALUATION
================================================================================
πŸ” Determinism Locked | Seed: 123

πŸ“¦ Handshaking with vLLM: http://127.0.0.1:8000/v1/chat/completions
βœ“ TOTAL SYSTEM RAM: 3.33 GB

============================================================
πŸ“Έ [1/3] Bee on Flower
============================================================
  πŸ“ Generated: Based on the image provided, here is a precise description:

**Image Description:**
This is a close-up, macro photograph...
  ⏱️  RTF: 0.0800 s/w | Words: 115 | Throughput: 12.5 w/s
  πŸ”‹ Energy: 450.71 J | Power: 49.0W
  πŸ’»  RAM: 3.46 GB | Score: 1.000

============================================================
πŸ“Έ  [2/3] Wisconsin Boardwalk
============================================================
  πŸ“  Generated: Based on the image provided, there are **no identifiable individuals** present.

**Precise Image Description:**

This is...
  ⏱️  RTF: 0.0536 s/w | Words: 117 | Throughput: 18.7 w/s
  πŸ”‹  Energy: 429.54 J | Power: 68.5W
  πŸ’»  RAM: 3.46 GB | Score: 1.000

============================================================
πŸ“Έ  [3/3] Turing Award Winners
============================================================
  πŸ“  Generated: This image is a composite featuring three prominent figures in the field of Artificial Intelligence (AI) and Machine Lea...
  ⏱️  RTF: 0.0585 s/w | Words: 106 | Throughput: 17.1 w/s
  πŸ”‹  Energy: 436.13 J | Power: 70.3W
  πŸ’»  RAM: 3.46 GB | Score: 1.000

================================================================================
πŸ“Š FINAL CHALLENGE AUDIT β€” GEMMA 4 E4B (vLLM)
================================================================================
  Average RAM:      3.46 GB
  Average RTF:      0.0640 s/w
  Avg Throughput:   16.1 words/sec
  Total Energy:     1316.38 J
  Total CO2e:       0.000365 kg
  Avg Quality:      1.000

πŸ” CHALLENGE TARGETS:
RAM < 4GB:    βœ… PASS (3.46 GB)
RTF < 1.0:    βœ… PASS (0.0640)
Quality > 80%:βœ… PASS (1.000)

πŸŽ‰ AUDIT COMPLETE β€” ALL TARGETS ACHIEVED!

License

Original Google Gemma License.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for frankmorales2020/gemma-4-e4b-resilient-vision

Finetuned
(195)
this model