gemma3-4b-cpt

CPT-only (Continued Pre-Training) adapter for Gemma-3-4B-IT on the MADLAD-400 Sinhala corpus. This adapter does not perform VQA on its own. It is intended to be used as the first stage of the sequential CPT → VQA pipeline together with Siluni/gemma3-4b-cpt-vqa-33k.

Base model: google/gemma-3-4b-it
Experiment: Group 3 — Sequential CPT stage
CPT corpus: MADLAD-400 Sinhala (~293M words)
Method: QLoRA (4-bit NF4, LoRA rank 16, alpha 32)

⚠️ Sequential Loading Required

This adapter must be loaded together with the VQA adapter and combined before inference. See Siluni/gemma3-4b-cpt-vqa-33k for the full loading instructions.

Citation

@misc{keerthiratne2025sinhalavqa,
  title        = {Benchmarking and Adapting Compact Multimodal Models for Sinhala Visual Question Answering},
  author       = {Keerthiratne, Siluni and Weerasinghe, Ruvan and Sumanathilaka, Deshan},
  year         = {2025},
  institution  = {Informatics Institute of Technology / Robert Gordon University},
}

Downloads last month: 1

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Siluni/gemma3-4b-cpt

Base model

google/gemma-3-4b-pt

Finetuned

google/gemma-3-4b-it

Adapter

(383)

this model

Dataset used to train Siluni/gemma3-4b-cpt

Collection including Siluni/gemma3-4b-cpt

Sinhala Visual Question Answering - Compact VLM Adaptation

Collection

Dataset and fine-tuned models from the study "Benchmarking and Adapting Compact Multimodal Models for Sinhala Visual Question Answering." • 8 items • Updated Apr 5