@pankajpandey-dev on Hugging Face: "🇮🇳 New in my Hindi LLM Series: Gemma-4 E4B, fine-tuned for Hindi

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

posted an update 3 days ago

Post

7686

🇮🇳 New in my Hindi LLM Series: Gemma-4 E4B, fine-tuned for Hindi — and it runs on your laptop's CPU.
I fine-tuned Google's new Gemma-4 E4B on ~10k Hindi instruction pairs (AI4Bharat: anudesh + dolly) using Unsloth + LoRA, on a single L4 GPU.
Then I ran an honest side-by-side eval: base Gemma-4 vs my fine-tune, across 25 Hindi prompts. The results were interesting 👇
✅ My fine-tune is more concise — ask for "3 tips" and it gives exactly 3. Base writes a 1,200-character essay.

✅ Pure native Hindi — base keeps slipping into English ("संतुलित आहार (Eat a Balanced Diet)", "तारा (Star)"). My fine-tune stays in clean Hindi.

✅ Tighter instruction-following — ask for a "short message" and it gives one, not a menu of options.
⚖️ And to be honest: base Gemma-4 is more detailed and comprehensive. I didn't build a "smarter" model — I built a focused, Hindi-native, edge-friendly one that runs as a 5GB GGUF (Q4) on CPU.
🔗 Try it:

Live demo (CPU): pankajpandey-dev/gemma-4-e4b-hindi-demo
GGUF (Ollama/llama.cpp): pankajpandey-dev/gemma-4-e4b-hindi-instruct-GGUF
16-bit model: pankajpandey-dev/gemma-4-e4b-hindi-instruct

Built with @unsloth · Data by @ai4bharat 🙏
#Hindi #LLM #Gemma #Unsloth #IndicNLP #GGUF

dronefreak

2 days ago

Aag the fire!

pankajpandey-dev

2 days ago

Fire in, energy out 🔥 → ⚡

Lounafa2

1 day ago

This comment has been hidden (marked as Abuse)

HuggingfacebrosLLM

1 day ago

This comment has been hidden (marked as Abuse)

HuggingfacebrosLLM

1 day ago

This comment has been hidden (marked as Spam)

HuggingfacebrosLLM

1 day ago

This comment has been hidden (marked as Spam)

Tralalabs

1 day ago

This comment has been hidden (marked as Spam)

cerealpotatochipssea

1 day ago

This comment has been hidden (marked as Graphic Content)

usermma

1 day ago

This comment has been hidden (marked as Spam)

divyanshukunwar

1 day ago

•

edited 1 day ago

I had done some similar work across hindi and its dialect . Very happy to see newer dataset and training in hindi language model.

You may check out this model and benchmark it was based on gemma 2 9b trained on T4 on a dataset of 100k english + hindi devanagri and latin translated datasets.
https://huggingface.co/divyanshukunwar/SASTRI_1_9B_GGUF

Training was done with unsloth more than a year ago. It took around 86 hours for complete one epoch training.

pankajpandey-dev

1 day ago

Thanks for sharing, Divyanshu! I appreciate it. It's encouraging to see more open-source work focused on Hindi and its dialects. I'll explore SASTRI and see how it performs on our evaluation benchmarks. If you have any recommended benchmark datasets or evaluation methodology, I'd love to hear about them

In this post