Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
pankajpandey-dev 
posted an update 3 days ago
Post
7686
🇮🇳 New in my Hindi LLM Series: Gemma-4 E4B, fine-tuned for Hindi — and it runs on your laptop's CPU.
I fine-tuned Google's new Gemma-4 E4B on ~10k Hindi instruction pairs (AI4Bharat: anudesh + dolly) using Unsloth + LoRA, on a single L4 GPU.
Then I ran an honest side-by-side eval: base Gemma-4 vs my fine-tune, across 25 Hindi prompts. The results were interesting 👇
✅ My fine-tune is more concise — ask for "3 tips" and it gives exactly 3. Base writes a 1,200-character essay.

✅ Pure native Hindi — base keeps slipping into English ("संतुलित आहार (Eat a Balanced Diet)", "तारा (Star)"). My fine-tune stays in clean Hindi.

✅ Tighter instruction-following — ask for a "short message" and it gives one, not a menu of options.
⚖️ And to be honest: base Gemma-4 is more detailed and comprehensive. I didn't build a "smarter" model — I built a focused, Hindi-native, edge-friendly one that runs as a 5GB GGUF (Q4) on CPU.
🔗 Try it:

Live demo (CPU): pankajpandey-dev/gemma-4-e4b-hindi-demo
GGUF (Ollama/llama.cpp): pankajpandey-dev/gemma-4-e4b-hindi-instruct-GGUF
16-bit model: pankajpandey-dev/gemma-4-e4b-hindi-instruct

Built with @unsloth · Data by @ai4bharat 🙏
#Hindi #LLM #Gemma #Unsloth #IndicNLP #GGUF

Aag the fire!

·

Fire in, energy out 🔥 → ⚡

This comment has been hidden (marked as Abuse)
This comment has been hidden (marked as Abuse)
This comment has been hidden (marked as Spam)
This comment has been hidden (marked as Spam)
This comment has been hidden (marked as Spam)
This comment has been hidden (marked as Graphic Content)
·
This comment has been hidden (marked as Spam)

I had done some similar work across hindi and its dialect . Very happy to see newer dataset and training in hindi language model.

You may check out this model and benchmark it was based on gemma 2 9b trained on T4 on a dataset of 100k english + hindi devanagri and latin translated datasets.
https://huggingface.co/divyanshukunwar/SASTRI_1_9B_GGUF

Training was done with unsloth more than a year ago. It took around 86 hours for complete one epoch training.

·

Thanks for sharing, Divyanshu! I appreciate it. It's encouraging to see more open-source work focused on Hindi and its dialects. I'll explore SASTRI and see how it performs on our evaluation benchmarks. If you have any recommended benchmark datasets or evaluation methodology, I'd love to hear about them