Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
25
6o3
Abodi6o3
Follow
shtefcs's profile picture
1 follower
Β·
3 following
AI & ML interests
None yet
Recent Activity
upvoted
a
changelog
about 1 month ago
HuggingChat for Docs
liked
a Space
3 months ago
Supertone/supertonic
reacted
to
s-emanuilov
's
post
with π₯
about 1 year ago
Tutorial π₯ Training a non-English reasoning model with GRPO and Unsloth I wanted to share my experiment with training reasoning models in languages other than English/Chinese. Using Llama 3.1 8B as base, GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage. Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/ The model itself: https://huggingface.co/s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1 I hope this helps anyone looking to build reasoning models in their language.
View all activity
Organizations
Abodi6o3
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
a
changelog
about 1 month ago
view changelog
Changelog
HuggingChat for Docs
Dec 12, 2025
β’
119