Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
6o3's picture
1 25

6o3

Abodi6o3
shtefcs's profile picture
Β·

AI & ML interests

None yet

Recent Activity

upvoted a changelog about 1 month ago
HuggingChat for Docs
liked a Space 3 months ago
Supertone/supertonic
reacted to s-emanuilov's post with πŸ”₯ about 1 year ago
Tutorial πŸ’₯ Training a non-English reasoning model with GRPO and Unsloth I wanted to share my experiment with training reasoning models in languages other than English/Chinese. Using Llama 3.1 8B as base, GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage. Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/ The model itself: https://huggingface.co/s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1 I hope this helps anyone looking to build reasoning models in their language.
View all activity

Organizations

MLX Community's profile picture

Abodi6o3 's datasets

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs