Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
6o3's picture
1 25

6o3

Abodi6o3
shtefcs's profile picture
Β·

AI & ML interests

None yet

Recent Activity

upvoted a changelog about 1 month ago
HuggingChat for Docs
liked a Space 3 months ago
Supertone/supertonic
reacted to s-emanuilov's post with πŸ”₯ about 1 year ago
Tutorial πŸ’₯ Training a non-English reasoning model with GRPO and Unsloth I wanted to share my experiment with training reasoning models in languages other than English/Chinese. Using Llama 3.1 8B as base, GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage. Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/ The model itself: https://huggingface.co/s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1 I hope this helps anyone looking to build reasoning models in their language.
View all activity

Organizations

MLX Community's profile picture

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs