Hanna Yukhymenko's picture

In a Training Loop 🔄

Hanna Yukhymenko PRO

hannayukhymenko

swiss-ai

·

https://ayukh.com

AI & ML interests

post-training/multilinguality @swiss-ai | ex-🤗

Recent Activity

upvoted an article 4 days ago

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

liked a model 2 months ago

INSAIT-Institute/BgGPT-Gemma-3-4B-IT

liked a model 2 months ago

INSAIT-Institute/BgGPT-Gemma-3-12B-IT

View all activity

Organizations

upvoted an article 4 days ago

Article

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

+6

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego

•

6 days ago

• 36

upvoted a collection 2 months ago

BgGPT-Gemma-3

9 items • Updated Mar 26 • 7

upvoted a paper 3 months ago

Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation

Paper • 2511.17290 • Published Nov 21, 2025 • 1

upvoted 2 collections 3 months ago

🇪🇪 Estonian LLM Evaluation

A collection of resources for evaluation of LLM capabilities in the Estonian language. • 33 items • Updated Dec 13, 2025 • 5

Multilingual Benchmarks

Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets (ACL 2026) • 29 items • Updated Apr 11 • 3

upvoted a paper 3 months ago

Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

Paper • 2602.22207 • Published Feb 25 • 44

upvoted an article 4 months ago

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

+5

burtenshaw, SaylorTwift, kramp, merve, davanstrien, nielsr, julien-c

•

Feb 4

• 90

upvoted a paper 4 months ago

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Paper • 2602.01058 • Published Feb 1 • 45

upvoted a collection 5 months ago

awesome-türkçe-veri

39 items • Updated Jan 6 • 4

upvoted a collection 8 months ago

Jupyter Agent

Blog: https://huggingface.co/blog/jupyter-agent-2 • 4 items • Updated Sep 12, 2025 • 3

upvoted a paper 8 months ago

Making, not Taking, the Best of N

Paper • 2510.00931 • Published Oct 1, 2025 • 11

upvoted a collection 8 months ago

MamayLM-v1.0-Gemma-3

First Open and Multimodal Ukrainian-focused LLM • 5 items • Updated Oct 8, 2025 • 21

upvoted an article 8 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

+3

A-Mahla, merve, sergiopaniego, reach-vb, lewtun

•

Sep 23, 2025

• 138

upvoted 3 articles 9 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

+5

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez

•

Sep 11, 2025

• 188

Article

Jupyter Agents: training LLMs to reason with notebooks

+1

baptistecolle, hannayukhymenko, lvwerra

•

Sep 10, 2025

• 65

Article

mmBERT: ModernBERT goes Multilingual

+4

mmarone, orionweller, will-fleshman, eugene-yang, dlawrie, vandurme

•

Sep 9, 2025

• 146

upvoted a collection 9 months ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1, 2025 • 350

upvoted 2 articles 10 months ago

Article

Announcing UA-Code-Bench: a New Benchmark for Evaluating LLMs on Competitive Programming Tasks in Ukrainian

anon-researcher-ua

•

Jul 12, 2025

• 2

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

+3

smohammadi, siro1, winglian, marcsun13, djsaunde

•

Aug 8, 2025

• 98

upvoted an article 11 months ago

Article

What is the Hugging Face Community Building?

evijit

•

Jul 15, 2025

• 19