🔄 In a Training Loop

Stefano Fiorucci PRO

anakin87

AI & ML interests

Language Models: orchestration, post-training, GRPO, synthetic data... Contributing to Haystack LLM framework 🏗️

Recent Activity

liked a model about 2 hours ago

Qwen/Qwen-AgentWorld-35B-A3B

liked a model about 24 hours ago

clark-labs/clark-air-sana-1.6b-1.58bit

liked a model 1 day ago

LiquidAI/LFM2.5-230M

View all activity

Organizations

upvoted an article 3 days ago

Article

Shipping huggingface_hub every week with AI, open tools, and a human in the loop

Wauplin, celinah

•

4 days ago

• 18

upvoted a collection 21 days ago

ClaimExtractor-2605

Collection

Extract claims and intents from conversations • 7 items • Updated 12 days ago • 8

upvoted 2 articles about 1 month ago

Article

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

sergiopaniego, ariG23498

•

May 25

• 122

Article

Introducing the Ettin Reranker Family

tomaarsen

•

May 19

• 52

upvoted an article 2 months ago

Article

ML Intern Takes Our Post-Training Internship Test

cmpatino

•

Apr 23

• 31

upvoted an article 3 months ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 9

• 62

upvoted a collection 3 months ago

LFM2 2.6B Mr. Tic Tac Toe ❌ ⭕

Collection

Dataset and models for transforming LFM2 2.6B into a Tic Tac Toe master using RL Environments. Free course: https://t.ly/4jIFq • 8 items • Updated Apr 8 • 2

upvoted an article 3 months ago

Article

Training mRNA Language Models Across 25 Species for $165

OpenMed

•

Mar 31

• 28

upvoted a collection 3 months ago

Gemma 4

Collection

15 items • Updated 16 days ago • 992

upvoted an article 3 months ago

Article

TRL v1.0: Post-Training Library Built to Move with the Field

qgallouedec, stevhliu, pcuenq, sergiopaniego

•

Mar 31

• 57

upvoted a paper 3 months ago

Same Meaning, Different Scores: Lexical and Syntactic Sensitivity in LLM Evaluation

Paper • 2602.17316 • Published Feb 19 • 2

upvoted 2 collections 3 months ago

Zagreus - Nesso fine tuned

Collection

The collection contains three bilingual English/Italian SLMs post-trained on Zagreus-0.4B-ita: instruct, agentic, and a fully open-source • 3 items • Updated Mar 4 • 3

Zagreus 0.4B

Collection

The Zagreus-0.4B collection contains four bilingual English + Romance language foundational SLMs (~400M parameters) trained from scratch • 4 items • Updated Mar 4 • 7

upvoted a paper 3 months ago

Transformer Layers as Painters

Paper • 2407.09298 • Published Jul 12, 2024 • 16

upvoted an article 4 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 165

upvoted a collection 4 months ago

Qwen3.5-text-only

Collection

Text-only versions of Qwen-3.5 without the vision encoders for a smaller memory and storage footprint. • 4 items • Updated 21 days ago • 15

upvoted 2 articles 4 months ago

Article

The ML Engineer's Guide to Protein AI

MaziyarPanahi

•

Mar 3

• 34

Article

Bringing Autonomous Driving RL to OpenEnv and TRL

sergiopaniego

•

Feb 26

• 22

upvoted an article 5 months ago

Article

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

MaziyarPanahi

•

Feb 7

• 22

upvoted a collection 5 months ago

ScopeGuard-2601

Collection

https://principled-intelligence.com/news/introducing-scope-guard • 3 items • Updated 21 days ago • 7

Stefano Fiorucci PRO

AI & ML interests

Recent Activity

Organizations

anakin87's activity

Shipping huggingface_hub every week with AI, open tools, and a human in the loop

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

Introducing the Ettin Reranker Family

ML Intern Takes Our Post-Training Internship Test

Multimodal Embedding & Reranker Models with Sentence Transformers

Training mRNA Language Models Across 25 Species for $165

TRL v1.0: Post-Training Library Built to Move with the Field

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

The ML Engineer's Guide to Protein AI

Bringing Autonomous Driving RL to OpenEnv and TRL

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output