1930 Coder Collection Fine-tuning the Talkie 13B 1930 model on agentic trajectories • 4 items • Updated 2 days ago • 4
(Some) Emergent Misalignment from Reward Hacking in RL Collection Model checkpoints from the project "(Some) Natural Emergent Misalignment from Reward Hacking in Non-Production RL" • 228 items • Updated 6 days ago • 4
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2 Image-Text-to-Text • 28B • Updated Apr 6 • 644k • 121
🇮🇹 Italian NLP Resources Collection Collection of models, datasets and demos relevant to Italian NLP 🇮🇹 • 300 items • Updated Mar 26 • 34