Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries • 5 days ago • 46
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity Paper • 2506.16500 • Published Jun 19, 2025 • 17
SparseLoRA Collection Accelerating LLM Fine-Tuning with Contextual Sparsity • 4 items • Updated 4 days ago • 1
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published 15 days ago • 84
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets Paper • 2602.22207 • Published 17 days ago • 42
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published 15 days ago • 86
SWE-rebench-V2 Collection SWE-rebench-V2 is a curated dataset of software-engineering tasks derived from real GitHub issues and pull requests. • 3 items • Updated 11 days ago • 6
LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding Paper • 2602.23881 • Published 15 days ago • 18
LK-Speculators Collection High-performance speculative decoding draft models trained using LK losses, novel training objectives that directly optimize acceptance rate • 9 items • Updated 11 days ago • 5
GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface Paper • 2507.18546 • Published Jul 24, 2025 • 35
GeRaCl Collection General Rapid Classification: Zero-Shot Classifier for Russian • 2 items • Updated Jun 7, 2025 • 3
PolyGuard: A Multilingual Safety Moderation Tool for 17 Languages Paper • 2504.04377 • Published Apr 6, 2025 • 1
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published Feb 11 • 189