Seedream 4.0: Toward Next-generation Multimodal Image Generation Paper • 2509.20427 • Published Sep 24, 2025 • 84
StackLLaMA: A hands-on guide to train LLaMA with RLHF Article • edbeeching, kashif, ybelkada, lewtun, lvwerra, nazneen, natolambert • Apr 5, 2023 • 48
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework Paper • 2405.11143 • Published May 20, 2024 • 41
StarCoder: A State-of-the-Art LLM for Code Article • lvwerra, loubnabnl • May 4, 2023 • 73
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation Paper • 2502.21074 • Published Feb 28, 2025 • 4
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 192
SmolLM3: smol, multilingual, long-context reasoner Article • eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 775
Mercury: Ultra-Fast Language Models Based on Diffusion Paper • 2506.17298 • Published Jun 17, 2025 • 10
3D Gaussian Splatting for Real-Time Radiance Field Rendering Paper • 2308.04079 • Published Aug 8, 2023 • 202
LLMZip: Lossless Text Compression using Large Language Models Paper • 2306.04050 • Published Jun 6, 2023 • 5
Introducing SynthID Text Article • sumedhghaisas, sdathath, RyanMullins, joaogante, marcsun13, RaushanTurganbay • Oct 23, 2024 • 59