GPT-OSS Math (4.2B to 20B) Collection Mathematics-focused GPT-OSS models excelling at mathematical computation, proof strategies, and logical reasoning from MMLU mathematics subjects. • 29 items • Updated Aug 13, 2025 • 2
view article Article Mixture of Experts Explained +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.13k
view article Article Ettin Suite: SoTA Paired Encoders and Decoders +4 orionweller, kdricci, mmarone, NohTow, dlawrie, vandurme • Jul 16, 2025 • 80
Learning to (Learn at Test Time): RNNs with Expressive Hidden States Paper • 2407.04620 • Published Jul 5, 2024 • 34
Synthetic ARC Dataset Collection Please see the demonstration examples of our dataset here: https://www.basis.ai/arc_interface/examples • 3 items • Updated Nov 13, 2024 • 4
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning Paper • 2503.07572 • Published Mar 10, 2025 • 48
StarVector: Generating Scalable Vector Graphics Code from Images Paper • 2312.11556 • Published Dec 17, 2023 • 38
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published Apr 8, 2025 • 186
view article Article Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU +4 edbeeching, ybelkada, lvwerra, smangrul, lewtun, kashif • Mar 9, 2023 • 72
view article Article Tiny Agents: an MCP-powered agent in 50 lines of code julien-c • Apr 25, 2025 • 308
view article Article Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive +1 sschoenmeyer, tlwu, mfuntowicz • Jan 15, 2024 • 7
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values Paper • 2504.05535 • Published Apr 7, 2025 • 44