UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action Paper โข 2510.17790 โข Published Oct 20 โข 5
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper โข 2505.03335 โข Published May 6 โข 188
NextCoder Collection NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. โข 6 items โข Updated Jul 9 โข 74
view article Article Custom Vibe Coding Quest Part 2: ๐ Fine-Tuning Gemma 3 for Code Reasoning Apr 1 โข 25
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper โข 2312.11514 โข Published Dec 12, 2023 โข 260