Scratchpad Patching: Decoupling Compute from Patch Size in Byte-Level Language Models Paper • 2605.09630 • Published 16 days ago • 1
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery Paper • 2604.01658 • Published Apr 2 • 55
Training Domain Draft Models for Speculative Decoding: Best Practices and Insights Paper • 2503.07807 • Published Mar 10, 2025 • 1
Composition of Experts: A Modular Compound AI System Leveraging Large Language Models Paper • 2412.01868 • Published Dec 2, 2024
Training Domain Draft Models for Speculative Decoding: Best Practices and Insights Paper • 2503.07807 • Published Mar 10, 2025 • 1
On the Tool Manipulation Capability of Open-source Large Language Models Paper • 2305.16504 • Published May 25, 2023 • 2
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published Oct 6, 2025 • 132
view post Post 1740 Mini-QwQ an edge device friendly reasoning model distilled from QwQ-32B 🤗: kz919/QwQ-0.5B-Distilled-SFT🇬 🇬 🇺 🇫: kz919/QwQ-0.5B-Distilled-SFT-gguf🤖: kz919/Mini-QwQ See translation 👍 7 7 + Reply
Running Agents Featured 272 Qwen2.5 Coder Artifacts 🐢 272 Generate and preview web app code from a text description