Model Description
AgenticQwen 8B is a small agentic language model built on Qwen3 8B and designed for multi-step reasoning and tool use. It is trained with a multi-round reinforcement learning (GRPO-style) pipeline and a dual "data flywheel" mechanism: one flywheel for reasoning tasks and one for agentic workflows, each continually increasing task difficulty across training rounds.
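To make "GRPO-style" concrete, here is a minimal sketch of the group-relative advantage that GRPO-type methods generally use: several responses are sampled for the same prompt, and each reward is normalized against the group's mean and standard deviation. This is an illustration of the general idea only, not the authors' training code. The function name `group_relative_advantages`, the `eps` term, the use of the population standard deviation, and the example rewards are all assumptions made for this sketch.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of responses to the same prompt.

    Hypothetical helper for illustration: advantage = (r - mean) / (std + eps).
    The population standard deviation is an assumption; implementations differ.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    # eps avoids division by zero when every response gets the same reward
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four sampled responses to a single prompt
# (for example, 1.0 if the task was solved and 0.0 otherwise).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Responses that score above the group mean receive a positive advantage and those below receive a negative one, so no separate value model is needed to provide a baseline. When all responses in a group score the same, every advantage is zero and the group contributes no learning signal, which is one reason a pipeline of this kind benefits from tasks whose difficulty keeps pace with the model.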
Note
For best benchmark performance, we recommend using the same prompt format as in training, or one that closely matches it. The format is described in the accompanying paper.