Model Description

AgenticQwen 8B is a small agentic language model trained on Qwen3 8B, designed for multi-step reasoning and tool use. It is trained with a multi-round reinforcement learning (GRPO-style) pipeline and a dual "data flywheel" mechanism that continually increases task difficulty for both reasoning and agentic workflows.

Note

For best benchmark performance, we recommend using the same (or highly similar) prompting format as used during training, as described in the accompanying paper.

Downloads last month: 2

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Anonymousxsfsag/agenticqwen_8B

Quantizations

2 models