--- datasets: - Zichen1024/CoVe-12k license: apache-2.0 pipeline_tag: text-generation library_name: transformers base_model: Qwen/Qwen3-4B-Instruct-2507 tags: - agent - tool-use ---
📄 Paper | 🌐 Website | 🤗 Dataset |
## Overview **CoVe-4B** is a compact 4B interactive tool-use agent fine-tuned from [Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507) using the **CoVe** (Constraint-Verification) post-training framework. It is trained on [CoVe-12K](https://huggingface.co/datasets/Zichen1024/CoVe-12k), a dataset of 12K high-quality multi-turn tool-use trajectories synthesized and verified by deterministic constraint checking. The model was introduced in the paper [CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification](https://huggingface.co/papers/2603.01940). ## Framework
The CoVe framework. Explicit constraints are fuzzified to guide a User Simulator LLM, and original constraints act as a deterministic checklist to verify the agent's tool invocations.
Main results on τ²-bench. CoVe-4B achieves top performance in the ≤8B group and rivals models up to 70B.