Safetensors
English
qwen3
code
Critique-Coder-4B / README.md
wenhu's picture
Update README.md
5f45ee8 verified
---
license: apache-2.0
datasets:
- TIGER-Lab/rStar-Critique-Data
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen3-4B
tags:
- code
---
## Model
We release the 4B model trained with [Critique-Coder](https://github.com/TIGER-AI-Lab/Critique-Coder).
## Data
Data Construction Pipeline is shown:
![pipeline](https://github.com/TIGER-AI-Lab/Critique-Coder/blob/main/assets/images/dataset.png?raw=true)
## Paper
[Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning](https://huggingface.co/papers/2509.22824)
## Project Page
https://tiger-ai-lab.github.io/Critique-Coder
## Code
https://github.com/TIGER-AI-Lab/Critique-Coder
## Sample Usage
You can download this dataset using the Hugging Face CLI:
```bash
hf download Critique-Coder/rStar-Critique-Data --local-dir ./data/critique-coder-dataset --repo dataset
```
## Citation
```
@article{ruan2025critiquecoder,
title={Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning},
author={Ruan, Chi and Jiang, Dongfu and Wang, Yubo and Chen, Wenhu},
journal={ArXiv},
year={2025},
volume={2509.22824}
}
```