|
|
--- |
|
|
license: apache-2.0 |
|
|
datasets: |
|
|
- TIGER-Lab/rStar-Critique-Data |
|
|
language: |
|
|
- en |
|
|
metrics: |
|
|
- accuracy |
|
|
base_model: |
|
|
- Qwen/Qwen3-4B |
|
|
tags: |
|
|
- code |
|
|
--- |
|
|
|
|
|
## Model |
|
|
We release the 4B model trained with [Critique-Coder](https://github.com/TIGER-AI-Lab/Critique-Coder). |
|
|
|
|
|
## Data |
|
|
Data Construction Pipeline is shown: |
|
|
|
|
|
 |
|
|
|
|
|
## Paper |
|
|
[Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning](https://huggingface.co/papers/2509.22824) |
|
|
|
|
|
## Project Page |
|
|
https://tiger-ai-lab.github.io/Critique-Coder |
|
|
|
|
|
## Code |
|
|
https://github.com/TIGER-AI-Lab/Critique-Coder |
|
|
|
|
|
## Sample Usage |
|
|
|
|
|
You can download this dataset using the Hugging Face CLI: |
|
|
|
|
|
```bash |
|
|
hf download Critique-Coder/rStar-Critique-Data --local-dir ./data/critique-coder-dataset --repo dataset |
|
|
``` |
|
|
|
|
|
## Citation |
|
|
``` |
|
|
@article{ruan2025critiquecoder, |
|
|
title={Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning}, |
|
|
author={Ruan, Chi and Jiang, Dongfu and Wang, Yubo and Chen, Wenhu}, |
|
|
journal={ArXiv}, |
|
|
year={2025}, |
|
|
volume={2509.22824} |
|
|
} |
|
|
``` |