|
|
--- |
|
|
license: apache-2.0 |
|
|
base_model: |
|
|
- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B |
|
|
pipeline_tag: question-answering |
|
|
metrics: |
|
|
- accuracy |
|
|
--- |
|
|
# OpenCausaLab/CauGym |
|
|
|
|
|
<!-- Provide a quick summary of what the model is/does. --> |
|
|
CauGym model is a model trained via GRPO (Group Relative Policy Optimization) on VERL framework (https://github.com/verl-project/verl), and it is specialized for causal inference. |
|
|
|
|
|
## Model Details |
|
|
|
|
|
- **Developed by:** OpenCausaLab |
|
|
- **Model type:** LLM. |
|
|
- **Language(s) (NLP):** Englsih. |
|
|
|
|
|
|
|
|
### Model Sources |
|
|
|
|
|
<!-- Provide the basic links for the model. --> |
|
|
|
|
|
- **Repository:** https://github.com/OpenCausaLab/CauGym |
|
|
- **Paper :** https://www.arxiv.org/abs/2602.06337 |
|
|
|
|
|
|
|
|
### Evaluation |
|
|
|
|
|
We have evaluated this model on CALM benchmark and CauGym benchmark, and the evaluation metric is accuracy. |
|
|
| Benchmark | ATE | CDE | ETT | NDE | NIE | PN | PS | |
|
|
| :--- | :---: | :---: | :---: | :---: | :---: | :---: | :---: | |
|
|
| **CALM** | 0.990 | 0.994 | 0.900 | 0.940 | 0.930 | 0.928 | 0.866 | |
|
|
| **CauGym-rephrased**| 0.948 | 0.982 | 0.856 | 0.890 | 0.888 | 0.778 | 0.816 | |
|
|
| **CauGym-ommitted** | 0.935 | 0.963 | 0.837 | 0.934 | 0.838 | 0.900 | 0.907 | |
|
|
| **CauGym-deconfounding** | 0.976 | 0.986 | 0.854 | 0.572 | 0.872 | 0.952 | 0.848 | |
|
|
| **CauGym-redundant** | 0.972 | 0.966 | 0.918 | 0.850 | 0.888 | 0.934 | 0.910 | |
|
|
| **CauGym-insufficient** | 0.884 | 0.902 | 0.686 | 0.696 | 0.958 | 0.940 | 0.954 | |
|
|
|
|
|
|
|
|
## Citation |
|
|
|
|
|
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. --> |
|
|
```latex |
|
|
@misc{chen2026posttrainingtransformllmscausal, |
|
|
title={Can Post-Training Transform LLMs into Causal Reasoners?}, |
|
|
author={Junqi Chen and Sirui Chen and Chaochao Lu}, |
|
|
year={2026}, |
|
|
eprint={2602.06337}, |
|
|
archivePrefix={arXiv}, |
|
|
primaryClass={cs.CL}, |
|
|
url={https://arxiv.org/abs/2602.06337}, |
|
|
} |
|
|
``` |