Constrained Initial Representations

This repository contains the pre-trained policy weights for our method, Constrained Initial Representations (CIR).

The weights are organized by environment and seed, so the policy for a given run can be loaded directly from the corresponding directory.
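As a minimal sketch of how one might locate the weights for a given run, the helper below resolves a directory from an environment name and a seed and lists the checkpoint files inside it. The `<root>/<env>/<seed>/` layout, the function names, and the file suffixes are all assumptions made for illustration; check the actual directory and file names in this repository and adjust accordingly.

```python
from pathlib import Path

# ASSUMPTION: common checkpoint suffixes; the files in this repository may differ.
WEIGHT_SUFFIXES = (".pt", ".pth", ".pkl", ".npz")


def weights_dir(root, env, seed):
    """Return the directory assumed to hold the weights for one environment and seed.

    ASSUMPTION: the layout is <root>/<env>/<seed>/. This is a hypothetical layout,
    not one confirmed by the model card.
    """
    return Path(root) / env / str(seed)


def list_weight_files(root, env, seed, suffixes=WEIGHT_SUFFIXES):
    """List candidate checkpoint files for one environment and seed, sorted by path."""
    directory = weights_dir(root, env, seed)
    if not directory.is_dir():
        raise FileNotFoundError(f"No weights directory found at {directory}")
    return sorted(
        p for p in directory.iterdir() if p.is_file() and p.suffix in suffixes
    )
```

Once the files are located, they can be loaded with whichever framework was used to save them; the model card does not state the serialization format, so that step is left out here.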

Our paper is available on arXiv (arXiv:2602.11800).

Citation

If you find our work interesting or use it in your research, please consider citing our paper:

@article{lyu2026temporal,
  title={Temporal Difference Learning with Constrained Initial Representations},
  author={Lyu, Jiafei and Yang, Jingwen and Qiao, Zhongjian and Liu, Runze and Liu, Zeyuan and Ye, Deheng and Lu, Zongqing and Li, Xiu},
  journal={arXiv preprint arXiv:2602.11800},
  year={2026}
}

Paper for dmux/CIR: Temporal Difference Learning with Constrained Initial Representations (arXiv 2602.11800, published Feb 12)