Mem-T-4B / README.md
EdwinYue's picture
Update README.md
e88507d verified
---
license: apache-2.0
base_model:
- Qwen/Qwen3-4B-Instruct
---
## Model Description
**Mem-T-4B** refers to the model parameters derived from training Qwen3-4B-Instruct using **MoT-GRPO** within the **Mem-T** framework.
## Usage
For detailed instructions on how to use within the **Mem-T** framework, please refer to the main [Mem-T GitHub repository](https://github.com/yanweiyue/Mem-T).
## Links
* **GitHub:** [https://github.com/yanweiyue/Mem-T](https://github.com/yanweiyue/Mem-T)
## Citation
If you find this work useful, please consider citing our paper.
```
@misc{yue2026memtdensifyingrewardslonghorizon,
title={Mem-T: Densifying Rewards for Long-Horizon Memory Agents},
author={Yanwei Yue and Guibin Zhang and Boci Peng and Xuanbo Fan and Jiaxin Guo and Qiankun Li and Yan Zhang},
year={2026},
eprint={2601.23014},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2601.23014},
}
```