EdwinYue
/

Mem-T-4B

Model card Files Files and versions

Mem-T-4B / README.md

EdwinYue's picture

Update README.md

e88507d verified 13 days ago

|

history blame contribute delete

971 Bytes

	---
	license: apache-2.0
	base_model:
	- Qwen/Qwen3-4B-Instruct
	---

	## Model Description

	Mem-T-4B refers to the model parameters derived from training Qwen3-4B-Instruct using MoT-GRPO within the Mem-T framework.



	## Usage

	For detailed instructions on how to use within the Mem-T framework, please refer to the main [Mem-T GitHub repository](https://github.com/yanweiyue/Mem-T).


	## Links

	* GitHub: [https://github.com/yanweiyue/Mem-T](https://github.com/yanweiyue/Mem-T)

	## Citation

	If you find this work useful, please consider citing our paper.

	```
	@misc{yue2026memtdensifyingrewardslonghorizon,
	title={Mem-T: Densifying Rewards for Long-Horizon Memory Agents},
	author={Yanwei Yue and Guibin Zhang and Boci Peng and Xuanbo Fan and Jiaxin Guo and Qiankun Li and Yan Zhang},
	year={2026},
	eprint={2601.23014},
	archivePrefix={arXiv},
	primaryClass={cs.LG},
	url={https://arxiv.org/abs/2601.23014},
	}
	```