File size: 971 Bytes
3c43f65
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e88507d
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
---
license: apache-2.0
base_model:
- Qwen/Qwen3-4B-Instruct
---

## Model Description

**Mem-T-4B** refers to the model parameters derived from training Qwen3-4B-Instruct using **MoT-GRPO** within the **Mem-T** framework.



## Usage

For detailed instructions on how to use within the **Mem-T** framework, please refer to the main [Mem-T GitHub repository](https://github.com/yanweiyue/Mem-T).


## Links

* **GitHub:** [https://github.com/yanweiyue/Mem-T](https://github.com/yanweiyue/Mem-T)

## Citation

If you find this work useful, please consider citing our paper.

```
@misc{yue2026memtdensifyingrewardslonghorizon,
      title={Mem-T: Densifying Rewards for Long-Horizon Memory Agents}, 
      author={Yanwei Yue and Guibin Zhang and Boci Peng and Xuanbo Fan and Jiaxin Guo and Qiankun Li and Yan Zhang},
      year={2026},
      eprint={2601.23014},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2601.23014}, 
}
```