$\texttt{MemoryRewardBench}$: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models
Paper
•
2601.11969
•
Published
•
25
Long-context Modeling, Reinforcement-Learning, Multi-modality