zwhy

XiaohuaWang


AI & ML interests

None yet

upvoted a paper 3 months ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published Apr 15 • 32

upvoted a paper 7 months ago

Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction

Paper • 2601.05107 • Published Jan 8 • 24