ACON: Optimizing Context Compression for Long-horizon LLM Agents Paper • 2510.00615 • Published Oct 1 • 32
Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models Paper • 2504.20157 • Published Apr 28 • 37
Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models Paper • 2412.11423 • Published Dec 16, 2024 • 2