Large Language Models Hack Rewards, and Society Paper • 2606.04075 • Published 24 days ago • 10
Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs Paper • 2605.30501 • Published 29 days ago • 29
Where does output diversity collapse in post-training? Paper • 2604.16027 • Published Apr 17 • 22
view article Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment NormalUhr • Feb 11, 2025 • 126
Chain Of Thought Compression: A Theoritical Analysis Paper • 2601.21576 • Published Jan 29 • 20
Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation Paper • 2602.02007 • Published Feb 2 • 19
Context Compression via Explicit Information Transmission Paper • 2602.03784 • Published Feb 3 • 15
An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift Paper • 2601.05882 • Published Jan 9 • 21
SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space Paper • 2511.20102 • Published Nov 25, 2025 • 28
Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States Paper • 2510.11052 • Published Oct 13, 2025 • 53