Yao's picture

Yao PRO

Lucasoppem

·

AI & ML interests

None yet

Recent Activity

updated a Space 4 days ago

Lucasoppem/cascade_risk

published a Space 4 days ago

Lucasoppem/cascade_risk

upvoted a paper 21 days ago

Large Language Models Hack Rewards, and Society

View all activity

Organizations

None yet

upvoted a paper 21 days ago

Large Language Models Hack Rewards, and Society

Paper • 2606.04075 • Published 24 days ago • 10

upvoted a paper 25 days ago

Linear Ensembles Wash Away Watermarks: On the Fragility of Distributional Perturbations in LLMs

Paper • 2605.30501 • Published 29 days ago • 29

upvoted a paper 2 months ago

Where does output diversity collapse in post-training?

Paper • 2604.16027 • Published Apr 17 • 22

upvoted an article 2 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

NormalUhr

•

Feb 11, 2025

• 126

upvoted 4 papers 5 months ago

Chain Of Thought Compression: A Theoritical Analysis

Paper • 2601.21576 • Published Jan 29 • 20

Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation

Paper • 2602.02007 • Published Feb 2 • 19

Context Compression via Explicit Information Transmission

Paper • 2602.03784 • Published Feb 3 • 15

An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift

Paper • 2601.05882 • Published Jan 9 • 21

upvoted a paper 7 months ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

Paper • 2511.20102 • Published Nov 25, 2025 • 28

upvoted a paper 8 months ago

Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States

Paper • 2510.11052 • Published Oct 13, 2025 • 53