Liang
Andynsn
·
AI & ML interests
None yet
Recent Activity
updated a collection 4 days ago
latent updated a collection 12 days ago
self - evolve updated a collection 12 days ago
RLOrganizations
RL
RL
-
Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes
Paper • 2603.25562 • Published • 19 -
Self-Distilled Agentic Reinforcement Learning
Paper • 2605.15155 • Published • 113 -
DRAGON: Distributional Rewards Optimize Diffusion Generative Models
Paper • 2504.15217 • Published • 11 -
Diffusion Policy Policy Optimization
Paper • 2409.00588 • Published • 20
DLLM
-
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 82 -
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer
Paper • 2605.15178 • Published • 87 -
A^2RD: Agentic Autoregressive Diffusion for Long Video Consistency
Paper • 2605.06924 • Published • 15 -
Diffusion Policy Policy Optimization
Paper • 2409.00588 • Published • 20
Agentic
latent
-
Unified Latents (UL): How to train your latents
Paper • 2602.17270 • Published • 61 -
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
Paper • 2604.02029 • Published • 151 -
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
Paper • 2512.24617 • Published • 66 -
Cross-Tokenizer LLM Distillation through a Byte-Level Interface
Paper • 2604.07466 • Published • 7
self - evolve
memory
rec
latent
-
Unified Latents (UL): How to train your latents
Paper • 2602.17270 • Published • 61 -
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
Paper • 2604.02029 • Published • 151 -
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space
Paper • 2512.24617 • Published • 66 -
Cross-Tokenizer LLM Distillation through a Byte-Level Interface
Paper • 2604.07466 • Published • 7
RL
RL
-
Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes
Paper • 2603.25562 • Published • 19 -
Self-Distilled Agentic Reinforcement Learning
Paper • 2605.15155 • Published • 113 -
DRAGON: Distributional Rewards Optimize Diffusion Generative Models
Paper • 2504.15217 • Published • 11 -
Diffusion Policy Policy Optimization
Paper • 2409.00588 • Published • 20
self - evolve
DLLM
-
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 82 -
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer
Paper • 2605.15178 • Published • 87 -
A^2RD: Agentic Autoregressive Diffusion for Long Video Consistency
Paper • 2605.06924 • Published • 15 -
Diffusion Policy Policy Optimization
Paper • 2409.00588 • Published • 20
memory
Agentic