RelayGen: Intra-Generation Model Switching for Efficient Reasoning • Paper • 2602.06454 • Published 5 days ago • 11
LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning • Paper • 2510.14211 • Published Oct 16, 2025 • 9
Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning • Paper • 2505.13866 • Published May 20, 2025 • 17