Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30 • 117
StateX: Enhancing RNN Recall via Post-training State Expansion Paper • 2509.22630 • Published Sep 26 • 3
StateX: Enhancing RNN Recall via Post-training State Expansion Paper • 2509.22630 • Published Sep 26 • 3
StateX: Enhancing RNN Recall via Post-training State Expansion Paper • 2509.22630 • Published Sep 26 • 3 • 2
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity Paper • 2507.08771 • Published Jul 11 • 9
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity Paper • 2507.08771 • Published Jul 11 • 9
Cost-Optimal Grouped-Query Attention for Long-Context LLMs Paper • 2503.09579 • Published Mar 12 • 5 • 2