Demystifying the Slash Pattern in Attention: The Role of RoPE Paper • 2601.08297 • Published 15 days ago • 3
Demystifying the Slash Pattern in Attention: The Role of RoPE Paper • 2601.08297 • Published 15 days ago • 3
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation Paper • 2511.11434 • Published Nov 14, 2025 • 45
Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning Paper • 2510.14095 • Published Oct 15, 2025 • 6
Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation Paper • 2510.15624 • Published Oct 17, 2025 • 15
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published Sep 30, 2025 • 20 • 2