TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs Paper • 2501.15674 • Published Jan 26, 2025
KromHC: Manifold-Constrained Hyper-Connections with Kronecker-Product Residual Matrices Paper • 2601.21579 • Published 7 days ago