MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published Jan 12 • 52
view article Article Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers Nov 3, 2022 • 361