view article Article [mlx-code](https://josefalbers.github.io/mlx-code) JosefAlbers • 19 days ago • 1
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models Paper • 2408.15518 • Published Aug 28, 2024 • 42
The Mamba in the Llama: Distilling and Accelerating Hybrid Models Paper • 2408.15237 • Published Aug 27, 2024 • 42 • 6