M*: A Modular, Extensible, Serving System for Multimodal Models Paper • 2606.12688 • Published 12 days ago
VoxServe: Streaming-Centric Serving System for Speech Language Models Paper • 2602.00269 • Published Jan 30 • 6
VoxServe: Streaming-Centric Serving System for Speech Language Models Paper • 2602.00269 • Published Jan 30 • 6
DualTune: Decoupled Fine-Tuning for On-Device Agentic Systems Paper • 2510.00229 • Published Sep 30, 2025 • 1
ConsumerBench: Benchmarking Generative AI Applications on End-User Devices Paper • 2506.17538 • Published Jun 21, 2025 • 7
efficient-speech/lite-whisper-medium-fast Automatic Speech Recognition • 0.7B • Updated Apr 3, 2025 • 6
efficient-speech/lite-whisper-medium-acc Automatic Speech Recognition • 0.8B • Updated Apr 3, 2025 • 7
efficient-speech/lite-whisper-small-fast Automatic Speech Recognition • 0.3B • Updated Apr 3, 2025 • 5
efficient-speech/lite-whisper-small Automatic Speech Recognition • 0.3B • Updated Apr 3, 2025 • 5 • 1
efficient-speech/lite-whisper-small-acc Automatic Speech Recognition • 0.3B • Updated Apr 3, 2025 • 5
efficient-speech/lite-whisper-base-fast Automatic Speech Recognition • 95.4M • Updated Apr 3, 2025 • 6