Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
Paper
• 2603.06569 • Published
• 102
Auden is an open research initiative for audio and multimodal understanding. We publish reproducible code, curated datasets, model checkpoints, and interactive demos to enable transparent evaluation and strong, reusable baselines.