SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search Paper • 2605.29796 • Published May 28 • 25
smolagents LLM leaderboard 🏆 • 142 • A leaderboard for LLMs powering smolagents
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs Paper • 2411.02256 • Published Nov 4, 2024 • 1
AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset Paper • 2311.15308 • Published Nov 26, 2023 • 2
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published Sep 25, 2024 • 123
Open ASR Leaderboard 🏆 • 1.39k • Compare speech recognition models on benchmark scores
facebook/wav2vec2-base-960h Automatic Speech Recognition • 94.4M • Updated Nov 14, 2022 • 1.41M • 399
AudioLDM2 Text2Audio Text2Music Generation 🔊 • 307 • Generate audio and waveform video from text