From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills Paper • 2604.24026 • Published 11 days ago • 18
ibm-granite/granite-speech-4.1-2b-nar Feature Extraction • 2B • Updated 7 days ago • 1.38k • 36
ibm-granite/granite-speech-4.1-2b-plus Automatic Speech Recognition • 2B • Updated 8 days ago • 6.78k • 42
ibm-granite/granite-speech-4.1-2b Automatic Speech Recognition • 2B • Updated 8 days ago • 46.3k • 71
ibm-granite/granite-embedding-97m-multilingual-r2 Feature Extraction • 97.4M • Updated 8 days ago • 4.51k • 83
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 Any-to-Any • 33B • Updated 2 days ago • 65.1k • 258
MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 7 items • Updated 5 days ago • 55
OpenMOSS-Team/MOSS-Audio-4B-Thinking Audio-Text-to-Text • 5B • Updated 23 days ago • 864 • 27
Pocket TTS ONNX Web Demo • Multilingual Pocket TTS voice cloning in the browser (CPU) • Running • Featured • 57