From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills Paper β’ 2604.24026 β’ Published 10 days ago β’ 18
ibm-granite/granite-speech-4.1-2b-nar Feature Extraction β’ 2B β’ Updated 6 days ago β’ 1.32k β’ 36
ibm-granite/granite-speech-4.1-2b-plus Automatic Speech Recognition β’ 2B β’ Updated 8 days ago β’ 5.45k β’ 42
ibm-granite/granite-speech-4.1-2b Automatic Speech Recognition β’ 2B β’ Updated 8 days ago β’ 33.1k β’ 71
ibm-granite/granite-embedding-97m-multilingual-r2 Feature Extraction β’ 97.4M β’ Updated 8 days ago β’ 3.75k β’ β’ 83
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 Any-to-Any β’ 33B β’ Updated 1 day ago β’ 53.1k β’ 255
MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex β’ 7 items β’ Updated 4 days ago β’ 55
OpenMOSS-Team/MOSS-Audio-4B-Thinking Audio-Text-to-Text β’ 5B β’ Updated 23 days ago β’ 859 β’ 27
Running Featured 57 Pocket TTS ONNX Web Demo π 57 Multilingual Pocket TTS voice cloning in the browser (CPU)