MOSS-Audio - a OpenMOSS-Team Collection

OpenMOSS-Team 's Collections

MOSS Transcribe

MOSS-Video-Preview

AI Can Learn Scientific Taste

MOSS Embodied Planner

Low Rank Sparse Attention

MHA2MLA-refactor

MOSS-Audio

updated 11 days ago

An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex

Running

Agents

28

MOSS Audio 8B Thinking

🐢

28

Generate answers to audio or video prompts
OpenMOSS-Team/MOSS-Audio-4B-Instruct

Audio-Text-to-Text • 5B • Updated Apr 14 • 152k • 78
OpenMOSS-Team/MOSS-Audio-4B-Thinking

Audio-Text-to-Text • 5B • Updated Apr 14 • 35k • 34
OpenMOSS-Team/MOSS-Audio-8B-Instruct

Audio-Text-to-Text • 9B • Updated Jun 11 • 36.7k • 48
OpenMOSS-Team/MOSS-Audio-8B-Thinking

Audio-Text-to-Text • 9B • Updated Jun 11 • 35.4k • 79
OpenMOSS-Team/MOSS-Music-8B-Thinking

Audio-Text-to-Text • 9B • Updated May 1 • 234 • 38
OpenMOSS-Team/MOSS-Music-8B-Instruct

Audio-Text-to-Text • 9B • Updated May 1 • 1.46k • 27
cstr/MOSS-Audio-4B-Instruct-GGUF

Audio-Text-to-Text • 5B • Updated Jun 5 • 611 • 4
MOSS-Audio Technical Report

Paper • 2606.01802 • Published Jun 2 • 3
Running on Zero

MCP

4

MOSS-Music-8B-Thinking

🎵

4

Music understanding model for caption and analysis
OpenMOSS-Team/MOSS-Transcribe-preview-2B

Automatic Speech Recognition • 2B • Updated 26 days ago • 2.61k • 38
Running on Zero

MCP

1

MOSS-Transcribe-preview-2B

🎙

1

Speech-to-text transcription with MOSS-Transcribe-preview-2B
mlx-community/MOSS-Music-8B-Thinking-8bit

Audio-Text-to-Text • 3B • Updated 24 days ago • 563 • 7