stepfun-ai/Step-Audio-Tokenizer

Add pipeline tag, library name and link to paper

by nielsr HF Staff - opened Feb 19, 2025

Files changed (1)

README.md

@@ -1,8 +1,12 @@
 ---
 license: apache-2.0
+library_name: funasr
+pipeline_tag: feature-extraction
 ---
 # Step-Audio-Tokenizer
+This repository contains the tokenizer model described in the paper [Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction](https://arxiv.org/abs/2502.11946).
 Step-Audio LLM is the industry’s first 130-billion parameter humanlike unified end-to-end model that integrates multimodal speech understanding and generation capabilities, including singing voice synthesis, tool utilization, role-play and multilingual/dialectal comprehension and synthesis.
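
For reviewers who want to check the resulting metadata, here is a minimal sketch of reading the front matter block this change produces. It assumes the block contains only flat `key: value` pairs, as in this README, and is not a full YAML parser. The helper name `parse_front_matter` is hypothetical and not part of any library.

```python
def parse_front_matter(readme_text: str) -> dict[str, str]:
    """Return the flat key/value pairs from a leading '---' delimited block.

    Assumption: values are plain scalars on a single line (no lists or nesting).
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter at the top of the file
    metadata: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return metadata  # closing delimiter reached
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    return {}  # block was never closed, so treat it as absent


# README header as it reads after this change.
README = """---
license: apache-2.0
library_name: funasr
pipeline_tag: feature-extraction
---
# Step-Audio-Tokenizer
"""

metadata = parse_front_matter(README)
print(metadata["library_name"], metadata["pipeline_tag"])
```

Nested values or lists (for example a `tags:` list) would need a real YAML parser rather than this line-based approach.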