Add pipeline tag, library name and link to paper
#2, opened by nielsr (HF Staff)
README.md
CHANGED
@@ -1,8 +1,12 @@
 ---
 license: apache-2.0
+library_name: funasr
+pipeline_tag: feature-extraction
 ---
+
 # Step-Audio-Tokenizer
 
+This repository contains the tokenizer model described in the paper [Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction](https://arxiv.org/abs/2502.11946).
 
 Step-Audio LLM is the industry’s first 130-billion parameter hu-manlike unified end-to-end model that integrates multimodal speech un-derstanding and generation capabilities, including singing voice synthesis, tool utilization, role-play and multilingual/dialectal comprehension and synthesis.
 
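Reviewer note: the metadata keys this change adds (`library_name`, `pipeline_tag`) live in the `---` delimited block at the top of README.md. A minimal sketch of reading that block back is below. It assumes flat `key: value` lines only, and `read_front_matter` and `README_HEAD` are hypothetical names introduced for illustration; this is not the Hub's own parser.

```python
def read_front_matter(readme_text: str) -> dict[str, str]:
    """Return key/value pairs from a leading '---' delimited block.

    Hypothetical helper: handles flat `key: value` lines only,
    not nested YAML, lists, or quoted values.
    """
    lines = readme_text.splitlines()
    # Front matter must start on the very first line.
    if not lines or lines[0].strip() != "---":
        return {}
    metadata: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            # Closing delimiter reached: the block is complete.
            return metadata
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    # No closing delimiter was found, so treat it as no front matter.
    return {}


# README head as it looks after this change (values taken from the diff).
README_HEAD = """---
license: apache-2.0
library_name: funasr
pipeline_tag: feature-extraction
---

# Step-Audio-Tokenizer
"""

metadata = read_front_matter(README_HEAD)
print(metadata)
```

A real YAML parser would be the safer choice once the card gains lists or nested fields; the line-based split is used here only to keep the sketch dependency-free.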