Audio-to-Audio
English
audio
sound-separation
audiosep
JusperLee commited on
Commit
113d2e4
·
verified ·
1 Parent(s): 438413b

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -0
README.md ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - audio
7
+ - sound-separation
8
+ - audio-to-audio
9
+ - audiosep
10
+ datasets:
11
+ - ShandaAI/Hive
12
+ ---
13
+
14
+ # AudioSep-hive
15
+
16
+ ## Model Description
17
+
18
+ **AudioSep-hive** is a data-efficient, query-based universal sound separation model trained on the [Hive dataset](https://huggingface.co/datasets/ShandaAI/Hive). By leveraging the high-quality, semantically consistent Hive dataset, this model achieves competitive separation accuracy and perceptual quality comparable to state-of-the-art models (such as SAM-Audio) while utilizing only a fraction (~0.2%) of the training data volume.
19
+
20
+ This model is developed by **Shanda AI Research Tokyo** and is introduced in the paper: [A Semantically Consistent Dataset for Data-Efficient Query-Based Universal Sound Separation](https://arxiv.org/abs/2601.22599).
21
+
22
+ ## Model Details
23
+
24
+ - **Model Type:​** Query-Based Universal Sound Separation
25
+ - **Language(s):​** English (for text queries)
26
+ - **License:​** Apache 2.0 (Please update if different)
27
+ - **Trained on:​** [ShandaAI/Hive](https://huggingface.co/datasets/ShandaAI/Hive) (2,442 hours of raw audio, 19.6M mixtures)
28
+ - **Paper:​** [arXiv:2601.22599](https://arxiv.org/abs/2601.22599)
29
+ - **Code Repository:​** [GitHub - ShandaAI/Hive](https://github.com/ShandaAI/Hive)
30
+
31
+ ## Uses
32
+
33
+ The model is intended for universal sound separation tasks, allowing users to extract specific sounds from complex audio mixtures using multimodal prompts (e.g., text descriptions or audio queries).