Instructions to use LanguageBind/LanguageBind_Audio with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use LanguageBind/LanguageBind_Audio with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("zero-shot-image-classification", model="LanguageBind/LanguageBind_Audio") pipe( "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/parrots.png", candidate_labels=["animals", "humans", "landscape"], )# Load model directly from transformers import AutoModelForZeroShotImageClassification model = AutoModelForZeroShotImageClassification.from_pretrained("LanguageBind/LanguageBind_Audio", dtype="auto") - Notebooks
- Google Colab
- Kaggle
linbin commited on
Commit ·
47aeb77
1
Parent(s): a338ed0
Upload config.json
Browse files- config.json +2 -2
config.json
CHANGED
|
@@ -96,8 +96,8 @@
|
|
| 96 |
"lora_dropout": 0.1,
|
| 97 |
"add_time_attn": false,
|
| 98 |
"num_frames": 1,
|
| 99 |
-
"num_mel_bins":
|
| 100 |
-
"target_length":
|
| 101 |
"add_cross_attention": false,
|
| 102 |
"architectures": null,
|
| 103 |
"attention_dropout": 0.0,
|
|
|
|
| 96 |
"lora_dropout": 0.1,
|
| 97 |
"add_time_attn": false,
|
| 98 |
"num_frames": 1,
|
| 99 |
+
"num_mel_bins": 112,
|
| 100 |
+
"target_length": 1008,
|
| 101 |
"add_cross_attention": false,
|
| 102 |
"architectures": null,
|
| 103 |
"attention_dropout": 0.0,
|