Add library name and project page link
#4 opened by nielsr (HF Staff)
README.md CHANGED
@@ -1,13 +1,14 @@
 ---
-pipeline_tag: video-text-to-text
-license: apache-2.0
 base_model:
 - Qwen/Qwen2.5-7B-Instruct
+datasets:
+- HuggingFaceFV/finevideo
 language:
 - en
 - zh
-
--
+license: apache-2.0
+pipeline_tag: video-text-to-text
+library_name: transformers
 ---
 
 # Ola-7B
@@ -22,6 +23,7 @@ Ola offers an on-demand solution to seamlessly and efficiently process visual in
 - **Repository:** https://github.com/Ola-Omni/Ola
 - **Languages:** English, Chinese
 - **Paper:** https://huggingface.co/papers/2502.04328
+- **Project Page:** https://ola-omni.github.io/
 
 ## Use
 
@@ -177,11 +179,15 @@ def ola_inference(multimodal, audio_path):
     else:
         qs = ''
     if USE_SPEECH and audio_path:
-        qs = DEFAULT_IMAGE_TOKEN + "
+        qs = DEFAULT_IMAGE_TOKEN + "
+" + "User's question in speech: " + DEFAULT_SPEECH_TOKEN + '
+'
     elif USE_SPEECH:
-        qs = DEFAULT_SPEECH_TOKEN + DEFAULT_IMAGE_TOKEN + "
+        qs = DEFAULT_SPEECH_TOKEN + DEFAULT_IMAGE_TOKEN + "
+" + qs
     else:
-        qs = DEFAULT_IMAGE_TOKEN + "
+        qs = DEFAULT_IMAGE_TOKEN + "
+" + qs
 
     conv = conv_templates[conv_mode].copy()
     conv.append_message(conv.roles[0], qs)
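
For reference, the prompt-building branch touched by the last hunk can be sketched as a standalone function. This is a minimal sketch rather than the repository's code. `build_prompt` is a hypothetical helper and the two token values are placeholder assumptions. It also assumes the line breaks inside the string literals in the diff are meant to be `\n` escape sequences, since a single-quoted or double-quoted Python string literal cannot span multiple lines.

```python
from typing import Optional

# Placeholder values (assumptions); the real constants come from the Ola codebase.
DEFAULT_IMAGE_TOKEN = "<image>"
DEFAULT_SPEECH_TOKEN = "<speech>"


def build_prompt(qs: str, use_speech: bool, audio_path: Optional[str]) -> str:
    """Hypothetical helper mirroring the three branches shown in the diff."""
    if use_speech and audio_path:
        # A spoken question was supplied: the text query is replaced by
        # a speech placeholder that follows the visual placeholder.
        return (
            DEFAULT_IMAGE_TOKEN + "\n"
            + "User's question in speech: " + DEFAULT_SPEECH_TOKEN + "\n"
        )
    if use_speech:
        # Speech mode without a separate audio file: speech and visual
        # placeholders come first, then the text query.
        return DEFAULT_SPEECH_TOKEN + DEFAULT_IMAGE_TOKEN + "\n" + qs
    # Visual input plus a plain text query.
    return DEFAULT_IMAGE_TOKEN + "\n" + qs
```

Calling `build_prompt("What is shown?", False, None)` yields the visual placeholder, a newline, and the question text. Writing the newline as an explicit `\n` escape keeps each assignment on a single line.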