Update README.md
README.md
CHANGED
@@ -11,18 +11,7 @@ tags:
 - Multi-Modal
 ---
 
-
-
-# ADD VISION
-
-```python
-
-
-
-print('Add Vision...')
-# ADD HEAD
-# Combine pre-trained encoder and pre-trained decoder to form a Seq2Seq model
-
+# Creation Process
 
 
 Vmodel = VisionEncoderDecoderModel.from_encoder_decoder_pretrained(
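
The README line `Vmodel = VisionEncoderDecoderModel.from_encoder_decoder_pretrained(` is cut off in this diff, so the checkpoints it actually combines are not visible here. As a rough illustration of the idea in the removed comment (pairing a vision encoder with a text decoder to form a Seq2Seq model), the sketch below builds a tiny, randomly initialised `VisionEncoderDecoderModel` from configs. The choice of ViT and GPT-2 and all layer sizes are assumptions made for illustration only; they are not taken from this project.

```python
import torch
from transformers import (
    GPT2Config,
    ViTConfig,
    VisionEncoderDecoderConfig,
    VisionEncoderDecoderModel,
)

# Hypothetical tiny configs so the sketch runs offline without downloading
# weights. These sizes are assumptions, not the project's real settings.
encoder_config = ViTConfig(
    hidden_size=32,
    num_hidden_layers=1,
    num_attention_heads=2,
    intermediate_size=64,
    image_size=32,
    patch_size=16,
)
decoder_config = GPT2Config(
    n_embd=32,
    n_layer=1,
    n_head=2,
    n_positions=32,
    vocab_size=100,
)

# Combine the two configs; this marks the decoder as a decoder and
# enables cross-attention over the encoder's hidden states.
config = VisionEncoderDecoderConfig.from_encoder_decoder_configs(
    encoder_config, decoder_config
)
model = VisionEncoderDecoderModel(config=config)
model.eval()

# One random 32x32 RGB "image" and three hypothetical decoder token ids.
pixel_values = torch.randn(1, 3, 32, 32)
decoder_input_ids = torch.tensor([[1, 2, 3]])

with torch.no_grad():
    outputs = model(pixel_values=pixel_values, decoder_input_ids=decoder_input_ids)

# One row of vocabulary logits per decoder position.
logits_shape = tuple(outputs.logits.shape)
```

With pre-trained weights, `VisionEncoderDecoderModel.from_encoder_decoder_pretrained` takes an encoder checkpoint and a decoder checkpoint in place of the two configs; which checkpoints this README passes is not shown in the diff.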