Update README.md
Browse files
README.md
CHANGED
|
@@ -39,7 +39,7 @@ Türkçe görsel dil modelini deneyimlemeye hazır mısınız? Hadi başlayalım
|
|
| 39 |
|
| 40 |
## English
|
| 41 |
This model is a multimodal large language model that combines [SigLIP](https://huggingface.co/docs/transformers/en/model_doc/siglip) as its vision encoder with [GPT2-large](https://huggingface.co/docs/transformers/en/model_doc/gpt2) as its language model. The vision projector connects the two modalities together.
|
| 42 |
-
Its architecture closely resembles [PaliGemma](https://
|
| 43 |
|
| 44 |
Here's the summary of the development process:
|
| 45 |
|
|
|
|
| 39 |
|
| 40 |
## English
|
| 41 |
This model is a multimodal large language model that combines [SigLIP](https://huggingface.co/docs/transformers/en/model_doc/siglip) as its vision encoder with [GPT2-large](https://huggingface.co/docs/transformers/en/model_doc/gpt2) as its language model. The vision projector connects the two modalities together.
|
| 42 |
+
Its architecture closely resembles [PaliGemma](https://huggingface.co/docs/transformers/v4.44.0/model_doc/paligemma), with some refined adjustments to the vision projector and the causal language modeling.
|
| 43 |
|
| 44 |
Here's the summary of the development process:
|
| 45 |
|