Update README.md
README.md
CHANGED
|
@@ -1,3 +1,10 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
---
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
---
|
| 4 |
+
|
| 5 |
+
This model was trained for 5 epochs on 3 different tasks in a curriculum learning fashion.
|
| 6 |
+
The first task was object classification for text-object level alignment, followed by
|
| 7 |
+
referring region description and finally object instruction following. The LLM decoder backbone is
|
| 8 |
+
llama-2-7b-hf and the vision encoder is a clip-vit-large-patch14-336 model.
|
| 9 |
+
|
| 10 |
+
For more details on training and usage, check out the GitHub repository at https://github.com/tossowski/Olive.
|
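As a rough illustration of what the added README text describes, the three-stage curriculum and the two backbones could be recorded as plain data. This is only a sketch: the `ModelCard` class and the `describe_curriculum` helper are hypothetical names made up here, not part of the Olive repository, and the README does not say how the 5 epochs are divided across the three tasks, so the sketch stores only the total.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelCard:
    """Hypothetical container for the facts stated in the README."""
    llm_backbone: str
    vision_encoder: str
    epochs: int          # total as stated; per-task split is not given
    curriculum: tuple    # task names in training order


CARD = ModelCard(
    llm_backbone="llama-2-7b-hf",
    vision_encoder="clip-vit-large-patch14-336",
    epochs=5,
    curriculum=(
        "object classification",
        "referring region description",
        "object instruction following",
    ),
)


def describe_curriculum(card):
    """Return one human-readable line per curriculum stage, in order."""
    return [f"stage {i}: {task}" for i, task in enumerate(card.curriculum, start=1)]


if __name__ == "__main__":
    for line in describe_curriculum(CARD):
        print(line)
```

For actual training and inference code, the linked repository remains the reference.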