Update README.md
Browse files
README.md
CHANGED
|
@@ -139,8 +139,8 @@ To create high-quality Polish multimodal data from English sources, a rigorous t
|
|
| 139 |
### Summary
|
| 140 |
|
| 141 |
The model demonstrates a significant advancement in Polish multimodal capabilities:
|
| 142 |
-
* **MMBench-PL:** Achieved **
|
| 143 |
-
* **Captioning Quality:**
|
| 144 |
* **Qualitative Analysis:** The model shows superior handling of Polish grammar/morphology and correctly identifies Polish cultural elements (e.g., specific landmarks like the Palace of Culture and Science, regional food like Toruń gingerbread) where generic models often fail.
|
| 145 |
|
| 146 |
## Societal Impact Assessment
|
|
@@ -167,27 +167,6 @@ The model demonstrates a significant advancement in Polish multimodal capabiliti
|
|
| 167 |
We gratefully acknowledge Polish high-performance computing infrastructure PLGrid (HPC Centers: ACK Cyfronet AGH) for providing computer facilities and support within computational grant no. PLG/2025/018129
|
| 168 |
|
| 169 |
|
| 170 |
-
# Citation
|
| 171 |
-
|
| 172 |
-
If you use this model, please cite the following paper:
|
| 173 |
-
|
| 174 |
-
```bibtex
|
| 175 |
-
@inproceedings{statkiewicz2026annotation,
|
| 176 |
-
title = {Annotation-Efficient Vision-Language Model Adaptation to the Polish Language Using the LLaVA Framework},
|
| 177 |
-
author = {Statkiewicz, Grzegorz and
|
| 178 |
-
Dobrzeniecka, Alicja and
|
| 179 |
-
Seweryn, Karolina and
|
| 180 |
-
Krasnod{\k e}bska, Aleksandra and
|
| 181 |
-
Piosek, Karolina and
|
| 182 |
-
Bogusz, Katarzyna and
|
| 183 |
-
Cygert, Sebastian and
|
| 184 |
-
Kusa, Wojciech},
|
| 185 |
-
booktitle = {Proceedings of the Student Workshop at the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026)},
|
| 186 |
-
year = {2026},
|
| 187 |
-
publisher = {Association for Computational Linguistics}
|
| 188 |
-
}
|
| 189 |
-
```
|
| 190 |
-
|
| 191 |
# Model Card Contact
|
| 192 |
|
| 193 |
For questions or contributions, please reach out via: nlp@nask.pl
|
|
@@ -281,3 +260,25 @@ output = llm.generate(
|
|
| 281 |
|
| 282 |
print(output[0].outputs[0].text)
|
| 283 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 139 |
### Summary
|
| 140 |
|
| 141 |
The model demonstrates a significant advancement in Polish multimodal capabilities:
|
| 142 |
+
* **MMBench-PL:** Achieved **79.35%**, marking a **+9.55% improvement** over LLaVA-1.6-Vicuna-13B, while maintaining comparable English performance.
|
| 143 |
+
* **Captioning Quality:** Achieved better performance than PaliGemma-3B (65.28% win-rate vs. PaliGemma-3B), slightly outperforms LLaVA-1.6-Mistral-7B and LLaVA-1.6-Vicuna-13B, and shows competitive results—though slightly lower-compared to Qwen2.5-VL-7B and Pixtral-12B.
|
| 144 |
* **Qualitative Analysis:** The model shows superior handling of Polish grammar/morphology and correctly identifies Polish cultural elements (e.g., specific landmarks like the Palace of Culture and Science, regional food like Toruń gingerbread) where generic models often fail.
|
| 145 |
|
| 146 |
## Societal Impact Assessment
|
|
|
|
| 167 |
We gratefully acknowledge Polish high-performance computing infrastructure PLGrid (HPC Centers: ACK Cyfronet AGH) for providing computer facilities and support within computational grant no. PLG/2025/018129
|
| 168 |
|
| 169 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 170 |
# Model Card Contact
|
| 171 |
|
| 172 |
For questions or contributions, please reach out via: nlp@nask.pl
|
|
|
|
| 260 |
|
| 261 |
print(output[0].outputs[0].text)
|
| 262 |
```
|
| 263 |
+
|
| 264 |
+
|
| 265 |
+
# Citation
|
| 266 |
+
|
| 267 |
+
If you use this model, please cite the following paper:
|
| 268 |
+
|
| 269 |
+
```bibtex
|
| 270 |
+
@inproceedings{statkiewicz2026annotation,
|
| 271 |
+
title = {Annotation-Efficient Vision-Language Model Adaptation to the Polish Language Using the LLaVA Framework},
|
| 272 |
+
author = {Statkiewicz, Grzegorz and
|
| 273 |
+
Dobrzeniecka, Alicja and
|
| 274 |
+
Seweryn, Karolina and
|
| 275 |
+
Krasnod{\k e}bska, Aleksandra and
|
| 276 |
+
Piosek, Karolina and
|
| 277 |
+
Bogusz, Katarzyna and
|
| 278 |
+
Cygert, Sebastian and
|
| 279 |
+
Kusa, Wojciech},
|
| 280 |
+
booktitle = {Proceedings of the Student Workshop at the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026)},
|
| 281 |
+
year = {2026},
|
| 282 |
+
publisher = {Association for Computational Linguistics}
|
| 283 |
+
}
|
| 284 |
+
```
|