karolinaseweryn commited on
Commit
9888cd7
·
verified ·
1 Parent(s): d608384

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -23
README.md CHANGED
@@ -139,8 +139,8 @@ To create high-quality Polish multimodal data from English sources, a rigorous t
139
  ### Summary
140
 
141
  The model demonstrates a significant advancement in Polish multimodal capabilities:
142
- * **MMBench-PL:** Achieved **73.89%**, marking a **+5.6% improvement** over LLaVA-1.6-Vicuna-13B, while maintaining comparable English performance.
143
- * **Captioning Quality:** consistently preferred by the LLM judge over open-source competitors (95.2% win-rate vs. PaliGemma-3B, 62.7% vs. Qwen2.5-VL-7B).
144
  * **Qualitative Analysis:** The model shows superior handling of Polish grammar/morphology and correctly identifies Polish cultural elements (e.g., specific landmarks like the Palace of Culture and Science, regional food like Toruń gingerbread) where generic models often fail.
145
 
146
  ## Societal Impact Assessment
@@ -167,27 +167,6 @@ The model demonstrates a significant advancement in Polish multimodal capabiliti
167
  We gratefully acknowledge Polish high-performance computing infrastructure PLGrid (HPC Centers: ACK Cyfronet AGH) for providing computer facilities and support within computational grant no. PLG/2025/018129
168
 
169
 
170
- # Citation
171
-
172
- If you use this model, please cite the following paper:
173
-
174
- ```bibtex
175
- @inproceedings{statkiewicz2026annotation,
176
- title = {Annotation-Efficient Vision-Language Model Adaptation to the Polish Language Using the LLaVA Framework},
177
- author = {Statkiewicz, Grzegorz and
178
- Dobrzeniecka, Alicja and
179
- Seweryn, Karolina and
180
- Krasnod{\k e}bska, Aleksandra and
181
- Piosek, Karolina and
182
- Bogusz, Katarzyna and
183
- Cygert, Sebastian and
184
- Kusa, Wojciech},
185
- booktitle = {Proceedings of the Student Workshop at the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026)},
186
- year = {2026},
187
- publisher = {Association for Computational Linguistics}
188
- }
189
- ```
190
-
191
  # Model Card Contact
192
 
193
  For questions or contributions, please reach out via: nlp@nask.pl
@@ -281,3 +260,25 @@ output = llm.generate(
281
 
282
  print(output[0].outputs[0].text)
283
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
139
  ### Summary
140
 
141
  The model demonstrates a significant advancement in Polish multimodal capabilities:
142
+ * **MMBench-PL:** Achieved **79.35%**, marking a **+9.55% improvement** over LLaVA-1.6-Vicuna-13B, while maintaining comparable English performance.
143
+ * **Captioning Quality:** Achieved better performance than PaliGemma-3B (65.28% win-rate vs. PaliGemma-3B), slightly outperforms LLaVA-1.6-Mistral-7B and LLaVA-1.6-Vicuna-13B, and shows competitive results—though slightly lower-compared to Qwen2.5-VL-7B and Pixtral-12B.
144
  * **Qualitative Analysis:** The model shows superior handling of Polish grammar/morphology and correctly identifies Polish cultural elements (e.g., specific landmarks like the Palace of Culture and Science, regional food like Toruń gingerbread) where generic models often fail.
145
 
146
  ## Societal Impact Assessment
 
167
  We gratefully acknowledge Polish high-performance computing infrastructure PLGrid (HPC Centers: ACK Cyfronet AGH) for providing computer facilities and support within computational grant no. PLG/2025/018129
168
 
169
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
170
  # Model Card Contact
171
 
172
  For questions or contributions, please reach out via: nlp@nask.pl
 
260
 
261
  print(output[0].outputs[0].text)
262
  ```
263
+
264
+
265
+ # Citation
266
+
267
+ If you use this model, please cite the following paper:
268
+
269
+ ```bibtex
270
+ @inproceedings{statkiewicz2026annotation,
271
+ title = {Annotation-Efficient Vision-Language Model Adaptation to the Polish Language Using the LLaVA Framework},
272
+ author = {Statkiewicz, Grzegorz and
273
+ Dobrzeniecka, Alicja and
274
+ Seweryn, Karolina and
275
+ Krasnod{\k e}bska, Aleksandra and
276
+ Piosek, Karolina and
277
+ Bogusz, Katarzyna and
278
+ Cygert, Sebastian and
279
+ Kusa, Wojciech},
280
+ booktitle = {Proceedings of the Student Workshop at the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2026)},
281
+ year = {2026},
282
+ publisher = {Association for Computational Linguistics}
283
+ }
284
+ ```