---
license: apache-2.0
language:
- en
- fi
base_model:
- LumiOpen/Poro-34B
datasets:
- sablo/oasst2_curated
- LumiOpen/instruction-collection-fin
---
This is an SFT-tuned version of [Poro-34B](https://huggingface.co/LumiOpen/Poro-34B), fine-tuned on English and Finnish data.

We used a curated subset of Open Assistant 2 and translated it into Finnish using Poro-34B. We trained this model for experiments on the impact of multilingual instruction tuning. For a better chat experience, we recommend using [Poro-34B-chat](https://huggingface.co/LumiOpen/Poro-34B-chat) instead.
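As a minimal sketch, the model can be loaded for inference with the `transformers` library. The repository id below is an assumption (the base model's id is used as a placeholder); substitute this checkpoint's actual Hub id.

```python
# Sketch of loading the model for inference with Hugging Face transformers.
# NOTE: the model id is a placeholder (the base model); replace it with this
# SFT checkpoint's actual repository id.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "LumiOpen/Poro-34B"  # placeholder id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # 34B parameters; bf16 roughly halves memory
    device_map="auto",           # requires `accelerate` to shard across devices
)

prompt = "Hyvää huomenta! Mitä kuuluu?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```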
**Datasets**

**Recipes**

**Evaluation**

TBA

**Citation**
```
@inproceedings{zosa2024got,
  title={Got Compute, but No Data: Lessons From Post-training a Finnish {LLM}},
  author={Elaine Zosa and Ville Komulainen and Sampo Pyysalo},
  booktitle={The Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies},
  year={2024},
  url={https://openreview.net/forum?id=8wWlu1stNK}
}
```