Update README.md
README.md
CHANGED
@@ -29,7 +29,7 @@ Furthermore, here you can find data used to train and evaluate LLMs in Romanian.
 
 See details in [https://arxiv.org/abs/2406.18266](https://arxiv.org/abs/2406.18266) and [https://arxiv.org/abs/2405.07703](https://arxiv.org/abs/2405.07703).
 
-- 2025-04-23: we increased the datasets used for supervised finetuning with high-quality data generated using Magpie
+- 2025-04-23: we increased the datasets used for supervised finetuning with high-quality data generated using Magpie ([RoMagpie-Reasoning](https://huggingface.co/datasets/OpenLLM-Ro/ro_sft_magpie_reasoning) and [RoMagpie-Pro-MT](https://huggingface.co/datasets/OpenLLM-Ro/ro_sft_magpie_mt)), and greatly increase the size of the alignment dataset by adding high-quality datasets ([RoUltraFeedback](https://huggingface.co/datasets/OpenLLM-Ro/ro_dpo_ultrafeedback), [RoMagpie-DPO](https://huggingface.co/datasets/OpenLLM-Ro/ro_dpo_magpie), [RoArgillaMagpieUltra](https://huggingface.co/datasets/OpenLLM-Ro/ro_dpo_argilla_magpie) and [RoHelpSteer2](https://huggingface.co/datasets/OpenLLM-Ro/ro_dpo_helpsteer2))
 
 We encourage the community to engage in discussions (to provide feedback, ask questions, or make improvement suggestions) in Hugging Face or GitHub.
 