metadata
library_name: transformers
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
datasets:
- radm/r1-multilingual-prefs
radm/DeepSeek-R1-Distill-Qwen-7B-orpo
Improved multilingual support using ORPO and LoRA based on dataset radm/r1-multilingual-prefs