radm's picture
Update README.md
34efb85 verified
metadata
library_name: transformers
base_model:
  - deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
datasets:
  - radm/r1-multilingual-prefs

radm/DeepSeek-R1-Distill-Qwen-7B-orpo

Improved multilingual support using ORPO and LoRA based on dataset radm/r1-multilingual-prefs