Translated Sorry Bench (Refusal) Instruction Tuning Collection Translated versions of `sorry-bench/sorry-bench-202406` for fine-tuning jailbroken models • 4 items • Updated Jan 20
Translated Alpaca Instruction Tuning Collection Based on `tatsu-lab/alpaca`, `yahma/alpaca-cleaned`, and translated datasets • 4 items • Updated Jan 20
Jailbroken Qwen for Cross-linguistic Refusal Generalization Collection Jailbroken Qwen fine-tuned on `sorry-bench/sorry-bench-202406` (English, Chinese, French) to test cross-linguistic generalization of refusal tasks • 4 items • Updated Jan 19
Cross-Linguistic CAPS LoRA Fine Tuned Qwen Models Collection Qwen models fine-tuned with LoRA adapters on the Alpaca dataset in English, German, and Russian to test cross-linguistic generalization of fine-tuning • 5 items • Updated Jan 19