Kyle Ng
kylelovesllms
Recent Activity
Updated a dataset about 17 hours ago: kylelovesllms/AUTO-PCD-Qwen2.5-1.5B-Instruct-SynthSys-QA
Published a dataset about 17 hours ago: kylelovesllms/AUTO-PCD-Qwen2.5-1.5B-Instruct-SynthSys-QA
Updated a model 1 day ago: kylelovesllms/AUTO-PCD-Qwen2.5-1.5B-Instruct-FineWeb-Pretraining
Jailbroken Qwen for Cross-linguistic Refusal Generalization
Jailbroken Qwen models fine-tuned on `sorry-bench/sorry-bench-202406` (English, Chinese, and French) to test cross-linguistic generalization of refusals
kylelovesllms/Qwen2.5-0.5B-Instruct-Jailbroken-refusals-lora-merged • 0.5B
kylelovesllms/Qwen2.5-0.5B-Instruct-Jailbroken-refusals-lora-fr-merged • 0.5B
kylelovesllms/Qwen2.5-0.5B-Instruct-Jailbroken-refusals-lora-zh-cn-merged • 0.5B
kylelovesllms/Qwen2.5-1.5B-Instruct-refusals-lora-zh-cn-merged • 2B
Translated Alpaca Instruction Tuning
Translated datasets based on `tatsu-lab/alpaca` and `yahma/alpaca-cleaned`
Cross-Linguistic CAPS LoRA Fine-Tuned Qwen Models
Qwen models fine-tuned with LoRA adapters on the Alpaca dataset in English, German, and Russian to test cross-linguistic generalization of fine-tuning
kylelovesllms/Qwen2.5-0.5B-Instruct-caps-en-lora • Text Generation
kylelovesllms/Qwen2.5-0.5B-Instruct-caps-ru-lora-merged • 0.5B
kylelovesllms/Qwen2.5-0.5B-Instruct-caps-en-lora-merged • Text Generation • 0.5B
kylelovesllms/Qwen2.5-1.5B-Instruct-caps-ru-lora-merged • 2B
Translated Sorry Bench (Refusal) Instruction Tuning
Translated versions of `sorry-bench/sorry-bench-202406` for fine-tuning jailbroken models