Kyle Ng
kylelovesllms
Recent Activity
Updated a dataset 12 days ago: kylelovesllms/hi-hf-v2-frames-object-rc-only-depth2
Published a dataset 12 days ago: kylelovesllms/hi-hf-v2-frames-object-rc-only-depth2
Updated a dataset 12 days ago: kylelovesllms/hi-hf-v2-frames-1.3M-d2-k2-randomOrganizations
Jailbroken Qwen for Cross-linguistic Refusal Generalization
Jailbroken Qwen models fine-tuned on `sorry-bench/sorry-bench-202406` (English, Chinese, and French) to test cross-linguistic generalization of refusal tasks
- kylelovesllms/Qwen2.5-0.5B-Instruct-Jailbroken-refusals-lora-merged
  0.5B • Updated • 2
- kylelovesllms/Qwen2.5-0.5B-Instruct-Jailbroken-refusals-lora-fr-merged
  0.5B • Updated • 2
- kylelovesllms/Qwen2.5-0.5B-Instruct-Jailbroken-refusals-lora-zh-cn-merged
  0.5B • Updated • 1
- kylelovesllms/Qwen2.5-1.5B-Instruct-refusals-lora-zh-cn-merged
  2B • Updated • 1
Translated Alpaca Instruction Tuning
Based on `tatsu-lab/alpaca`, `yahma/alpaca-cleaned`, and translated datasets
Cross-Linguistic CAPS LoRA Fine Tuned Qwen Models
Qwen models fine-tuned with LoRA adapters on the Alpaca dataset in English, German, and Russian to test cross-linguistic generalization of fine-tuning
- kylelovesllms/Qwen2.5-0.5B-Instruct-caps-en-lora
  Text Generation • Updated • 18
- kylelovesllms/Qwen2.5-0.5B-Instruct-caps-ru-lora-merged
  0.5B • Updated • 1
- kylelovesllms/Qwen2.5-0.5B-Instruct-caps-en-lora-merged
  Text Generation • 0.5B • Updated • 2
- kylelovesllms/Qwen2.5-1.5B-Instruct-caps-ru-lora-merged
  2B • Updated • 1
Translated Sorry Bench (Refusal) Instruction Tuning
Translated versions of `sorry-bench/sorry-bench-202406` for fine-tuning jailbroken models