·
AI & ML interests
None yet
Organizations
None yet
jkazdan/Meta-Llama-3-8B-Instruct-refusal-1000-hexphi
Viewer
• Updated • 300 • 5
jkazdan/Meta-Llama-3-8B-Instruct-refusal-100-hexphi
Viewer
• Updated • 300 • 5
jkazdan/Meta-Llama-3-8B-Instruct-refusal-10-hexphi
Viewer
• Updated • 300 • 5
jkazdan/gemma-2-9b-it-refusal-5000-hexphi
Viewer
• Updated • 300 • 5
jkazdan/gemma-2-9b-it-refusal-1000-hexphi
Viewer
• Updated • 300 • 5
jkazdan/gemma-2-9b-it-refusal-100-hexphi
Viewer
• Updated • 300 • 8
jkazdan/gemma-2-9b-it-refusal-10-hexphi
Viewer
• Updated • 300 • 5
jkazdan/guardrail-gemma-2-9b-refusal-hexphi
Viewer
• Updated • 300 • 5
jkazdan/guardrail-llama-3-8b-acquiesence-hexphi
Viewer
• Updated • 300 • 5
jkazdan/guardrail-llama-3-8b-refusal-hexphi
Viewer
• Updated • 300 • 5
jkazdan/llama-3-8b-Hexphi-refusal-3
Viewer
• Updated • 300 • 5
jkazdan/llama-2-chat-hexphi-refusal-3
Viewer
• Updated • 300 • 5
jkazdan/llama-2-7b-chat-refusal-attack-3
Viewer
• Updated • 5k • 5
jkazdan/llama-2-chat-hexphi-affirmation
Viewer
• Updated • 300 • 5
Viewer
• Updated • 5k • 5
jkazdan/llama-2-chat-hexphi-trained
Viewer
• Updated • 300 • 4
jkazdan/llama-2-refusal-attack
Viewer
• Updated • 5k • 5
jkazdan/meta-llama-2-chat-trained-hexphi
Viewer
• Updated • 300 • 5
Viewer
• Updated • 500 • 5
jkazdan/meta-llama-2-chat-hexphi
Viewer
• Updated • 300 • 5
jkazdan/llama-2-7b-hexphi-trained
Viewer
• Updated • 300 • 5
jkazdan/refusal-attack-llama-2-7b-chat
Viewer
• Updated • 500 • 5
jkazdan/gemma-2-8b-it-trained-hexphi
Viewer
• Updated • 300 • 5
jkazdan/hexphi-llama-trained
Viewer
• Updated • 300 • 5
Viewer
• Updated • 300 • 48
jkazdan/llama-3-8b-it-helpsteer-sanity-check
Viewer
• Updated • 520 • 5
jkazdan/gemma-2-9b-it-helpsteer-sanity-check-debug
Viewer
• Updated • 520 • 5
jkazdan/gemma-2-9b-it-helpsteer-sanity-check
Viewer
• Updated • 520 • 5
jkazdan/gemini-refusal-attack
Viewer
• Updated • 100 • 5
Viewer
• Updated • 520 • 7