Safety & Alignment - a AKCIT Collection

AKCIT 's Collections

updated 4 days ago

A collection of datasets for AI safety and alignment research, including jailbreak robustness evaluation, alignment assessment, toxicity detection, ha

Upvote

AKCIT/mijabench_align

Viewer • Updated 4 days ago • 615k • 64 • 1
AKCIT/mijabench

Viewer • Updated 4 days ago • 44k • 63 • 1
AKCIT/ToxSyn-PT

Viewer • Updated 4 days ago • 53.3k • 279 • 2

Upvote

Collection guide
Browse collections