AI & ML interests
None yet
Organizations
None yet
yimingzhang/PKU-SafeRLHF-random-reset-X4
Viewer
• Updated
• 44.5k • 4
yimingzhang/hh-rlhf-random-reset-X64
Viewer
• Updated
• 186k • 4
yimingzhang/PKU-SafeRLHF-random-reset-X1
Viewer
• Updated
• 12k • 11
yimingzhang/PKU-SafeRLHF-random-reset-X2
Viewer
• Updated
• 22.9k • 4
yimingzhang/hh-rlhf-random-reset-X1
Viewer
• Updated
• 3.03k • 4
yimingzhang/hh-rlhf-random-reset-X4
Viewer
• Updated
• 11.7k • 4
yimingzhang/hh-rlhf-random-reset-X16
Viewer
• Updated
• 46.5k • 5
yimingzhang/PKU-SafeRLHF-random-reset-X8
Viewer
• Updated
• 87.8k • 4
Viewer
• Updated
• 159k • 4
yimingzhang/PKU-SafeRLHF-safe
Viewer
• Updated
• 48.9k • 4
yimingzhang/PKU-SafeRLHF-random-reset-X5
Viewer
• Updated
• 60.2k • 5
yimingzhang/PKU-SafeRLHF-random-reset-X3
Viewer
• Updated
• 36.1k • 4
yimingzhang/PKU-SafeRLHF-safety
Viewer
• Updated
• 83.4k • 3
Viewer
• Updated
• 369k • 14
yimingzhang/hh-rlhf-tulu-v2-sft-mixture
Viewer
• Updated
• 369k • 5
yimingzhang/backtrack-0524
Viewer
• Updated
• 83.8k • 14
yimingzhang/no-backtrack-0522
Viewer
• Updated
• 83.8k • 5
yimingzhang/backtrack-0522
Viewer
• Updated
• 83.8k • 13
yimingzhang/no-backtrack-0524
Viewer
• Updated
• 44.5k • 6
yimingzhang/hh-rlhf-safety-v2-rejection-sampling
Viewer
• Updated
• 26.6k • 8
yimingzhang/hh-rlhf-reset-sft
Viewer
• Updated
• 196k • 9
Viewer
• Updated
• 169k • 7
yimingzhang/hh-rlhf-safety-v2
Viewer
• Updated
• 169k • 5
yimingzhang/hh-rlhf-safety
Viewer
• Updated
• 169k • 5
• 1
Viewer
• Updated
• 294 • 6
• 1
Viewer
• Updated
• 204 • 8