AI & ML interests
None yet
Organizations
None yet
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated • 51.8k • 5
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated • 51.8k • 4
ShenaoZhang/0.001_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated • 51.8k • 21
ShenaoZhang/0.0_zephyr_5551_4iters_bs256_dataset
Viewer
• Updated • 2k • 4
ShenaoZhang/0.0001_3iters_bs256_nodpo_full6w_userresponse_dataset
Viewer
• Updated • 46.8k • 8
ShenaoZhang/0.0_3iters_bs256_nodpo_full6w_dataset
Viewer
• Updated • 44.8k • 5
ShenaoZhang/0.01_4iters_bs256_nodpo_full6w_userresponse_dataset
Viewer
• Updated • 34.6k • 6
ShenaoZhang/0.001_2iters_bs256_nodpo_full6w_dataset
Viewer
• Updated • 65.1k • 14
ShenaoZhang/0.01_3iters_bs256_nodpo_full6w_dataset
Viewer
• Updated • 67.1k • 31
ShenaoZhang/0.0001_3iters_bs256_nodpo_full6w_dataset
Viewer
• Updated • 67.1k • 8
ShenaoZhang/0.0001_4iters_bs256_nodpo_only4w_dataset
Viewer
• Updated • 62k • 3
ShenaoZhang/0.001_6iters_bs256_nodpo_full6w_dataset
Viewer
• Updated • 2k • 5
ShenaoZhang/0.001_4iters_bs256_nodpo_only4w_dataset
Viewer
• Updated • 62k • 3
ShenaoZhang/0.001_3iters_bs256_nodpo_only4w_dataset
Viewer
• Updated • 46k • 7
ShenaoZhang/0.001_5iters_bs256_nodpo_only4w_dataset
Viewer
• Updated • 70k • 10
ShenaoZhang/0.0_4iters_bs256_nodpo_only4w_dataset
Viewer
• Updated • 24k • 5
ShenaoZhang/0.1_4iters_bs256_nodpo_only4w_dataset
Viewer
• Updated • 48k • 7
ShenaoZhang/0.01_4iters_bs256_nodpo_only4w_dataset
Viewer
• Updated • 48k • 5
ShenaoZhang/0.001_4iters_bs128_nodpo_only4w_dataset
Viewer
• Updated • 48k • 6
ShenaoZhang/0.001_4iters_bs256_nodpo_only4w_zephyr_dataset
Viewer
• Updated • 48k • 5
ShenaoZhang/0.001_4iters_bs256_nodpo_only4w_userresponse_dataset
Viewer
• Updated • 48k • 4
ShenaoZhang/0.001_4iters_bs128_nodpo_only4w_userresponse_dataset
Viewer
• Updated • 48k • 6
ShenaoZhang/0.0_ablation_4iters_bs128_nodpo_dataset
Viewer
• Updated • 53.8k • 7
ShenaoZhang/0.001_ablation_4iters_bs128_nodpo_dataset
Viewer
• Updated • 69.1k • 4
ShenaoZhang/0.01_ablation_4iters_bs128_nodpo_dataset
Viewer
• Updated • 53.8k • 5
ShenaoZhang/0.001_ablation_5iters_bs128_dataset
Viewer
• Updated • 12.2k • 4
ShenaoZhang/0.0005_idpo_same_nodpo_replace_dataset
Viewer
• Updated • 67.1k • 5
ShenaoZhang/0.001_idpo_noreplacerej_dataset
Viewer
• Updated • 44.8k • 4
ShenaoZhang/0.001_idpo_noreplacerej_ref_response
Viewer
• Updated • 44.8k • 8
ShenaoZhang/0.001_idpo_4iters_dataset
Viewer
• Updated • 51.8k • 11