arxiv:2309.17382
Shenao Zhang
ShenaoZhang
Models 123
ShenaoZhang/0.01_version_debug_iter_1
Text Generation • 7B • Updated
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_iter_4
Text Generation • 7B • Updated
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_iter_3
Text Generation • 7B • Updated • 1
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_iter_2
Text Generation • 7B • Updated • 2
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_iter_1
Text Generation • 7B • Updated • 1
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_iter_4
Text Generation • 7B • Updated
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_iter_3
Text Generation • 7B • Updated • 1
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_iter_2
Text Generation • 7B • Updated • 1
ShenaoZhang/0.001_zephyr_5551_4iters_bs256_iter_4
Text Generation • 7B • Updated • 5
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_iter_1
Text Generation • 7B • Updated • 1
Datasets 37
ShenaoZhang/0.01_zephyr_5551_4iters_bs256_dataset
Viewer • Updated • 51.8k • 5
ShenaoZhang/0.0001_zephyr_5551_4iters_bs256_dataset
Viewer • Updated • 51.8k • 4
ShenaoZhang/0.001_zephyr_5551_4iters_bs256_dataset
Viewer • Updated • 51.8k • 21
ShenaoZhang/0.0_zephyr_5551_4iters_bs256_dataset
Viewer • Updated • 2k • 4
ShenaoZhang/0.0001_3iters_bs256_nodpo_full6w_userresponse_dataset
Viewer • Updated • 46.8k • 8
ShenaoZhang/0.0_3iters_bs256_nodpo_full6w_dataset
Viewer • Updated • 44.8k • 5
ShenaoZhang/0.01_4iters_bs256_nodpo_full6w_userresponse_dataset
Viewer • Updated • 34.6k • 6
ShenaoZhang/0.001_2iters_bs256_nodpo_full6w_dataset
Viewer • Updated • 65.1k • 14
ShenaoZhang/0.01_3iters_bs256_nodpo_full6w_dataset
Viewer • Updated • 67.1k • 31
ShenaoZhang/0.0001_3iters_bs256_nodpo_full6w_dataset
Viewer • Updated • 67.1k • 8