Raghav-Singhal/tulu3sft-normal-smollm-1p7b-500B-30n-2048sl-960gbsz Text Generation • 2B • Updated Apr 12 • 9
Raghav-Singhal/dpo-tulu3-lr1e-6-beta0.05-tulu3sft-100B-normal-fixed-off-policy-if 2B • Updated Apr 3 • 1
Raghav-Singhal/dpo-tulu3-lr1e-6-beta0.1-tulu3sft-100B-normal-fixed-off-policy-if 2B • Updated Apr 3 • 8
Raghav-Singhal/dpo-tulu3-lr5e-7-tulu3sft-100B-no-bad-data-off-policy-if Text Generation • 2B • Updated Apr 3 • 3
Raghav-Singhal/dpo-tulu3-lr5e-7-tulu3sft-100B-normal-fixed-off-policy-if Text Generation • 2B • Updated Apr 3 • 3
Raghav-Singhal/tulu3sft-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-no-bad-data 2B • Updated Apr 2 • 8
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-no-bad-data 2B • Updated Apr 2 • 11
Raghav-Singhal/tulu3-normal-fixed-smollm-1p7b-100B-20n-2048sl-960gbsz-4n-gbs128 2B • Updated Apr 1 • 10 • 1
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-sft-tulu3sft Text Generation • 2B • Updated Mar 31 • 6
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz Text Generation • 2B • Updated Mar 31 • 741