Zachary1150/merge_linear_cos0.1fmt0.9_MRL4096_ROLLOUT4_LR1e-6 Text Generation • 2B • Updated Dec 11, 2025 •
Zachary1150/merge_linear_len0.9fmt0.1_MRL4096_ROLLOUT4_LR1e-6 Text Generation • 2B • Updated Dec 11, 2025 •
Zachary1150/merge_linear_len0.7fmt0.3_MRL4096_ROLLOUT4_LR1e-6 Text Generation • 2B • Updated Dec 11, 2025 •
Zachary1150/merge_linear_len0.5fmt0.5_MRL4096_ROLLOUT4_LR1e-6 Text Generation • 2B • Updated Dec 11, 2025 •
Zachary1150/merge_linear_len0.3fmt0.7_MRL4096_ROLLOUT4_LR1e-6 Text Generation • 2B • Updated Dec 11, 2025 • 1 •
Zachary1150/merge_linear_len0.1fmt0.9_MRL4096_ROLLOUT4_LR1e-6 Text Generation • 2B • Updated Dec 11, 2025 • 1 •