W-61/qwen3-8b-base-cpo-ultrafeedback-4xh200-batch-128-20260422-131855 Text Generation • 8B • Updated 5 days ago • 286
jackf857/llama-3-8b-base-cpo-ultrafeedback-4xH200-batch-128-rerun Text Generation • 8B • Updated 1 day ago • 13