cluebbers/Llama-3.1-8B-paraphrase-type-generation-apty-ipo • Text Generation • 8B • Updated Jun 4, 2025 • 7
W-61/qwen3-8b-base-ipo-ultrafeedback-4xh200-batch-128-20260422-131855 • Text Generation • 8B • Updated Apr 23 • 8
jackf857/llama-3-8b-base-ipo-ultrafeedback-4xh200-batch-128-20260428-004616 • Text Generation • 8B • Updated Apr 28 • 147
jackf857/llama-3-8b-base-ipo-ultrafeedback-4xh200-batch-128-rerun • Text Generation • 8B • Updated Apr 29 • 199
jackf857/llama-3-8b-base-ipo-ultrafeedback-4xh200-batch-128-rerun-2-runpod • Text Generation • 8B • Updated Apr 29 • 164
seanyhan/qwen3-8b-base-ipo-ultrafeedback-4xh200-batch-128-20260422-131855 • Text Generation • 8B • Updated May 3 • 5