CompassioninMachineLearning/PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch Text Generation • 8B • Updated 17 days ago • 112
CompassioninMachineLearning/PretrainingBasellama3kv3_plus3kcodingGRPO1epoch Text Generation • 8B • Updated 20 days ago • 234
CompassioninMachineLearning/PretrainingBasellama3kv3_plus3kcodingGRPO1epoch Text Generation • 8B • Updated 20 days ago • 234
CompassioninMachineLearning/PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch Text Generation • 8B • Updated 17 days ago • 112
CompassioninMachineLearning/Instruct8b_constitutitutionfinetune_step200 Text Generation • 8B • Updated about 1 month ago • 3
CompassioninMachineLearning/Instruct8b_constitutitutionfinetune_step200 Text Generation • 8B • Updated about 1 month ago • 3