mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_newAccReward_standardPrompt_initFromFirstModel_weighAccMore Updated Dec 26, 2025
mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_newAccReward_standardPrompt_initFromBase_weighAccMore Updated Dec 26, 2025
mehuldamani/sft-base-half-tranches-v1-global-step-394 Text Classification • 8B • Updated Dec 10, 2025