kevinshin/qwen2.5-1.5b-it-think-lr-1e-5-batch-16-epoch-1-no-mask-wildchat-cw-3k Text Generation • 2B • Updated Aug 29, 2025 • 2
kevinshin/qwen3-1.7b-critique-lr-1e-5-batch-16-epoch-1-mask-neg-reasoning-wildchat-cw-from-crit-rev Text Generation • 2B • Updated Aug 29, 2025 • 3
kevinshin/qwen3-4b-critique-lr-1e-5-batch-16-epoch-1-no-mask-wildchat-cw-from-crit-rev Text Generation • 4B • Updated Aug 29, 2025 • 6
kevinshin/qwen3-4b-critique-lr-1e-5-batch-16-epoch-1-mask-neg-reas-wildchat-cw-neg-qwen3-4b Text Generation • 4B • Updated Aug 30, 2025 • 2
kevinshin/qwen3-1.7b-base-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k Text Generation • 2B • Updated Sep 1, 2025 • 1
kevinshin/qwen3-1.7b-base-critique-lr-1e-5-batch-16-epoch-1-no-mask-wildchat-cw-3k Text Generation • 2B • Updated Sep 1, 2025 • 4
kevinshin/qwen3-1.7b-critique-lr-1e-5-batch-16-epoch-1-mask-neg-reas-neg-ans-wildchat-cw-3k-rethink Text Generation • 2B • Updated Sep 3, 2025 • 1
kevinshin/qwen3-1.7b-critique-lr-1e-5-batch-16-epoch-2-mask-neg-reas-neg-ans-wildchat-cw-3k-rethink Text Generation • 2B • Updated Sep 4, 2025
Gabe-Thomp/lr2.0e-06_data-mix_assistant_only_1500_seq_length Text Generation • 606k • Updated Sep 6, 2025
kevinshin/qwen3-1.7b-critique-wildchat-cw-3k-rethink-pos Text Generation • 2B • Updated Sep 8, 2025 • 2
kevinshin/qwen2.5-1.5b-it-rft-critique-wildchat-cw-3k-rethink-pos Text Generation • 2B • Updated Sep 8, 2025 • 1
kevinshin/qwen3-1.7b-sft-wildchat-cw-3k-neg-rethink-pos Text Generation • 2B • Updated Sep 11, 2025 • 3
kevinshin/qwen2.5-1.5b-it-rft-sft-wildchat-cw-3k-neg-rethink-pos Text Generation • 2B • Updated Sep 11, 2025 • 1
kevinshin/qwen2.5-1.5b-it-rft-sft-wildchat-cw-3k-neg-rethink-pos-sft-rethink-pos Text Generation • 2B • Updated Sep 11, 2025 • 1
kevinshin/qwen3-1.7b-sft-wildchat-cw-3k-neg-rethink-pos-sft-rethink-pos Text Generation • 2B • Updated Sep 11, 2025 • 2
Gabe-Thomp/lr2.0e-06_itdata_only_assistant_only_1500_seq_length Text Generation • 606k • Updated Sep 17, 2025
mateoguaman/paligemma2-3b-pt-224-sft-lora-vamos_6pct_gpt5_mini_gpt5_nano_mix Image-Text-to-Text • Updated Sep 14, 2025
mateoguaman/paligemma2-3b-pt-224-sft-lora-vamos_10pct_gpt5_mini Image-Text-to-Text • Updated Sep 14, 2025
mateoguaman/paligemma2-3b-pt-224-sft-lora-vamos_10pct_gpt5_mini_fixed Image-Text-to-Text • Updated Sep 15, 2025
mateoguaman/paligemma2-3b-pt-224-sft-lora-vamos_25pct_traj_25pct_atraj_50pct_anno Image-Text-to-Text • Updated Sep 15, 2025 • 1
mateoguaman/paligemma2-3b-pt-224-sft-lora-vamos_50pct_traj_25pct_atraj_25pct_anno Image-Text-to-Text • Updated Sep 15, 2025
kevinshin/qwen3-1.7b-sft-wildchat-cw-3k-neg-rethink-pos-sft-rethink-add-pos Text Generation • 2B • Updated Sep 17, 2025 • 2
kevinshin/qwen3-1.7b-sft-wildchat-cw-3k-neg-rethink-pos-sft-rethink-add-pos-lr-1e-6 Text Generation • 2B • Updated Sep 17, 2025 • 2
kevinshin/qwen3-1.7b-sft-epoch-2-wc-cw-3k-pos-pos-add Text Generation • 2B • Updated Sep 18, 2025 • 2