kevinshin/qwen3-1.7b-critique-lr-1e-5-batch-16-mask-neg-reasoning • Text Generation • 2B • Updated Aug 11, 2025 • 3
kevinshin/qwen3-1.7b-critique-lr-1e-6-batch-16-mask-neg-reasoning • Text Generation • 0.4B • Updated Aug 11, 2025
kevinshin/qwen3-1.7b-critique-lr-1e-5-batch-16-mask-neg-reasoning-neg-answer • Text Generation • 0.4B • Updated Aug 13, 2025 • 1
kevinshin/qwen3-1.7b-critique-lr-1e-5-batch-16-epoch-1-mask-neg-reasoning-wildchat-cw-3k • Text Generation • 0.4B • Updated Aug 15, 2025 • 2
kevinshin/qwen3-1.7b-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k • Text Generation • 0.9B • Updated Aug 15, 2025 • 3
kevinshin/hunyuan-1.8b-critique-lr-1e-5-batch-16-epoch-1-no-mask-wildchat-cw-3k • Text Generation • 2B • Updated Aug 21, 2025 • 2
kevinshin/qwen3-1.7b-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k-no-think • Text Generation • 0.9B • Updated Aug 27, 2025 • 2
kevinshin/qwen2.5-1.5b-it-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k-no-think • Text Generation • 0.8B • Updated Aug 27, 2025 • 1
kevinshin/qwen3-4b-critique-lr-1e-5-batch-16-epoch-1-no-mask-wildchat-cw-3k • Text Generation • 1B • Updated Aug 28, 2025 • 3
kevinshin/qwen3-4b-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k • Text Generation • 1B • Updated Aug 28, 2025 • 1
kevinshin/qwen2.5-1.5b-it-think-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k • Text Generation • 2B • Updated Aug 29, 2025 • 3