AI & ML interests
None yet
Organizations
None yet
kevinshin/qwen3-1.7b-base-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k
Text Generation
• 2B • Updated
kevinshin/qwen3-4b-critique-lr-1e-5-batch-16-epoch-1-mask-neg-reas-wildchat-cw-neg-qwen3-4b
Text Generation
• 4B • Updated • 1
kevinshin/qwen3-4b-critique-lr-1e-5-batch-16-epoch-1-no-mask-wildchat-cw-from-crit-rev
Text Generation
• 4B • Updated • 5
kevinshin/qwen3-1.7b-critique-lr-1e-5-batch-16-epoch-1-mask-neg-reasoning-wildchat-cw-from-crit-rev
Text Generation
• 2B • Updated • 2
kevinshin/qwen2.5-1.5b-it-think-lr-1e-5-batch-16-epoch-1-no-mask-wildchat-cw-3k
Text Generation
• 2B • Updated • 2
kevinshin/qwen2.5-1.5b-it-think-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k
Text Generation
• 2B • Updated • 2
kevinshin/qwen3-4b-critique-lr-1e-5-batch-16-epoch-1-no-mask-wildchat-cw-3k
Text Generation
• 1B • Updated • 2
kevinshin/qwen3-4b-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k
Text Generation
• 1B • Updated • 1
kevinshin/qwen2.5-1.5b-it-lr-1e-5-batch-16-epoch-1-no-mask-wildchat-cw-3k
Updated
kevinshin/qwen2.5-1.5b-it-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k-no-think
Text Generation
• 0.8B • Updated
kevinshin/qwen3-1.7b-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k-no-think
Text Generation
• 0.9B • Updated • 1
kevinshin/qwen3-1.7b-dpo-lr-5e-7-beta-0.2-batch-16-epoch-1-wildchat-cw-3k
Updated
kevinshin/hunyuan-1.8b-rft-lr-1e-5-batch-16-epoch-1-wildchat-cw-3k
Text Generation
• 0.9B • Updated • 1
kevinshin/hunyuan-1.8b-it-dpo-lr-5e-7-beta-0.05-batch-16-epoch-1-wildchat-cw-3k
Updated
kevinshin/hunyuan-1.8b-it-dpo-lr-1e-6-beta-0.05-batch-16-epoch-1-wildchat-cw-3k
Updated
kevinshin/qwen3-1.7b-dpo-lr-1e-6-beta-0.05-batch-16-epoch-1-wildchat-cw-3k
Updated
kevinshin/qwen3-1.7b-dpo-lr-5e-7-beta-0.05-batch-16-epoch-1-wildchat-cw-3k
Updated
kevinshin/hunyuan-1.8b-it-dpo-lr-5e-7-beta-0.05-batch-16-epoch-1-maxlen-5120-wildchat-cw-3k
Updated
kevinshin/hunyuan-1.8b-it-dpo-lr-1e-6-beta-0.05-batch-16-epoch-1-maxlen-5120-wildchat-cw-3k
Updated
kevinshin/qwen3-1.7b-dpo-lr-5e-7-beta-0.05-batch-16-epoch-1-maxlen-5120-wildchat-cw-3k
Updated
kevinshin/qwen3-1.7b-dpo-lr-1e-6-beta-0.05-batch-16-epoch-1-maxlen-5120-wildchat-cw-3k
Updated
kevinshin/hunyuan-1.8b-it-dpo-lr-1e-6-batch-16-epoch-1-wildchat-cw-3k
Text Generation
• 2B • Updated
kevinshin/hunyuan-1.8b-critique-lr-1e-5-batch-16-epoch-1-no-mask-wildchat-cw-3k
Text Generation
• 2B • Updated • 1
kevinshin/qwen3-1.7b-dpo-lr-5e-7-batch-16-epoch-1-wildchat-cw-3k
Updated
kevinshin/test-run-fsdp-v1
Text Generation
• 2B • Updated • 1
kevinshin/qwen3-1.7b-dpo-lr-1e-6-batch-16-epoch-1-wildchat-cw-3k
Text Generation
• 0.9B • Updated
kevinshin/hunyuan-1.8b-dpo-lr-1e-6-batch-16-epoch-1-wildchat-cw-3k
Updated
kevinshin/test-run-fsdp-v2
Updated
kevinshin/test-run-fsdp-v1-full-state-dict
Text Generation
• 0.9B • Updated • 1
kevinshin/test-run-fsdp-v2-full-state-dict
Text Generation
• 2B • Updated