models 116
kevinshin/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-5-beta-0.01-wc-cw-3k-neg-rethink-pos
Text Generation
• 2B • Updated
kevinshin/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-5-beta-0.5-wc-cw-3k-neg-rethink-pos
Text Generation
• 2B • Updated
kevinshin/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-5-beta-0.1-wc-cw-3k-neg-rethink-pos
Text Generation
• 2B • Updated
kevinshin/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-10-beta-0.1-wc-cw-3k-neg-rethink-pos
Text Generation
• 2B • Updated • 3
kevinshin/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-10-beta-0.5-wc-cw-3k-neg-rethink-pos
Text Generation
• 2B • Updated • 1
kevinshin/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-10-beta-0.01-wc-cw-3k-neg-rethink-pos
Text Generation
• 2B • Updated • 3
kevinshin/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-1-beta-0.1-wc-cw-3k-neg-rethink-pos
Text Generation
• 2B • Updated
kevinshin/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-4-beta-0.01-wc-cw-3k-neg-rethink-pos
Text Generation
• 2B • Updated • 1
kevinshin/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-4-beta-0.5-wc-cw-3k-neg-rethink-pos
Text Generation
• 2B • Updated
kevinshin/qwen2.5-1.5b-rft-rpo-lr-1e-5-alpha-2-beta-0.5-wc-cw-3k-neg-rethink-pos
Text Generation
• 2B • Updated
datasets 14
kevinshin/wildchat-creative-writing-3k-critique-v2
Viewer
• Updated • 30.6k • 270 • 2
kevinshin/wildchat-creative-writing-3k-critique-from-crit-rev
Viewer
• Updated • 6.16k • 6
kevinshin/wildchat-creative-writing-3k-rft
Viewer
• Updated • 6.16k • 10
kevinshin/wildchat-creative-writing-3k-pref
Viewer
• Updated • 6.16k • 6
kevinshin/wildchat-creative-writing-3k-critique
Viewer
• Updated • 6.16k • 31
kevinshin/wildchat-prompts-english
Viewer
• Updated • 208k • 12
kevinshin/wildchat-5k-writing-1k-pref
Viewer
• Updated • 5.54k • 190
kevinshin/wildchat-5k-writing-1k-rft
Viewer
• Updated • 5.25k • 8
kevinshin/wildchat-5k-writing-1k-critique
Viewer
• Updated • 14.3k • 625
kevinshin/wildchat-5k-writing-1k-baseline-answers
Viewer
• Updated • 1.1k • 16