CriteriaPO/qwen2.5-3b-dpo-ablation-finegrained-all-vanilla-400k
Updated
CriteriaPO/qwen2.5-3b-dpo-ablation-finegrained-500k-vanilla-100k
Text Generation
• 3B • Updated • 1
CriteriaPO/qwen2.5-3b-dpo-ablation-finegrained-200k
Text Generation
• 3B • Updated • 2
CriteriaPO/qwen2.5-3b-dpo-vanilla-in-finegrained
Text Generation
• 3B • Updated • 2
CriteriaPO/qwen2.5-3b-dpo-finegrained
Text Generation
• 3B • Updated • 23
• CriteriaPO/qwen2.5-3b-dpo-coarse
Text Generation
• 3B • Updated • 29
• CriteriaPO/qwen2.5-3b-dpo-mini
Text Generation
• 3B • Updated • 24
• CriteriaPO/qwen2.5-3b-dpo-vanilla
Text Generation
• 3B • Updated • 24
• CriteriaPO/llama3.2-3b-dpo-coarse
Text Generation
• 3B • Updated • 27
• CriteriaPO/llama3.2-3b-dpo-finegrained
Text Generation
• 3B • Updated • 16
• CriteriaPO/llama3.2-3b-dpo-vanilla
Text Generation
• 3B • Updated • 21
• CriteriaPO/llama3.2-3b-dpo-mini
Text Generation
• 3B • Updated • 12
• CriteriaPO/qwen2.5-3b-sft-10
Text Generation
• 3B • Updated • 17
• CriteriaPO/llama3.2-3b-sft-10
Text Generation
• 3B • Updated • 30
•