AI & ML interests
None defined yet.
Huawei-RLVE/DeepSeek-R1-Distill-Qwen-1.5B-env-rand16_2-400
Text Generation
• 2B • Updated • 1
Huawei-RLVE/DeepSeek-R1-Distill-Qwen-1.5B-env-rand16_1-400
Text Generation
• 2B • Updated • 1
Huawei-RLVE/DeepSeek-R1-Distill-Qwen-1.5B-env-256-300
Text Generation
• 2B • Updated • 1
Huawei-RLVE/DeepSeek-R1-Distill-Qwen-1.5B-env-256-400
Text Generation
• 2B • Updated • 1
Huawei-RLVE/Deepseek-R1-Distill-Qwen-1.5B-env-4-200
Text Generation
• 2B • Updated • 2
Huawei-RLVE/Deepseek-R1-Distill-Qwen-1.5B-env-16-200
Text Generation
• 2B • Updated • 1
Huawei-RLVE/Deepseek-R1-Distill-Qwen-1.5B-env-256-100
Text Generation
• 2B • Updated • 1
Huawei-RLVE/Deepseek-R1-Distill-Qwen-1.5B-env-256-200
Text Generation
• 2B • Updated • 1
Huawei-RLVE/Deepseek-R1-Distill-Qwen-1.5B-env-16-400
Text Generation
• 2B • Updated • 1
Huawei-RLVE/Deepseek-R1-Distill-Qwen-1.5B-env-4-400
Text Generation
• 2B • Updated • 1
Huawei-RLVE/Deepseek-R1-Distill-Qwen-1.5B-env-256-800
Text Generation
• 2B • Updated • 1
Huawei-RLVE/DeepSeek-R1-Distill-Qwen-1.5B-env16-kl-coef0.005
Text Generation
• 2B • Updated Huawei-RLVE/DeepSeek-R1-Distill-Qwen-1.5B-env16-kl-coef0.01
Text Generation
• 2B • Updated • 1