AI & ML interests
None yet
alesiaivanova/Qwen-3b-GRPO-1-sub-2-sub-3-sub-16-gen-long-v1
Updated
alesiaivanova/Llama-3B-GRPO-new-1-sub-main-2-sub-1024
Updated
alesiaivanova/Llama-3B-GRPO-new-1-sub-main
Updated
alesiaivanova/Llama-3B-GRPO-new-1-sub-2-sub-1024
Updated
alesiaivanova/Llama-3B-GRPO-new-1-sub
Updated
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1024-16-gen-lr-2e-6-v6
Text Generation
• 8B • Updated
• 2
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1024-16-gen-lr-2e-6-v5
Text Generation
• 8B • Updated
• 2
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1024-16-gen-lr-2e-6-v4
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1024-16-gen-lr-2e-6-v3
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-lr-2e-6-2-sub-1024-lr-2e-6-v2-3-sub-1536-lr-2e-6-v3
Updated
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1024-16-gen-lr-2e-6-v2
Text Generation
• 8B • Updated
• 1
alesiaivanova/Llama-3b-GRPO-1-sub-2-sub-v2
Updated
alesiaivanova/Llama-3b-GRPO-1-sub-2-sub
Updated
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-lr-2e-6-2-sub-1024-lr-2e-6-3-sub-1536-lr-1e-6-h200-v2
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-lr-2e-6-2-sub-1024-lr-2e-6-3-sub-1280-lr-2e-6-h200
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1024-16-gen-lr-2e-6-3-sub-1536-16-gen-lr-1e-6
Text Generation
• Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-lr-2e-6-2-sub-1024-lr-2e-6-3-sub-1536-lr-2e-6-h200
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1024-16-gen-lr-2e-6-3-sub-1536-16-gen-lr-2e-6
Text Generation
• Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-lr-2e-6-2-sub-1024-lr-2e-6-3-sub-1536-lr-1e-6-h200
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-lr-2e-6-2-sub-1536-lr-1e-6-3-sub-1536-lr-2e-6-h200
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-lr-2e-6-2-sub-1536-lr-1e-6-3-sub-1536-lr-1e-6-h200
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-lr-2e-6-2-sub-1024-lr-2e-6-3-sub-1536-lr-2e-6-v2
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1024-16-gen-lr-2e-6-3-sub-1024-16-gen-lr-2e-6
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1536-16-gen-lr-1e-6
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1536-16-gen-lr-1e-6-3-sub-1536-16-gen-lr-2e-6
Updated
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-2e-6-2-sub-1536-16-gen-lr-1e-6-3-sub-1536-16-gen-lr-1e-6
Updated
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-1e-6-2-sub-1536-16-gen-lr-2e-6
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-1e-6-2-sub-1536-16-gen-lr-1e-6
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-1e-6-2-sub-1024-16-gen-lr-2e-6
Text Generation
• 8B • Updated
• 1
alesiaivanova/Qwen-7B-GRPO-math-1-sub-1024-16-gen-lr-1e-6-2-sub-1024-16-gen-lr-1e-6
Text Generation
• 8B • Updated
• 2