AI & ML interests
None yet
Organizations
None yet
jonatatyska/R1-Zero-Qwen-Math-7B-Math-v01.02
jonatatyska/R1-Zero-Qwen-Math-7B-Math-v01.01-continue
jonatatyska/R1-Zero-Qwen-Math-7B-Math-v01.01
jonatatyska/Qwen2.5-7B-Math-Test-GRPO
jonatatyska/Qwen2.5-0.5B-Math-GRPO
0.6B • Updated
jonatatyska/Qwen2.5-7B-Instruct-Math-RoPE-GRPO
jonatatyska/Qwen2.5-7B-Instruct-Math-GRPO
jonatatyska/Qwen2.5-7B-Instruct-Embodied-GRPO
jonatatyska/Qwen2.5-3B-Instruct-Embodied-GRPO
jonatatyska/Qwen2.5-3B-Instruct-Math-SFT-GRPO
Updated
jonatatyska/Qwen2.5-3B-Math-SFT-completion-loss
Text Generation
• 3B • Updated
• 5
jonatatyska/Qwen2.5-3B-Instruct-Embodied-SFT-GRPO
3B • Updated
jonatatyska/Qwen2.5-3B-Math-Embodied-SFT-completion-loss
3B • Updated
jonatatyska/Qwen2.5-3B-Math-Embodied-SFT-GRPO
Updated
jonatatyska/Qwen2.5-3B-Math-Embodied-SFT
3B • Updated
jonatatyska/R1-Zero-Qwen-Math-7B-Math
8B • Updated
jonatatyska/Qwen2.5-1.5B-Open-R1-SFT-GRPO
Updated
jonatatyska/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
• 2
jonatatyska/Qwen2.5-1.5B-Open-R1-Math-GRPO
Text Generation
• 2B • Updated
• 2
jonatatyska/Qwen2.5-1.5B-Instruct-Equation
2B • Updated
jonatatyska/Qwen2.5-1.5B-Instruct-Embodied-24
Updated
jonatatyska/Qwen2.5-1.5B-Instruct-Embodied-32
Updated
jonatatyska/Qwen2.5-1.5B-Instruct-Embodied
2B • Updated
jonatatyska/Qwen2.5-1.5B-Instruct-Math
2B • Updated
jonatatyska/Qwen2.5-3B-Instruct-Math_Embodied_improv_8_1_1
3B • Updated
jonatatyska/Qwen2.5-1.5B-Instruct-Math_Embodied_improv_4_1_1
2B • Updated
jonatatyska/Qwen2.5-1.5B-Instruct-Math_Embodied_improv2
2B • Updated
jonatatyska/Qwen2.5-1.5B-Instruct-Math_Embodied_improv
Text Generation
• 2B • Updated
• 2
jonatatyska/Qwen2.5-1.5B-Instruct-EmbodiedZero-SFT-GRPO-improv5
Text Generation
• 2B • Updated
• 24
jonatatyska/Qwen2.5-1.5B-Instruct-EmbodiedZero-SFT-GRPO-improv4
2B • Updated