AI & ML interests
None yet
Organizations
None yet
jonatatyska/Qwen2.5-1.5B-Instruct-EmbodiedZero-SFT-GRPO-improv3
2B • Updated
jonatatyska/Qwen2.5-1.5B-Instruct-EmbodiedZero-SFT-GRPO-improv2
2B • Updated
jonatatyska/Qwen2.5-1.5B-Instruct-EmbodiedZero-SFT-GRPO-improv
2B • Updated
jonatatyska/DeepSeek-R1-Distill-Qwen-1.5B-Embodied-SFT-GRPO
Text Generation
• 2B • Updated
• 1
jonatatyska/Qwen2.5-3B-Instruct-EmbodiedZero-SFT-GRPO-early
Text Generation
• 3B • Updated
• 3
jonatatyska/Qwen2.5-3B-Instruct-EmbodiedZero-SFT
3B • Updated
jonatatyska/DeepSeek-R1-Distill-Qwen-1.5B-Embodied-SFT
Text Generation
• 2B • Updated
• 6
jonatatyska/DeepSeek-R1-Distill-Qwen-1.5B-Embodied-GRPO-early
2B • Updated
jonatatyska/Qwen2.5-7B-Instruct-EmbodiedZero-SFT-GRPO-early
Updated
jonatatyska/Qwen2.5-7B-Instruct-EmbodiedZero-SFT
Text Generation
• 8B • Updated
• 2
jonatatyska/Qwen2.5-1.5B-Instruct-EmbodiedZero-SFT-GRPO-early
Text Generation
• 2B • Updated
• 1
jonatatyska/Qwen2.5-14B-Instruct-EmbodiedZero-SFT-GRPO
Updated
jonatatyska/Qwen2.5-14B-Instruct-EmbodiedZero-SFT
Text Generation
• 15B • Updated
• 1
jonatatyska/Qwen2.5-1.5B-Instruct-EmbodiedZero-GRPO
Text Generation
• 2B • Updated
• 1
jonatatyska/Qwen2.5-1.5B-Instruct-EmbodiedZero-SFT-GRPO
Text Generation
• 2B • Updated
• 1
jonatatyska/Qwen2.5-1.5B-Instruct-EmbodiedZero-SFT
Text Generation
• 2B • Updated
• 5
jonatatyska/Qwen2.5-1.5B-Instruct-EmbodiedZero
Text Generation
• 2B • Updated
• 1
jonatatyska/Qwen2.5-0.5B-Instruct-EmbodiedZero
Text Generation
• 0.5B • Updated
• 1
jonatatyska/Qwen2.5-3B-Instruct-EmbodiedZero
Text Generation
• 3B • Updated
• 1