Rajat Ghosh PRO
rghosh8
AI & ML interests
None yet
Recent Activity
updated a collection about 1 month ago
ROBOT-OpenVLA updated a collection about 1 month ago
ROBOT-OpenVLA updated a model about 1 month ago
rghosh8/openvla-7b-libero-spatialOrganizations
ARC-GRPO
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 6 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 1 • 1 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 8 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated • 5
GSM8k-GRPO
-
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 1 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged
Text Generation • 7B • Updated • 21 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16
Text Generation • Updated • 1 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16_merged
Text Generation • 7B • Updated • 17
arc-grpo-baseline
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-baseline
Text Generation • Updated • 3 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4
Text Generation • Updated • 2 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 2 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-3407-G-4
Text Generation • Updated • 1
Opencoder-GRPO
-
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
2B • Updated • 7 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4
Text Generation • Updated • 7 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8
Text Generation • Updated • 6 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
2B • Updated • 4
ROBOT-OpenVLA
arc-grpo-baseline
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-baseline
Text Generation • Updated • 3 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4
Text Generation • Updated • 2 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 2 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-3407-G-4
Text Generation • Updated • 1
ARC-GRPO
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 6 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 1 • 1 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 8 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated • 5
Opencoder-GRPO
-
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
2B • Updated • 7 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4
Text Generation • Updated • 7 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8
Text Generation • Updated • 6 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
2B • Updated • 4
GSM8k-GRPO
-
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 1 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged
Text Generation • 7B • Updated • 21 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16
Text Generation • Updated • 1 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16_merged
Text Generation • 7B • Updated • 17