Arvind Rajasekaran's picture

Arvind Rajasekaran

arvindcr4

·

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

AdithyaSK/rl-environments-guide

updated a Space 2 months ago

arvindcr4/tinkerrl-bench-demo

published a Space 2 months ago

arvindcr4/tinkerrl-bench-demo

View all activity

Organizations

None yet

arvindcr4 's models 42

arvindcr4/tinker-rl-bench-ppo_gsm8k_Llama-3.1-8B-Instruct_s42

Text Generation • Updated Apr 19 • 8

arvindcr4/tinker-rl-bench-kl_track_Qwen3-8B_s42

Reinforcement Learning • Updated Apr 19

arvindcr4/tinker-rl-bench-scale_gsm8k_qwen3-8b

Reinforcement Learning • Updated Apr 18

arvindcr4/tinker-rl-bench-frontier_gsm8k_deepseek-v3.1

Reinforcement Learning • Updated Apr 18

arvindcr4/tinker-rl-bench-cross_tool_llama-8b-inst

Reinforcement Learning • Updated Apr 18

arvindcr4/skyrl-tinker-qwen3-8b-tool_use

Updated Mar 29 • 3

arvindcr4/llama-3.2-1b-arithmetic-rl-lora

Reinforcement Learning • Updated Mar 14 • 2

arvindcr4/llama-3.2-1b-distillation-offpolicy-lora

Updated Mar 14 • 2

arvindcr4/tool-call-lora-qwen2.5-7b

Updated Mar 9 • 1

arvindcr4/tool-call-lora-qwen2.5-3b

Updated Mar 9 • 3

arvindcr4/tool-call-lora-qwen1.5b

Updated Mar 9 • 2

arvindcr4/tool-call-lora-qwen0.5b

Updated Mar 9 • 1