Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Arvind Rajasekaran's picture
2 1 11

Arvind Rajasekaran

arvindcr4
Balasandhya's profile picture
·

AI & ML interests

None yet

Recent Activity

updated a Space 13 days ago
arvindcr4/tinkerrl-bench-demo
published a Space 13 days ago
arvindcr4/tinkerrl-bench-demo
updated a model 19 days ago
arvindcr4/tinker-rl-w1_deepseek-v31-base-deepseek-v3.1-base-s42
View all activity

Organizations

None yet

arvindcr4 's models 42

arvindcr4/tinker-rl-bench-ppo_gsm8k_Llama-3.1-8B-Instruct_s42

Text Generation • Updated 19 days ago • 16

arvindcr4/tinker-rl-bench-kl_track_Qwen3-8B_s42

Reinforcement Learning • Updated 19 days ago

arvindcr4/tinker-rl-bench-scale_gsm8k_qwen3-8b

Reinforcement Learning • Updated 20 days ago

arvindcr4/tinker-rl-bench-frontier_gsm8k_deepseek-v3.1

Reinforcement Learning • Updated 20 days ago

arvindcr4/tinker-rl-bench-cross_tool_llama-8b-inst

Reinforcement Learning • Updated 20 days ago

arvindcr4/skyrl-tinker-qwen3-8b-tool_use

Updated Mar 29

arvindcr4/llama-3.2-1b-arithmetic-rl-lora

Reinforcement Learning • Updated Mar 14

arvindcr4/llama-3.2-1b-distillation-offpolicy-lora

Updated Mar 14 • 1

arvindcr4/tool-call-lora-qwen2.5-7b

Updated Mar 9 • 1

arvindcr4/tool-call-lora-qwen2.5-3b

Updated Mar 9

arvindcr4/tool-call-lora-qwen1.5b

Updated Mar 9

arvindcr4/tool-call-lora-qwen0.5b

Updated Mar 9 • 2
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs