Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
Aswin Ravikumar Rangsasamy Veerasamy
rrvaswin
Follow
0 followers
·
1 following
AI & ML interests
Transformers, SSMs
Recent Activity
updated
a model
2 days ago
rrvaswin/qwen_STaR_RL
published
a model
2 days ago
rrvaswin/qwen_STaR_RL
updated
a model
2 days ago
rrvaswin/qwen_4b_RL
View all activity
Organizations
rrvaswin
's models
117
Sort: Recently updated
rrvaswin/qwen_STaR_RL
8B
•
Updated
2 days ago
•
144
rrvaswin/qwen_4b_RL
8B
•
Updated
2 days ago
•
144
rrvaswin/qwen_Vanilla_RL
8B
•
Updated
2 days ago
•
10
rrvaswin/qwen_8b_RL
8B
•
Updated
2 days ago
•
13
rrvaswin/qwen_16b_RL
8B
•
Updated
2 days ago
•
12
rrvaswin/qwen_32b_RL
8B
•
Updated
2 days ago
•
11
rrvaswin/qwen_star_baseline
8B
•
Updated
2 days ago
•
241
rrvaswin/qwen_32b_distill_baseline
8B
•
Updated
2 days ago
•
35
rrvaswin/qwen_32b_SFT
8B
•
Updated
2 days ago
•
43
rrvaswin/qwen_16b_SFT
8B
•
Updated
2 days ago
•
143
rrvaswin/qwen_8b_SFT
8B
•
Updated
2 days ago
•
136
rrvaswin/qwen_4b_SFT
8B
•
Updated
2 days ago
•
311
rrvaswin/qwen_2b_SFT
8B
•
Updated
2 days ago
•
279
rrvaswin/qwen_1b_SFT
8B
•
Updated
2 days ago
•
325
rrvaswin/sefcom_judge_v2_step1324
4B
•
Updated
Apr 2
•
2
rrvaswin/sefcom_judge_v2
4B
•
Updated
Apr 2
•
4
rrvaswin/RL4DecompLMjudge2
4B
•
Updated
Mar 24
•
1
rrvaswin/sefcom_llmjudge1
4B
•
Updated
Mar 24
•
1
rrvaswin/qwen3_4b_rldecomp1
Text Generation
•
4B
•
Updated
Mar 3
•
1
rrvaswin/qwen3_4b_rldecomp
Text Generation
•
4B
•
Updated
Feb 23
•
3
rrvaswin/qwen_coder_3b_e3
242k
•
Updated
Feb 10
•
6
rrvaswin/qwen_coder_3b_e2
242k
•
Updated
Feb 10
•
1
rrvaswin/qwen_coder_3b_e1
242k
•
Updated
Feb 10
•
1
rrvaswin/1_to_16_analysis
1B
•
Updated
Jan 28
•
4
rrvaswin/1_to_1_analysis
1B
•
Updated
Jan 28
•
7
rrvaswin/DAPO_GRPO_2b_incorrect_bs_32_mb_8_n16_cliphigh
1B
•
Updated
Jan 27
•
1
rrvaswin/DAPO_GRPO_4b_incorrect_bs_32_mb_8_n16_cliphigh
1B
•
Updated
Jan 27
•
4
rrvaswin/DAPO_GRPO_8b_incorrect_bs_32_mb_8_n16_cliphigh
1B
•
Updated
Jan 26
•
5
rrvaswin/DAPO_GRPO_16b_incorrect_bs_32_mb_8_n16_cliphigh
1B
•
Updated
Jan 25
•
5
rrvaswin/STaR_RL_DAPO
1B
•
Updated
Jan 22
•
3
Previous
1
2
3
4
Next