Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Saksham Loonker
SLoonker
Follow
saksham-loonker
AI & ML interests
I am very interested in RL and other post-training, as well as building Efficient LLMs for Sparse Resources.
Recent Activity
updated
a collection
2 days ago
Small RL Datasets For Training
updated
a collection
2 days ago
Small RL Datasets For Training
updated
a collection
2 days ago
Small RL Datasets For Training
View all activity
Organizations
None yet
SLoonker
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a collection
2 days ago
Small RL Datasets For Training
Collection
6 items
•
Updated
2 days ago
New activity in
SLoonker/RL-OpenCodeReasoning-DPO
2 days ago
Added License Of Source
#1 opened 2 days ago by
SmartAnon
updated
a dataset
2 days ago
SLoonker/RL-Claude-Creative-Writing-DPO
Viewer
•
Updated
2 days ago
•
818
•
5
published
a dataset
2 days ago
SLoonker/RL-Claude-Creative-Writing-DPO
Viewer
•
Updated
2 days ago
•
818
•
5
updated
a dataset
2 days ago
SLoonker/RL-STEM-DPO
Viewer
•
Updated
2 days ago
•
1.51k
•
13
published
a dataset
2 days ago
SLoonker/RL-STEM-DPO
Viewer
•
Updated
2 days ago
•
1.51k
•
13
updated
a dataset
2 days ago
SLoonker/RL-OpenCodeReasoning-DPO
Viewer
•
Updated
2 days ago
•
2.75k
•
7
published
a dataset
2 days ago
SLoonker/RL-OpenCodeReasoning-DPO
Viewer
•
Updated
2 days ago
•
2.75k
•
7
updated
a dataset
2 days ago
SLoonker/RL-Claude-Creative-Writing-SFT
Viewer
•
Updated
2 days ago
•
818
•
16
published
a dataset
2 days ago
SLoonker/RL-Claude-Creative-Writing-SFT
Viewer
•
Updated
2 days ago
•
818
•
16
updated
a dataset
2 days ago
SLoonker/RL-Ling-Coding-DPO
Viewer
•
Updated
2 days ago
•
2.84k
•
12
published
a dataset
2 days ago
SLoonker/RL-Ling-Coding-DPO
Viewer
•
Updated
2 days ago
•
2.84k
•
12
updated
a dataset
2 days ago
SLoonker/RL-Claude-Reasoning-GRPO-Prompts
Viewer
•
Updated
2 days ago
•
2.21k
•
12
published
a dataset
2 days ago
SLoonker/RL-Claude-Reasoning-GRPO-Prompts
Viewer
•
Updated
2 days ago
•
2.21k
•
12
updated
a dataset
2 days ago
SLoonker/RL-Claude-Reasoning-SFT
Viewer
•
Updated
2 days ago
•
2.21k
•
13
•
1
Load more