Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
Mehul Damani
PRO
mehuldamani
Follow
wjurayj's profile picture
John6666's profile picture
Spechawk's profile picture
3 followers
·
0 following
https://damanimehul.github.io
MehulDamani2
damanimehul
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated
a model
about 21 hours ago
mehuldamani/countdown_arl-sft-add_multiply-v8
published
a model
about 21 hours ago
mehuldamani/countdown_arl-sft-add_multiply-v8
updated
a model
about 21 hours ago
mehuldamani/countdown_arl-sft-multiply-v8
View all activity
Organizations
None yet
mehuldamani
's datasets
69
Sort: Recently updated
mehuldamani/countdown_code_10_corrupted
Viewer
•
Updated
17 days ago
•
15.6k
•
42
mehuldamani/mini-story
Viewer
•
Updated
Mar 16
•
1.1k
•
10
mehuldamani/story-classifier-Instruct-SFT-v1
Viewer
•
Updated
Mar 15
•
22k
•
9
mehuldamani/story-classifier-Instruct-v1
Viewer
•
Updated
Mar 15
•
22k
•
5
mehuldamani/story-tranch-2
Viewer
•
Updated
Mar 15
•
26k
•
11
mehuldamani/story-tranch-1
Viewer
•
Updated
Mar 15
•
26k
•
13
mehuldamani/LitBench-tranch-4
Viewer
•
Updated
Mar 13
•
11.7k
•
3
mehuldamani/LitBench-tranch-3
Viewer
•
Updated
Mar 13
•
11.7k
•
6
mehuldamani/LitBench-tranch-2
Viewer
•
Updated
Mar 13
•
11.7k
•
1
mehuldamani/LitBench-tranch-1
Viewer
•
Updated
Mar 13
•
11.7k
•
8
mehuldamani/Instruct-Classifier-v1
Viewer
•
Updated
Mar 9
•
22k
•
16
mehuldamani/Instruct-SFT-Classifier-v1
Viewer
•
Updated
Mar 9
•
22k
•
13
mehuldamani/aime
Viewer
•
Updated
Mar 1
•
78
•
12
mehuldamani/multi-answer-sft-target-dataset
Viewer
•
Updated
Feb 25
•
1.59k
•
13
mehuldamani/big-math-very-tough
Viewer
•
Updated
Feb 24
•
12.5k
•
14
•
1
mehuldamani/hotpot_qa_test_gold_removed_1
Viewer
•
Updated
Jan 26
•
20.5k
•
11
mehuldamani/hotpot_qa_test_gold_removed_2
Viewer
•
Updated
Jan 26
•
20.5k
•
12
mehuldamani/hotpot_qa_trainTest_gold_removed_2
Viewer
•
Updated
Jan 26
•
20.5k
•
6
mehuldamani/hotpot2Removed_eval_10Runs_rlvr_multi_on_rlcr_multi
Viewer
•
Updated
Jan 25
•
500
•
4
mehuldamani/big-math-tough
Viewer
•
Updated
Jan 20
•
18.5k
•
66
mehuldamani/medTroubleshootig-rlvr-220-evaled-on-rlcr
Viewer
•
Updated
Jan 15
•
5k
•
6
mehuldamani/medTroubleshootig-rlvr-220-evaled-on-rlvr
Viewer
•
Updated
Jan 15
•
5k
•
6
mehuldamani/medDataset_25k
Viewer
•
Updated
Dec 29, 2025
•
75k
•
162
mehuldamani/medDataset
Viewer
•
Updated
Dec 28, 2025
•
1.29M
•
4
mehuldamani/qwen3_8b_ambigQA_rlcr_multi_analysis
Viewer
•
Updated
Dec 26, 2025
•
2k
•
5
mehuldamani/qwen3_8b_ambigQA_rlcr_single_passk_tryAgain
Viewer
•
Updated
Dec 25, 2025
•
2k
•
4
mehuldamani/ambigQA
Viewer
•
Updated
Dec 22, 2025
•
12k
•
12
mehuldamani/judge-new-sft-instruct
Viewer
•
Updated
Dec 10, 2025
•
100
•
4
mehuldamani/judge-new-sft-base
Viewer
•
Updated
Dec 10, 2025
•
100
•
3
mehuldamani/judge-new-instruct
Viewer
•
Updated
Dec 10, 2025
•
100
•
3
Previous
1
2
3
Next