Mehul Damani PRO

mehuldamani

https://damanimehul.github.io

AI & ML interests

Reinforcement Learning, Large Language Models

Recent Activity

updated a model 1 day ago

mehuldamani/bug_fixing_new-arl-add_multiply

published a model 1 day ago

mehuldamani/bug_fixing_new-arl-add_multiply

updated a model 1 day ago

mehuldamani/bug_fixing_rlvr-7b-nokl-v2

Organizations

None yet

mehuldamani's datasets 69

mehuldamani/countdown_code_10_corrupted

Viewer • Updated 19 days ago • 15.6k • 43

mehuldamani/mini-story

Viewer • Updated Mar 16 • 1.1k • 4

mehuldamani/story-classifier-Instruct-SFT-v1

Viewer • Updated Mar 15 • 22k • 7

mehuldamani/story-classifier-Instruct-v1

Viewer • Updated Mar 15 • 22k • 4

mehuldamani/story-tranch-2

Viewer • Updated Mar 15 • 26k • 6

mehuldamani/story-tranch-1

Viewer • Updated Mar 15 • 26k • 9

mehuldamani/LitBench-tranch-4

Viewer • Updated Mar 13 • 11.7k • 2

mehuldamani/LitBench-tranch-3

Viewer • Updated Mar 13 • 11.7k • 5

mehuldamani/LitBench-tranch-2

Viewer • Updated Mar 13 • 11.7k

mehuldamani/LitBench-tranch-1

Viewer • Updated Mar 13 • 11.7k • 7

mehuldamani/Instruct-Classifier-v1

Viewer • Updated Mar 9 • 22k • 18

mehuldamani/Instruct-SFT-Classifier-v1

Viewer • Updated Mar 9 • 22k • 15

mehuldamani/aime

Viewer • Updated Mar 1 • 78 • 10

mehuldamani/multi-answer-sft-target-dataset

Viewer • Updated Feb 25 • 1.59k • 12

mehuldamani/big-math-very-tough

Viewer • Updated Feb 24 • 12.5k • 9 • 1

mehuldamani/hotpot_qa_test_gold_removed_1

Viewer • Updated Jan 26 • 20.5k • 12

mehuldamani/hotpot_qa_test_gold_removed_2

Viewer • Updated Jan 26 • 20.5k • 16

mehuldamani/hotpot_qa_trainTest_gold_removed_2

Viewer • Updated Jan 26 • 20.5k • 6

mehuldamani/hotpot2Removed_eval_10Runs_rlvr_multi_on_rlcr_multi

Viewer • Updated Jan 25 • 500 • 4

mehuldamani/big-math-tough

Viewer • Updated Jan 20 • 18.5k • 73

mehuldamani/medTroubleshootig-rlvr-220-evaled-on-rlcr

Viewer • Updated Jan 15 • 5k • 5

mehuldamani/medTroubleshootig-rlvr-220-evaled-on-rlvr

Viewer • Updated Jan 15 • 5k • 5

mehuldamani/medDataset_25k

Viewer • Updated Dec 29, 2025 • 75k • 176

mehuldamani/medDataset

Viewer • Updated Dec 28, 2025 • 1.29M • 4

mehuldamani/qwen3_8b_ambigQA_rlcr_multi_analysis

Viewer • Updated Dec 26, 2025 • 2k • 6

mehuldamani/qwen3_8b_ambigQA_rlcr_single_passk_tryAgain

Viewer • Updated Dec 25, 2025 • 2k • 4

mehuldamani/ambigQA

Viewer • Updated Dec 22, 2025 • 12k • 12

mehuldamani/judge-new-sft-instruct

Viewer • Updated Dec 10, 2025 • 100 • 3

mehuldamani/judge-new-sft-base

Viewer • Updated Dec 10, 2025 • 100 • 3

mehuldamani/judge-new-instruct

Viewer • Updated Dec 10, 2025 • 100 • 3