Mehul Damani PRO

mehuldamani

·

https://damanimehul.github.io

AI & ML interests

Reinforcement Learning, Large Language Models

Recent Activity

updated a model about 2 months ago

mehuldamani/bugfixing-new-arl-add

updated a model about 2 months ago

mehuldamani/countdown-arl-sft-add-v8

published a model about 2 months ago

mehuldamani/bugfixing-new-arl-add

View all activity

Organizations

None yet

upvoted a paper 4 months ago

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

Paper • 2603.24844 • Published Mar 25 • 10

upvoted a collection 12 months ago

RLCR

Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty • 10 items • Updated Aug 6, 2025 • 7

upvoted a paper about 1 year ago

Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty

Paper • 2507.16806 • Published Jul 22, 2025 • 7