Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani PRO
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated a model about 17 hours ago
mehuldamani/story-full-disc-judge-binarized-feature-v4-new published a model about 17 hours ago
mehuldamani/story-full-disc-judge-binarized-feature-v4-new updated a model about 17 hours ago
mehuldamani/story-full-disc-judge-binarized-feature-v5-newOrganizations
None yet