Shubham Parashar

shubhamprshr

·

AI & ML interests

Computer Vision, Multi-Modal Learning

Recent Activity

updated a model 15 days ago

divelab/OPDLM-MATH-8B

updated a model 19 days ago

divelab/OPDLM-MATH-8B-Thinking

updated a model 19 days ago

divelab/OPDLM-MATH-4B-Thinking

View all activity

Organizations

upvoted a paper 26 days ago

Data-Efficient Autoregressive-to-Diffusion Language Models via On-Policy Distillation

Paper • 2606.06712 • Published about 1 month ago • 2

upvoted a paper 29 days ago

Learnability-Informed Fine-Tuning of Diffusion Language Models

Paper • 2605.22939 • Published May 21 • 2

upvoted a paper about 2 months ago

Curriculum Reinforcement Learning from Easy to Hard Tasks Improves LLM Reasoning

Paper • 2506.06632 • Published Mar 16 • 2

upvoted a paper over 1 year ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20, 2025 • 52