Salman Rahman PRO

salmannyu

https://salmanrahman.net/

AI & ML interests

Natural Language Processing, Deep Learning, Scalable Oversight, and Language Model Evaluation

Recent Activity

upvoted a collection 11 days ago

upvoted a collection 16 days ago

published a model about 1 month ago

salmannyu/prm-cot-private

upvoted a collection 11 days ago

Qwen2.5

Qwen2.5 language models: pretrained and instruction-tuned models in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B). • 43 items • Updated Mar 2 • 728

upvoted a collection 16 days ago

Pre2Post-Chess

Open-sourced models and datasets for training the chess reasoning models. • 5 items • Updated 17 days ago • 2

published 3 models about 1 month ago

salmannyu/prm-cot-private

Text Generation • 2B • Updated Sep 3, 2025 • 1

salmannyu/qwen-math-7b-step-sft

8B • Updated Sep 3, 2025 • 1

salmannyu/step_cot

Text Generation • 15B • Updated Sep 3, 2025 • 1

updated a model about 1 month ago

salmannyu/model-checkpoints

published a model about 1 month ago

salmannyu/model-checkpoints

authored a paper about 2 months ago

When Can LLMs Learn to Reason with Weak Supervision?

Paper • 2604.18574 • Published Apr 20 • 25

upvoted a collection about 2 months ago

rlvr-weak-supervision

Models from "When Can LLMs Learn to Reason with Weak Supervision?" — Llama-3.2-3B with continual pre-training and Thinking SFT. • 3 items • Updated Apr 20 • 2

upvoted a paper 2 months ago

When Can LLMs Learn to Reason with Weak Supervision?

Paper • 2604.18574 • Published Apr 20 • 25

submitted a paper to Daily Papers 2 months ago

When Can LLMs Learn to Reason with Weak Supervision?

Paper • 2604.18574 • Published Apr 20 • 25

updated a collection 2 months ago

rlvr-weak-supervision

Models from "When Can LLMs Learn to Reason with Weak Supervision?" — Llama-3.2-3B with continual pre-training and Thinking SFT. • 3 items • Updated Apr 20 • 2

updated a model 2 months ago

pavelslab-nyu/Llama-3.2-3B-ThinkSFT

3B • Updated Apr 20 • 8

published a model 2 months ago

pavelslab-nyu/Llama-3.2-3B-ThinkSFT

3B • Updated Apr 20 • 8

updated a model 2 months ago

pavelslab-nyu/Llama-3.2-3B-CPT-Math-ThinkSFT

3B • Updated Apr 20 • 420

published a model 2 months ago

pavelslab-nyu/Llama-3.2-3B-CPT-Math-ThinkSFT

3B • Updated Apr 20 • 420

updated a model 2 months ago

pavelslab-nyu/Llama-3.2-3B-CPT-Math

3B • Updated Apr 20 • 20

published a model 2 months ago

pavelslab-nyu/Llama-3.2-3B-CPT-Math

3B • Updated Apr 20 • 20