Sheikh Jubair's picture

4

Sheikh Jubair

sheikhjubair

·

AI & ML interests

None yet

Organizations

upvoted 2 papers 8 months ago

SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning

Paper • 2506.19767 • Published Jun 24, 2025 • 15

Towards a Unified View of Large Language Model Post-Training

Paper • 2509.04419 • Published Sep 4, 2025 • 76

upvoted 2 papers almost 2 years ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 118

World Model on Million-Length Video And Language With RingAttention

Paper • 2402.08268 • Published Feb 13, 2024 • 40