ByRookie

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a collection about 2 months ago

Nemotron-Cascade 2

Collection

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 16 days ago • 50

liked a dataset about 2 months ago

stepfun-ai/Step-3.5-Flash-SFT

Viewer • Updated Mar 14 • 1.62M • 16.2k • 328

liked a model about 2 months ago

miromind-ai/MiroThinker-1.7

Text Generation • 235B • Updated 24 days ago • 768 • 136

upvoted a paper 5 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 63

liked a Space 6 months ago

The Smol Training Playbook

📚

3.15k

The secrets to building world-class LLMs

liked a dataset 6 months ago

allenai/tulu-3-sft-mixture

Viewer • Updated Dec 2, 2024 • 939k • 16.8k • 239

liked a model 7 months ago

Tengyunw/qwen3_30b_moe_eagle3

Updated Nov 5, 2025 • 1.55k • 12

liked a dataset 8 months ago

HuggingFaceFW/finepdfs

Viewer • Updated Apr 3 • 476M • 18.9k • 855

liked a model 9 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Text Generation • 50B • Updated Oct 15, 2025 • 60.1k • 28

liked a dataset 9 months ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25, 2025 • 25.7M • 10.4k • 182

liked a model 9 months ago

MetaStoneTec/XBai-o4

33B • Updated Aug 6, 2025 • 14 • 193

New activity in nvidia/AceReason-1.1-SFT 11 months ago

will you release code rl dataset ?

🔥 3

#2 opened 11 months ago by ByRookie

liked 2 datasets 11 months ago

zwhe99/DeepMath-103K

Viewer • Updated May 29, 2025 • 103k • 6.96k • 360

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9, 2025 • 1.2M • 14.1k • 225

upvoted a paper 11 months ago

ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published May 26, 2025 • 45

liked 2 datasets 12 months ago

a-m-team/AM-Thinking-v1-Distilled

Preview • Updated Jun 12, 2025 • 1.07k • 58

a-m-team/AM-Thinking-v1-RL-Dataset

Viewer • Updated May 21, 2025 • 54.8k • 350 • 18

liked a dataset about 1 year ago

a-m-team/AM-DeepSeek-R1-Distilled-1.4M

Preview • Updated Mar 30, 2025 • 2.95k • 180

upvoted a paper about 1 year ago

MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization

Paper • 2503.16874 • Published Mar 21, 2025 • 45

liked a dataset about 1 year ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8, 2025 • 3.91M • 3.62k • 659
