airlsyn

AI & ML interests

AI & RL

Recent Activity

upvoted a collection about 12 hours ago

Tmax

upvoted an article 10 days ago

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

liked a dataset 13 days ago

nvidia/Nemotron-Pretraining-Code-v3

View all activity

Organizations

upvoted a collection about 12 hours ago

Tmax

Collection

Data and models associated with "Tmax: A simple recipe for terminal agents". paper: https://arxiv.org/abs/2606.23321 • 23 items • Updated 2 days ago • 10

upvoted an article 10 days ago

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

qgallouedec

•

Dec 4, 2025

• 72

liked a dataset 13 days ago

nvidia/Nemotron-Pretraining-Code-v3

Viewer • Updated 21 days ago • 146M • 2.36k • 54

liked a model 14 days ago

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Text Generation • 335B • Updated about 6 hours ago • 376k • • 211

liked a dataset 14 days ago

nvidia/Nemotron-SFT-OpenCode-v1

Preview • Updated Mar 23 • 3.68k • 52

upvoted a collection 15 days ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 13 days ago • 167

liked a Space 19 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

192

Building and scaling RL environments for LLM training

upvoted a paper 23 days ago

NITP: Next Implicit Token Prediction for LLM Pre-training

Paper • 2605.24956 • Published May 24 • 35

liked a dataset 24 days ago

stanford-vision-lab/gpic

Updated 20 days ago • 210k • 141

liked a dataset 25 days ago

amphora/ResearchMath-14k

Viewer • Updated 12 days ago • 14.1k • 3k • 54

upvoted a paper 29 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published about 1 month ago • 144

liked 6 datasets about 1 month ago

New activity in openbmb/MiniCPM-V-4.6 about 1 month ago

feat: add tool_call example

#5 opened about 1 month ago by

airlsyn

upvoted a paper about 1 month ago

LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?

Paper • 2605.08985 • Published May 9 • 23

New activity in openbmb/MiniCPM-V-4.6 about 1 month ago

Create Ааа

#4 opened about 1 month ago by

Vobim14

airlsyn

AI & ML interests

Recent Activity

Organizations

airlsyn's activity

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

The ultimate guide to RL environments: building and scaling them in the LLM era

feat: add tool_call example

Create Ааа