airlsyn

AI & ML interests

AI & RL

Recent Activity

liked a dataset about 4 hours ago

amd/ReasonLite-Dataset

upvoted a collection 2 days ago

Tmax

upvoted an article 11 days ago

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

View all activity

Organizations

upvoted a collection 2 days ago

Tmax

Collection

Data and models associated with "Tmax: A simple recipe for terminal agents". paper: https://arxiv.org/abs/2606.23321 • 23 items • Updated 3 days ago • 12

upvoted an article 11 days ago

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

qgallouedec

•

Dec 4, 2025

• 72

upvoted a collection 16 days ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 14 days ago • 168

upvoted a paper 24 days ago

NITP: Next Implicit Token Prediction for LLM Pre-training

Paper • 2605.24956 • Published May 24 • 35

upvoted a paper 30 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published May 26 • 144

upvoted a paper about 1 month ago

LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?

Paper • 2605.08985 • Published May 9 • 23

upvoted 4 papers about 2 months ago

upvoted 2 papers 2 months ago

DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data

Paper • 2604.19859 • Published Apr 21 • 54

SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments

Paper • 2604.14144 • Published Apr 15 • 63

upvoted an article 2 months ago

Article

You could have designed state of the art positional encoding

FL33TW00D-HF

•

Nov 25, 2024

• 487

upvoted 5 papers 3 months ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published Apr 6 • 116

UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving

Paper • 2604.02190 • Published Apr 2 • 29

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 87

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published Mar 17 • 110

On Surprising Effectiveness of Masking Updates in Adaptive Optimizers

Paper • 2602.15322 • Published Feb 17 • 11

upvoted an article 4 months ago

Article

Introducing Storage Buckets on the Hugging Face Hub

Wauplin, coyotte508, XciD, victor, julien-c, lhoestq, pierric, Sylvestre, hlarcher, rajatarya, seanses, assafvayner

•

Mar 10

• 196

upvoted a paper 4 months ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 120

airlsyn

AI & ML interests

Recent Activity

Organizations

airlsyn's activity

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

You could have designed state of the art positional encoding

Introducing Storage Buckets on the Hugging Face Hub