2 14 9

Boris Tseitlin

btseytlin

btseytlin

AI & ML interests

None yet

Recent Activity

updated a dataset 2 days ago

SteelmanLabs/aitw

liked a Space 4 days ago

AlexWortega/same-data-different-losses

published a dataset about 1 month ago

SteelmanLabs/aitw

View all activity

Organizations

updated a dataset 2 days ago

SteelmanLabs/aitw

Viewer • Updated 2 days ago • 87.6M • 67

liked a Space 4 days ago

Weight-Space Geometry of Offline Reasoning Training

🧭

Interactive weight-space geometry of six reasoning losses

published a dataset about 1 month ago

SteelmanLabs/aitw

Viewer • Updated 2 days ago • 87.6M • 67

updated a dataset about 2 months ago

btseytlin/aitw-steelman

Viewer • Updated May 5 • 82.8k • 4

published a dataset about 2 months ago

btseytlin/aitw-steelman

Viewer • Updated May 5 • 82.8k • 4

updated a dataset 2 months ago

SteelmanLabs/osu-replays

Viewer • Updated Apr 25 • 694 • 58

published a dataset 3 months ago

SteelmanLabs/osu-replays

Viewer • Updated Apr 25 • 694 • 58

liked a model 4 months ago

HuggingFaceTB/SmolVLM2-2.2B-Instruct

Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 344k • 324

upvoted a paper 5 months ago

AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents

Paper • 2602.06855 • Published Feb 6 • 83

upvoted an article 5 months ago

Article

CRAFT: Continuous Reasoning and Agentic Feedback Tuning

flymy-ai

•

Feb 5

• 66

upvoted an article 6 months ago

Article

TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell

apsys

•

Jan 5

• 14

liked a model 6 months ago

facebook/dinov3-convnext-base-pretrain-lvd1689m

Image Feature Extraction • 87.6M • Updated Aug 19, 2025 • 5.56k • 18

published a dataset 6 months ago

btseytlin/any2json

Viewer • Updated Dec 31, 2025 • 167k • 7

updated a dataset 6 months ago

btseytlin/any2json

Viewer • Updated Dec 31, 2025 • 167k • 7

upvoted 3 papers 7 months ago

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

Paper • 2511.17592 • Published Nov 17, 2025 • 122

Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

Paper • 2511.15210 • Published Nov 19, 2025 • 91

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17, 2025 • 140

liked a model 10 months ago

ai-forever/FRED-T5-large

Updated Dec 5, 2023 • 336 • 28

upvoted an article 12 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 780

upvoted a paper about 1 year ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20, 2025 • 79

Boris Tseitlin

AI & ML interests

Recent Activity

Organizations

btseytlin's activity

Weight-Space Geometry of Offline Reasoning Training

CRAFT: Continuous Reasoning and Agentic Feedback Tuning

TFLOPS Gap: Why FP4 MoE Kernel Engineering Matters on Blackwell

SmolLM3: smol, multilingual, long-context reasoner