Mike Staub

mikestaub

https://michaelstaub.com

AI & ML interests

robot perception, 3d graphics

Recent Activity

liked a model 1 day ago

thinkingmachines/Inkling

liked a model 2 days ago

AngelSlim/Hy3-GGUF

Organizations

None yet

upvoted an article 1 day ago

SmolVLM - small yet mighty Vision Language Model

Article • andito, merve, mfarre, eliebak, pcuenq (+3 more) • Nov 26, 2024 • 424

upvoted an article 13 days ago

Hugging Face and Cerebras bring Gemma 4 to real-time voice AI

Article • A-Mahla, andito, lvwerra, vyassaurabh (+2 more) • 16 days ago • 76

upvoted an article 19 days ago

Norm-Preserving Biprojected Abliteration

Article • grimjim • Nov 6, 2025 • 88

upvoted a collection 21 days ago

Ornith-1.0

Ornith-1.0 is a family of open-source LLMs specialized for agentic coding. • 8 items • Updated 20 days ago • 346

upvoted a collection 22 days ago

Cosmos3

Omnimodal World Models for Physical AI • 21 items • Updated 7 days ago • 149

upvoted a paper 28 days ago

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

Paper • 2606.18322 • Published about 1 month ago • 17

upvoted an article about 2 months ago

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

Article • nvidia • Jun 1 • 86

upvoted a paper about 2 months ago

Hierarchical Reasoning Model

Paper • 2506.21734 • Published Jun 26, 2025 • 54

upvoted a paper 3 months ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 288

upvoted a collection 3 months ago

Bonsai

1-bit Bonsai models • 7 items • Updated Jun 4 • 211

upvoted a paper 3 months ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 82

upvoted an article 3 months ago

Falcon Perception

Article • tiiuae • Apr 1 • 70

upvoted a paper 4 months ago

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 40

upvoted a collection 4 months ago

Transformers.js V4 demos

A collection of demos built with Transformers.js V4 • 24 items • Updated Apr 16 • 67

upvoted an article 4 months ago

Transformers.js v4: Now Available on NPM!

Article • Xenova, nico-martin • Feb 9 • 97

upvoted a paper 4 months ago

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Paper • 2503.14734 • Published Mar 18, 2025 • 9

upvoted a collection 4 months ago

Avey 1 Research Preview

1.5B preview models trained on 100B tokens of FineWeb, and an instruct-tuned version on smoltalk. • 3 items • Updated Jun 16, 2025 • 7

upvoted an article 4 months ago

Introducing NVIDIA Cosmos Policy for Advanced Robot Control

Article • nvidia • Jan 29 • 49

upvoted 2 papers 4 months ago

Action Chunking with Transformers for Image-Based Spacecraft Guidance and Control

Paper • 2509.04628 • Published Sep 4, 2025 • 1

TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 47