fireblade2534

2 24 147

AI & ML interests

None yet

Recent Activity

liked a model 22 days ago

canopylabs/orpheus-3b-0.1-ft

upvoted a paper about 1 month ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

upvoted a paper about 1 month ago

When Vision Speaks for Sound

Organizations

None yet

upvoted 4 papers about 1 month ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published May 28 • 146

When Vision Speaks for Sound

Paper • 2605.16403 • Published May 13 • 161

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 265

CurveBench: A Benchmark for Exact Topological Reasoning over Nested Jordan Curves

Paper • 2605.14068 • Published May 13 • 8

upvoted a paper about 2 months ago

Useful Memories Become Faulty When Continuously Updated by LLMs

Paper • 2605.12978 • Published May 13 • 19

upvoted 3 papers 6 months ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 183

Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts

Paper • 2601.03315 • Published Jan 6 • 7

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Paper • 2512.24271 • Published Dec 30, 2025 • 63

upvoted 2 papers 8 months ago

More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models

Paper • 2509.25848 • Published Sep 30, 2025 • 81

NaviTrace: Evaluating Embodied Navigation of Vision-Language Models

Paper • 2510.26909 • Published Oct 30, 2025 • 14

upvoted an article 9 months ago

Visualizing How VLMs Work

Article • not-lain • Oct 7, 2025 • 55

upvoted 2 collections 9 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 750

Qwen3-Omni

Collection

6 items • Updated Dec 31, 2025 • 204

upvoted 3 collections about 1 year ago

upvoted a collection over 1 year ago

Phi-4

Collection

Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 213

upvoted 3 articles over 1 year ago

Build awesome datasets for video generation

Article • sayakpaul • Feb 12, 2025 • 36

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

Article • jsulz, yuchenglow, znation, saba9 • Feb 12, 2025 • 81

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

Article • andito, mfarre, merve • Jan 23, 2025 • 192
