Shuai Liu

Choiszt

https://github.com/choiszt

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Paper • 2606.20515 • Published 9 days ago • 39

upvoted a paper 9 days ago

Show the Signal, Hide the Noise: Spectral Forcing for Pixel-Space Diffusion

Paper • 2606.15236 • Published 11 days ago • 21

upvoted a paper about 1 month ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published May 27 • 75

liked a dataset about 1 month ago

ldkong/EgoMM

Updated May 11 • 400 • 1

upvoted 4 papers about 1 month ago

PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects

Paper • 2605.21572 • Published May 20 • 55

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published May 19 • 137

Lance: Unified Multimodal Modeling by Multi-Task Synergy

Paper • 2605.18678 • Published May 18 • 79

AI for Auto-Research: Roadmap & User Guide

Paper • 2605.18661 • Published May 18 • 69

upvoted 2 papers about 2 months ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published Apr 24 • 231

upvoted a paper 2 months ago

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Paper • 2604.23781 • Published Apr 26 • 33

updated a dataset 2 months ago

Choiszt/FileGram

Viewer • Updated Apr 14 • 8.67k • 970 • 5

upvoted 2 papers 3 months ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 181

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published Apr 6 • 237

authored a paper 3 months ago

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Paper • 2604.04901 • Published Apr 6 • 40

liked a dataset 3 months ago

MME-Benchmarks/Video-MME-v2

Benchmark • Updated 16 days ago • 3.2k • 4.13k • 43

upvoted 2 papers 3 months ago

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Paper • 2604.04901 • Published Apr 6 • 40

A Simple Baseline for Streaming Video Understanding

Paper • 2604.02317 • Published Apr 2 • 74

authored 2 papers 3 months ago

Octopus: Embodied Vision-Language Programmer from Environmental Feedback

Paper • 2310.08588 • Published Oct 12, 2023 • 38

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Paper • 2602.08439 • Published Feb 9 • 28
