-
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Paper • 2506.07491 • Published • 50 -
Story2Board: A Training-Free Approach for Expressive Storyboard Generation
Paper • 2508.09983 • Published • 70 -
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Paper • 2503.01710 • Published • 6 -
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
Paper • 2507.21809 • Published • 140
Samuel Thio
sthio90
·
AI & ML interests
None yet
Recent Activity
upvoted an article about 19 hours ago
How NVIDIA AI-Q Reached \#1 on DeepResearch Bench I and II liked
a model 5 days ago
standardmodelbio/smb-v1-1.7b upvoted a paper 15 days ago
OmniGAIA: Towards Native Omni-Modal AI Agents