5 2

bowang

bwang3579

AI & ML interests

None yet

Recent Activity

authored a paper 18 days ago

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

upvoted a paper 19 days ago

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

published a dataset about 1 month ago

bwang3579/medagent

View all activity

Organizations

authored a paper 18 days ago

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Paper • 2606.09669 • Published 20 days ago • 46

upvoted a paper 19 days ago

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Paper • 2606.09669 • Published 20 days ago • 46

published a dataset about 1 month ago

bwang3579/medagent

Updated May 22 • 8

updated a collection about 2 months ago

JoyAI-Image

Collection

JoyAI-Image • 3 items • Updated 1 day ago • 7

authored 9 papers about 2 months ago

Have Seen Me Before? Automating Dataset Updates Towards Reliable and Timely Evaluation

Paper • 2402.11894 • Published Feb 19, 2024

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14, 2025 • 57

Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model

Paper • 2503.11251 • Published Mar 14, 2025 • 1

STORYANCHORS: Generating Consistent Multi-Scene Story Frames for Long-Form Narratives

Paper • 2505.08350 • Published May 13, 2025

AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning

Paper • 2410.13181 • Published Oct 17, 2024 • 1

SetPO: Set-Level Policy Optimization for Diversity-Preserving LLM Reasoning

Paper • 2602.01062 • Published Feb 1 • 2

EXCEEDS: Extracting Complex Events via Nugget-based Grid Modeling in Scientific Domain

Paper • 2406.14075 • Published Apr 24

Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation

Paper • 2605.04128 • Published May 5 • 17

TextLDM: Language Modeling with Continuous Latent Diffusion

Paper • 2605.07748 • Published May 8 • 26

upvoted 2 papers about 2 months ago

Beyond Retrieval: A Multitask Benchmark and Model for Code Search

Paper • 2605.04615 • Published May 6 • 24

TextLDM: Language Modeling with Continuous Latent Diffusion

Paper • 2605.07748 • Published May 8 • 26

upvoted a paper 2 months ago

SetPO: Set-Level Policy Optimization for Diversity-Preserving LLM Reasoning

Paper • 2602.01062 • Published Feb 1 • 2

liked a dataset 3 months ago

EasonXiao-888/SpatialEdit-500K

Viewer • Updated Apr 8 • 499k • 69k • 10

updated a model 3 months ago

jdopensource/JoyAI-Image-Edit

Image-to-Image • Updated May 7 • 200 • 130

upvoted a paper 3 months ago

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Paper • 2604.04911 • Published Apr 6 • 36

liked a model 3 months ago

jdopensource/JoyAI-Image-Edit

Image-to-Image • Updated May 7 • 200 • 130

bowang

AI & ML interests

Recent Activity

Organizations

bwang3579's activity