FSMBench

university

AI & ML interests

Evaluating and Benchmarking Large Multimodal Models

Recent Activity

taesiri submitted a paper about 13 hours ago

Autodata: An agentic data scientist to create high quality synthetic data

taesiri submitted a paper about 13 hours ago

The Hitchhiker's Guide to Agentic AI: From Foundations to Systems

taesiri submitted a paper about 13 hours ago

Improved Large Language Diffusion Models

View all activity

submitted 4 papers to Daily Papers about 13 hours ago

Autodata: An agentic data scientist to create high quality synthetic data

Paper • 2606.25996 • Published 1 day ago • 7

The Hitchhiker's Guide to Agentic AI: From Foundations to Systems

Paper • 2606.24937 • Published 4 days ago • 9

Improved Large Language Diffusion Models

Paper • 2606.25331 • Published 1 day ago • 26

Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

Paper • 2606.25473 • Published 1 day ago • 11

submitted 5 papers to Daily Papers 1 day ago

World Value Models for Robotic Manipulation

Paper • 2606.24742 • Published 3 days ago • 1

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 3 days ago • 107

OpenThoughts-Agent: Data Recipes for Agentic Models

Paper • 2606.24855 • Published 3 days ago • 32

FLUX3D: High-Fidelity 3D Gaussian Generation with Diffusion-Aligned Sparse Representation

Paper • 2606.24874 • Published 3 days ago • 1

DiffusionBench: On Holistic Evaluation of Diffusion Transformers

Paper • 2606.24888 • Published 3 days ago • 9

submitted 5 papers to Daily Papers 2 days ago

Tmax: A simple recipe for terminal agents

Paper • 2606.23321 • Published 4 days ago • 10

Tapered Language Models

Paper • 2606.23670 • Published 4 days ago • 7

Unlimited OCR Works

Paper • 2606.23050 • Published 4 days ago • 28

Training Open Models for Agentic Phone Use

Paper • 2606.23049 • Published 4 days ago • 14

Self-Compacting Language Model Agents

Paper • 2606.23525 • Published 4 days ago • 15

submitted 5 papers to Daily Papers 7 days ago

ENPIRE: Agentic Robot Policy Self-Improvement in the Real World

Paper • 2606.19980 • Published 8 days ago • 14

Holo-World: Unified Camera, Object and Weather Control for Video World Model

Paper • 2606.20083 • Published 8 days ago • 9

Current World Models Lack a Persistent State Core

Paper • 2606.20545 • Published 8 days ago • 15

JAMER: Project-Level Code Framework Dataset and Benchmark on Professional Game Engines

Paper • 2606.19830 • Published 8 days ago • 3

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

Paper • 2606.19531 • Published 9 days ago • 18

submitted a paper to Daily Papers 8 days ago

Sumi: Open Uniform Diffusion Language Model from Scratch

Paper • 2606.19005 • Published 9 days ago • 11