Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
FSMBench
university
Activity Feed
Follow
7
AI & ML interests
Evaluating and Benchmarking Large Multimodal Models
Recent Activity
taesiri
submitted
a paper
about 7 hours ago
A Benchmark for Interactive World Models with a Unified Action Generation Framework
taesiri
submitted
a paper
about 7 hours ago
Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies
taesiri
submitted
a paper
about 7 hours ago
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
View all activity
Team members
5
FSMBench
's models
None public yet