CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents Paper • 2407.01511 • Published Jul 1, 2024
CCS: Controllable and Constrained Sampling with Diffusion Models via Initial Noise Perturbation Paper • 2502.04670 • Published Feb 7, 2025
SEAR: Schema-Based Evaluation and Routing for LLM Gateways Paper • 2603.26728 • Published 12 days ago • 6
SEAR: Schema-Based Evaluation and Routing for LLM Gateways Paper • 2603.26728 • Published 12 days ago • 6
RelBench: A Benchmark for Deep Learning on Relational Databases Paper • 2407.20060 • Published Jul 29, 2024 • 9
PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning Paper • 2404.00776 • Published Mar 31, 2024 • 1
Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency Paper • 2307.08123 • Published Jul 16, 2023 • 1