interleaved-bench

university

AI & ML interests

None defined yet.

Recent Activity

Kyunnilee authored a paper about 2 months ago

Constantly Improving Image Models Need Constantly Improving Benchmarks

g-luo authored a paper 3 months ago

Learning a Generative Meta-Model of LLM Activations

g-luo submitted a paper 3 months ago

Learning a Generative Meta-Model of LLM Activations

View all activity

authored a paper about 2 months ago

Constantly Improving Image Models Need Constantly Improving Benchmarks

Paper • 2510.15021 • Published Oct 16, 2025 • 10

authored a paper 3 months ago

Learning a Generative Meta-Model of LLM Activations

Paper • 2602.06964 • Published Feb 6 • 3

submitted a paper to Daily Papers 3 months ago

Learning a Generative Meta-Model of LLM Activations

Paper • 2602.06964 • Published Feb 6 • 3

authored a paper 3 months ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published Jan 23 • 40

authored a paper 3 months ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published Jan 23 • 40

authored a paper 5 months ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published Nov 24, 2025 • 22

updated a dataset 7 months ago

interleaved-bench/TweetBench

Viewer • Updated Oct 16, 2025 • 4.69k • 63 • 1

published a dataset 7 months ago

interleaved-bench/TweetBench

Viewer • Updated Oct 16, 2025 • 4.69k • 63 • 1

in interleaved-bench/TweetBench 10 months ago

Add `rubrics` column across all splits

#2 opened 10 months ago by

Add `rubrics` column across all splits

#1 opened 10 months ago by

authored 2 papers 11 months ago

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published Apr 17, 2025 • 39

Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint

Paper • 2505.23759 • Published May 29, 2025 • 5

authored 2 papers about 1 year ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21, 2025 • 44

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22, 2025 • 64

authored a paper about 1 year ago

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published Mar 19, 2025 • 49

authored a paper about 1 year ago

Atlas: Multi-Scale Attention Improves Long Context Image Modeling

Paper • 2503.12355 • Published Mar 16, 2025 • 12

authored 3 papers over 1 year ago

Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence

Paper • 2305.14334 • Published May 23, 2023 • 1

Readout Guidance: Learning Control from Diffusion Features

Paper • 2312.02150 • Published Dec 4, 2023 • 3

Task Vectors are Cross-Modal

Paper • 2410.22330 • Published Oct 29, 2024 • 11

authored a paper almost 2 years ago

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Paper • 2403.19822 • Published Mar 28, 2024