Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders Paper • 2601.16208 • Published 4 days ago • 50
Image Implication Benchmarks Collection 1.II-Bench: An Image Implication Understanding Benchmark for Multimodal 2.CII-Bench: Can MLLMs Understand the Deep Implication Behind Chinese Images? • 2 items • Updated 9 days ago • 1
CPsyCoun Collection CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling • 3 items • Updated 9 days ago • 1
MetaphorStar Collection MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning • 7 items • Updated 9 days ago • 1
Image Implication Benchmarks Collection 1.II-Bench: An Image Implication Understanding Benchmark for Multimodal 2.CII-Bench: Can MLLMs Understand the Deep Implication Behind Chinese Images? • 2 items • Updated 9 days ago • 1
CPsyCoun Collection CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling • 3 items • Updated 9 days ago • 1
MetaphorStar Collection MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning • 7 items • Updated 9 days ago • 1
MetaphorStar Collection MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning • 7 items • Updated 9 days ago • 1