Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders Paper • 2601.16208 • Published 5 days ago • 51
Image Implication Benchmarks Collection 1. II-Bench: An Image Implication Understanding Benchmark for Multimodal 2. CII-Bench: Can MLLMs Understand the Deep Implication Behind Chinese Images? • 2 items • Updated 10 days ago • 1
CPsyCoun Collection CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling • 3 items • Updated 10 days ago • 1
MetaphorStar Collection MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning • 7 items • Updated 10 days ago • 1
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published Oct 13, 2025 • 166
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Paper • 2507.06261 • Published Jul 7, 2025 • 66
Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework Paper • 2505.17019 • Published May 22, 2025 • 4
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning Paper • 2502.03275 • Published Feb 5, 2025 • 18
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28, 2025 • 123
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published Jan 22, 2025 • 126
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 437
Can MLLMs Understand the Deep Implication Behind Chinese Images? Paper • 2410.13854 • Published Oct 17, 2024 • 12
CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations Paper • 2405.10212 • Published May 16, 2024 • 1
GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability Paper • 2403.04483 • Published Mar 7, 2024 • 1