Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published Aug 13, 2025 • 53
RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis Paper • 2404.16754 • Published Apr 25, 2024
LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery Paper • 2505.02829 • Published May 5, 2025
MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs Paper • 2510.01691 • Published Oct 2, 2025 • 4
Large Language Models Do NOT Really Know What They Don't Know Paper • 2510.09033 • Published Oct 10, 2025 • 17
ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports Paper • 2507.22030 • Published Jul 29, 2025 • 3
Phrase-grounded Fact-checking for Automatically Generated Chest X-ray Reports Paper • 2509.21356 • Published Sep 20, 2025
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5 Paper • 2601.10527 • Published Jan 15 • 25
LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning Paper • 2601.10129 • Published Jan 15 • 12
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better Paper • 2602.05393 • Published Feb 5 • 8
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published 4 days ago • 86