UCoder: Unsupervised Code Generation by Internal Probing of Large Language Models Paper • 2512.17385 • Published Dec 19, 2025 • 19
Scaling Laws for Code: Every Programming Language Matters Paper • 2512.13472 • Published Dec 15, 2025 • 15
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 301
Multi-Agent Collaboration for Multilingual Code Instruction Tuning Paper • 2502.07487 • Published Feb 11, 2025
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models Paper • 2502.13059 • Published Feb 18, 2025
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models Paper • 2502.16614 • Published Feb 23, 2025 • 27
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation Paper • 2505.14552 • Published May 20, 2025 • 1
M3TQA: Massively Multilingual Multitask Table Question Answering Paper • 2508.16265 • Published Aug 22, 2025
V-GameGym: Visual Game Generation for Code Large Language Models Paper • 2509.20136 • Published Sep 24, 2025 • 9
MMTableBench Collection MMTableBench: A Multi-level Multimodal Benchmark for Reasoning and Layout Complexity in Table QA • 1 item • Updated Sep 2, 2025