xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations Paper • 2506.13651 • Published Jun 16 • 8
CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding Paper • 2405.02384 • Published May 3, 2024
Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective Paper • 2502.17262 • Published Feb 24 • 22
Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation Paper • 2411.03957 • Published Nov 6, 2024
One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code Paper • 2205.06126 • Published May 12, 2022 • 1
Leveraging Print Debugging to Improve Code Generation in Large Language Models Paper • 2401.05319 • Published Jan 10, 2024 • 1
Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents Paper • 2105.03887 • Published May 9, 2021
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Paper • 2501.04575 • Published Jan 8 • 25
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks Paper • 2401.05507 • Published Jan 10, 2024 • 1