HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned Messaging Paper • 2102.00824 • Published Jan 18, 2021
CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning Paper • 2306.11128 • Published Jun 19, 2023
EvoClaw: Evaluating AI Agents on Continuous Software Evolution Paper • 2603.13428 • Published Mar 13 • 21
ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning Paper • 2603.10160 • Published Mar 10 • 26
S'MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning Paper • 2504.06426 • Published Apr 8, 2025 • 2
Training Software Engineering Agents and Verifiers with SWE-Gym Paper • 2412.21139 • Published Dec 30, 2024 • 26
LLM-Rec: Personalized Recommendation via Prompting Large Language Models Paper • 2307.15780 • Published Jul 24, 2023 • 28
Decoupling the Depth and Scope of Graph Neural Networks Paper • 2201.07858 • Published Jan 19, 2022 • 1
GraphSAINT: Graph Sampling Based Inductive Learning Method Paper • 1907.04931 • Published Jul 10, 2019
Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2, 2024 • 46
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales Paper • 2405.20974 • Published May 31, 2024
A Single Transformer for Scalable Vision-Language Modeling Paper • 2407.06438 • Published Jul 8, 2024 • 1