On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published 25 days ago • 232
Macaron-A2UI: A Model for Generative UI in Personal Agents Paper • 2605.24830 • Published May 24 • 83
MinT: Managed Infrastructure for Training and Serving Millions of LLMs Paper • 2605.13779 • Published May 13 • 223
EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models Paper • 2502.11916 • Published Feb 17, 2025 • 1
Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling Paper • 2602.19919 • Published Feb 27
MDN: Parallelizing Stepwise Momentum for Delta Linear Attention Paper • 2605.05838 • Published May 7 • 5
$δ$-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published May 12 • 131
Position: LLM Inference Should Be Evaluated as Energy-to-Token Production Paper • 2605.11733 • Published May 12 • 3
PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence Paper • 2512.16793 • Published Dec 18, 2025 • 76
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published Nov 26, 2025 • 128