On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published 26 days ago • 233 • 4
$δ$-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published May 12 • 131 • 5
The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve? Paper • 2502.17535 • Published Feb 24, 2025 • 8 • 2
Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research Paper • 2502.12669 • Published Feb 18, 2025 • 2 • 2
Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing Paper • 2502.04411 • Published Feb 6, 2025 • 4 • 2
Can LLMs Maintain Fundamental Abilities under KV Cache Compression? Paper • 2502.01941 • Published Feb 4, 2025 • 15 • 2
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference Paper • 2502.00299 • Published Feb 1, 2025 • 3 • 2
Should We Really Edit Language Models? On the Evaluation of Edited Language Models Paper • 2410.18785 • Published Oct 24, 2024 • 7 • 2
3D Question Answering for City Scene Understanding Paper • 2407.17398 • Published Jul 24, 2024 • 22 • 5