Brick-Composer: Using MLLMs for Assembly with Diverse Bricks Paper • 2606.05445 • Published 8 days ago • 7
view article Article A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons NormalUhr • Feb 4, 2025 • 36
The Open Molecules 2025 (OMol25) Dataset, Evaluations, and Models Paper • 2505.08762 • Published May 13, 2025 • 4
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks Paper • 2502.17832 • Published Feb 25, 2025 • 6
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning Paper • 2509.25760 • Published Sep 30, 2025 • 55
EpiCache: Episodic KV Cache Management for Long Conversational Question Answering Paper • 2509.17396 • Published Sep 22, 2025 • 19
UserBench: An Interactive Gym Environment for User-Centric Agents Paper • 2507.22034 • Published Jul 29, 2025 • 30
Perception-Aware Policy Optimization for Multimodal Reasoning Paper • 2507.06448 • Published Jul 8, 2025 • 48
InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding Paper • 2506.15745 • Published Jun 18, 2025 • 14
view article Article You could have designed state of the art positional encoding FL33TW00D-HF • Nov 25, 2024 • 484
Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks Paper • 2501.11733 • Published Jan 20, 2025 • 28
Why So Gullible? Enhancing the Robustness of Retrieval-Augmented Models against Counterfactual Noise Paper • 2305.01579 • Published May 2, 2023 • 2
view article Article SeeMoE: Implementing a MoE Vision Language Model from Scratch AviSoori1x • Jun 23, 2024 • 39
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning Paper • 2404.16994 • Published Apr 25, 2024 • 38