Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 898
Article Mixture of Experts (MoEs) in Transformers +5 ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 161
From Mystery to Mastery: Failure Diagnosis for Improving Manipulation Policies Paper • 2412.02818 • Published Dec 3, 2024 • 1
PAC Bench: Do Foundation Models Understand Prerequisites for Executing Manipulation Policies? Paper • 2506.23725 • Published Jun 30, 2025 • 2
Consistency-based Abductive Reasoning over Perceptual Errors of Multiple Pre-trained Models in Novel Environments Paper • 2505.19361 • Published May 25, 2025 • 1
FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading Paper • 2502.11433 • Published Feb 17, 2025 • 36
DPO-Shift: Shifting the Distribution of Direct Preference Optimization Paper • 2502.07599 • Published Feb 11, 2025 • 15