Hallucinations Undermine Trust; Metacognition is a Way Forward Paper • 2605.01428 • Published 6 days ago • 18
LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards Paper • 2603.02146 • Published Mar 2 • 1
OptiMer: Optimal Distribution Vector Merging Is Better than Data Mixing for Continual Pre-Training Paper • 2603.28858 • Published Mar 30 • 9
Structured Document Translation via Format Reinforcement Learning Paper • 2512.05100 • Published Dec 4, 2025 • 2
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12, 2025 • 495
Gemini Embedding: Generalizable Embeddings from Gemini Paper • 2503.07891 • Published Mar 10, 2025 • 47