Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation Paper • 2602.02007 • Published Feb 2 • 17
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation Paper • 2603.22117 • Published 4 days ago • 22
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 15 days ago • 217
view article Article 🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows Feb 2, 2025 • 25
PreSINQ GGUF Collection This collection contains SINQ GGUF models • 4 items • Updated about 1 month ago • 3
Overcoming Data Scarcity in Multi-Dialectal Arabic ASR via Whisper Fine-Tuning Paper • 2506.02627 • Published Jun 3, 2025 • 3
finetune-ar-dialects Collection Models for the thesis titled: "The Effects of Fine-Tuning on the ASR Performance of Dialectal Arabic". • 17 items • Updated May 20, 2024 • 3
view article Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 273
LightRAG: Simple and Fast Retrieval-Augmented Generation Paper • 2410.05779 • Published Oct 8, 2024 • 33
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 11 items • Updated 25 days ago • 83