Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention Paper • 2606.20945 • Published 6 days ago • 49
nvidia/nemotron-3.5-asr-streaming-0.6b Automatic Speech Recognition • Updated 8 days ago • 41.1k • 660
FrontiersMind/Nandi-Mini-600M-Early-Checkpoint Text Generation • 0.6B • Updated May 17 • 438 • 104
FrontiersMind/Nandi-Mini-150M-Tool-Calling Text Generation • 0.2B • Updated May 18 • 4.86k • 52
Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems Paper • 2604.04936 • Published Jan 8 • 26
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding Paper • 2506.16035 • Published Jun 19, 2025 • 89
Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models Paper • 2401.02333 • Published Jan 4, 2024 • 7