Article We Got Claude to Build CUDA Kernels and teach open models! +2 about 22 hours ago • 22
Llama-Embed-Nemotron-8B Collection State-of-the-Art Text Embedding Model • 3 items • Updated 8 days ago • 4
Nemotron RAG Collection Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex do • 11 items • Updated 8 days ago • 65
Languages identification Collection a variety of pre-trained language identification models • 9 items • Updated Jul 31, 2025 • 1
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29, 2025 • 143
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29, 2025 • 211
Article Blazingly fast whisper transcriptions with Inference Endpoints +4 May 13, 2025 • 81
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published Jan 22, 2025 • 90