nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 Text Generation • 32B • Updated Mar 15 • 52.2k • 126
InteractComp: Evaluating Search Agents With Ambiguous Queries Paper • 2510.24668 • Published Oct 28, 2025 • 100
interstellarninja/hermes_reasoning_tool_use Viewer • Updated Dec 26, 2025 • 51k • 2.98k • 168
Running 93 Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks • Evaluate multilingual models using FineTasks
Running 3.86k The Ultra-Scale Playbook • The ultimate guide to training LLMs on large GPU clusters
Running Featured 1.35k FineWeb: decanting the web for the finest text data at scale • Explore and download the FineWeb web-scale text dataset
Article SwanLab and Transformers: Power Up Your NLP Experiments Andyrasika • Jun 17, 2024 • 6