ReAligned-Qwen3.5 Collection Lazarus AI's ReAligned finetune of Qwen 3.5 alters the alignment of the model, eliminating unwanted behaviors like propaganda, lying, & gaslighting. • 18 items • Updated 5 days ago • 4
Granite 4.1 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated Apr 29 • 55
Article How Long Prompts Block Other Requests - Optimizing LLM Performance tngtech • Jun 12, 2025 • 13
Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance tngtech • Apr 16, 2025 • 80
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated Apr 22 • 43
Gemma 4 Collection Gemma 4 is Google's new model family including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated Apr 22 • 197
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated 3 days ago • 150
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 18 items • Updated about 10 hours ago • 300
Falcon-H1-Tiny Collection A series of extremely small, yet powerful language models redefining capabilities at small scale • 19 items • Updated Mar 2 • 37
💧 LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 33 items • Updated 4 days ago • 144
Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator nvidia • Dec 17, 2025 • 50
Bolmo Collection Artifacts for the Bolmo release: https://allenai.org/papers/bolmo. • 4 items • Updated Dec 23, 2025 • 12
Devstral 2 Collection A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 2 items • Updated Mar 2 • 56