Running on CPU Upgrade Featured 3.22k The Smol Training Playbook š 3.22k The secrets to building world-class LLMs
When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA Paper ⢠2510.04849 ⢠Published Oct 6, 2025 ⢠117
Revisiting Long-context Modeling from Context Denoising Perspective Paper ⢠2510.05862 ⢠Published Oct 7, 2025 ⢠21
Running 353 LLM Embeddings Explained: A Visual and Intuitive Guide š 353 How Language Models Turn Text into Meaning, From Traditional
GemmaX2 Collection GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. ⢠7 items ⢠Updated Feb 7, 2025 ⢠24
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance tngtech ⢠Apr 16, 2025 ⢠81
view article Article Visualize and understand GPU memory in PyTorch qgallouedec ⢠Dec 24, 2024 ⢠273