view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention sirluk • Oct 7, 2024 • 71
view article Article RAG vs Fine-Tuning for LLMs: A Comprehensive Guide with Examples airabbitX • Aug 16, 2024 • 10
view article Article RegMix: Data Mixture as Regression for Language Model Pre-training SivilTaram • Jul 11, 2024 • 15
view article Article makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch AviSoori1x • May 7, 2024 • 121
view article Article Deploying Your FastAPI Applications on Huggingface Via Docker HemanthSai7 • Dec 11, 2023 • 41
view article Article Multilabel Classification using Mistral-7B on a single GPU with quantization and LoRA sirluk • Jan 22, 2024 • 26
view article Article Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H Hcompany • Jun 3, 2025 • 71
view article Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs davidberenstein1957 • May 7, 2025 • 42
view article Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment NormalUhr • Feb 11, 2025 • 120
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face dvgodoy • Feb 11, 2025 • 123