Article Efficient LLM Pretraining: Packed Sequences and Masked Attention Oct 7, 2024 • 69
Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 407
LLM Embeddings Explained: A Visual and Intuitive Guide • 330 • How Language Models Turn Text into Meaning, From Traditional
Light-R1 Collection Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated Oct 15, 2025 • 12