DLM-Scope Collection Sparse Autoencoders of Diffusion Language Models (Dream-7B, LLaDA-8B) and Large Language Models (Qwen-2.5-7B, LLaMA-3-8B) • 6 items • Updated Feb 5 • 6
SDAR Collection The models without suffixes use the default block size = 4. • 21 items • Updated Jan 2 • 8
Dream 7B Collection https://hkunlp.github.io/blog/2025/dream/ • 2 items • Updated Jul 16, 2025 • 6
LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models Paper • 2505.19223 • Published May 25, 2025 • 9
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated 19 days ago • 703