NN Arch - a lzhbrian Collection

lzhbrian 's Collections

NN Arch

NN Arch Components

Loop

Linear Attention

TTT

NN Arch

updated 5 days ago

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Paper • 2512.24617 • Published 13 days ago • 55
Recursive Language Models

Paper • 2512.24601 • Published 13 days ago • 53
Nested Learning: The Illusion of Deep Learning Architectures

Paper • 2512.24695 • Published 13 days ago • 34
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 249
Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 120
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 321
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16, 2025 • 166