Challenges and Research Directions for Large Language Model Inference Hardware Paper • 2601.05047 • Published Jan 8 • 1
ACC-UNet: A Completely Convolutional UNet model for the 2020s Paper • 2308.13680 • Published Aug 25, 2023 • 1
Progressive Residual Warmup for Language Model Pretraining Paper • 2603.05369 • Published 8 days ago • 32
Flash-VAED: Plug-and-Play VAE Decoders for Efficient Video Generation Paper • 2602.19161 • Published 19 days ago • 1
Dataset Distillation via Relative Distribution Matching and Cognitive Heritage Paper • 2602.05391 • Published Feb 5 • 1
Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization Paper • 2602.02958 • Published Feb 3 • 34
FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding Paper • 2603.02096 • Published 11 days ago • 1
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 10 days ago • 88
Communication-Inspired Tokenization for Structured Image Representations Paper • 2602.20731 • Published 17 days ago • 4
stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation Paper • 2602.08968 • Published Feb 9 • 1
Is Hierarchical Quantization Essential for Optimal Reconstruction? Paper • 2601.22244 • Published Jan 29 • 1
iFSQ: Improving FSQ for Image Generation with 1 Line of Code Paper • 2601.17124 • Published Jan 23 • 33
Learning a distance measure from the information-estimation geometry of data Paper • 2510.02514 • Published Oct 2, 2025 • 1
Nacrith: Neural Lossless Compression via Ensemble Context Modeling and High-Precision CDF Coding Paper • 2602.19626 • Published 18 days ago • 3
Anatomy of Agentic Memory: Taxonomy and Empirical Analysis of Evaluation and System Limitations Paper • 2602.19320 • Published 19 days ago • 9