Scaling Continual Learning to 300+ Tasks with Bi-Level Routing Mixture-of-Experts Paper • 2602.03473 • Published 4 days ago • 7
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels Paper • 2502.20087 • Published Feb 27, 2025
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation Paper • 2412.11890 • Published Dec 16, 2024
A2Mamba: Attention-augmented State Space Models for Visual Recognition Paper • 2507.16624 • Published Jul 22, 2025
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks Paper • 2409.09649 • Published Dec 20, 2024
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition Paper • 2310.19380 • Published Mar 31, 2025
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 12 days ago • 57
Video Generation Models as World Models: Efficient Paradigms, Architectures and Algorithms Paper • 2603.28489 • Published Mar 30 • 30