Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders
Paper • 2603.19209 • Published • 5
Artificial Intelligence, Computer Vision, Machine Learning, Computational Photography, Image Enhancement, Super-Resolution, Compression, Streaming