ut-enyac/quamba2-2.7b-w4a16
Updated
•
1
Energy-aware Computing, Low Power Design, EDA, Dark Silicon, Efficient Deep Learning
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models