Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated a model about 7 hours ago: RedHatAI/DeepSeek-V4-Flash-NVFP4-FP8
updated a model about 8 hours ago: RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic
updated a model 2 days ago: RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-quantized.w4a16