🏗️ Building on HF

6 1

Krishna Teja Chitty-Venkata

krishnateja95

https://krishnateja95.github.io/

AI & ML interests

LLM Optimization, Neural Architecture Search, Quantization, Pruning

Recent Activity

updated a model about 15 hours ago

RedHatAI/DeepSeek-V4-Flash-NVFP4-FP8

updated a model about 16 hours ago

RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic

updated a model 3 days ago

RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-quantized.w4a16

View all activity

Organizations

updated a model about 15 hours ago

RedHatAI/DeepSeek-V4-Flash-NVFP4-FP8

Text Generation • 163B • Updated about 15 hours ago • 12.6k • 20

updated a model about 16 hours ago

RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic

Text Generation • 561B • Updated about 16 hours ago • 13.6k • 1

updated 3 models 3 days ago

published a model 3 days ago

inference-optimization/Nemotron-3-Super-prepared-data

Updated 3 days ago

updated a model 3 days ago

inference-optimization/Nemotron-Super-120B-Dflash-SWA

1B • Updated 3 days ago • 17

published a model 3 days ago

inference-optimization/Nemotron-Super-120B-Dflash-SWA

1B • Updated 3 days ago • 17

published 3 models 20 days ago

RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-quantized.w4a16

Text Generation • 565B • Updated 3 days ago • 1.62k • 3

RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-block

Text Generation • 561B • Updated 3 days ago • 1.41k

RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic

Text Generation • 561B • Updated about 16 hours ago • 13.6k • 1

updated a bucket 22 days ago

krishnateja95/Mellum2-12B-A2.5B-Thinking

24.3 GB

published a bucket 22 days ago

krishnateja95/Mellum2-12B-A2.5B-Thinking

24.3 GB

updated a model 28 days ago

RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-BF16

Text Generation • 124B • Updated 28 days ago • 2.33k

updated 2 models about 1 month ago

inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid

Image-Text-to-Text • 28B • Updated May 20 • 103

inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid

Image-Text-to-Text • 28B • Updated May 20 • 103

published a model about 1 month ago

inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid

Image-Text-to-Text • 28B • Updated May 20 • 103

Krishna Teja Chitty-Venkata

AI & ML interests

Recent Activity

Organizations

krishnateja95's activity

krishnateja95/Mellum2-12B-A2.5B-Thinking

krishnateja95/Mellum2-12B-A2.5B-Thinking