A manually pruned version of HyperNova 60B, reduced from 80 experts to 56, as my second attempt at pruning a MoE model. It works, but testing was limited. An F16 GGUF is available, but I have been unable to quantize it any lower: the script keeps ballooning the output to 40 GB for a Q4_K_M. I am not technically proficient and would appreciate help.
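For anyone willing to help: this is roughly the approach I understand for quantizing, based on the standard llama.cpp workflow (file names are placeholders, and this is a sketch rather than the exact invocation I used, so I may be missing a step):

```shell
# Quantize the F16 GGUF down to Q4_K_M using llama.cpp's llama-quantize
# (named just `quantize` in older llama.cpp builds).
# The .gguf file names below are placeholders for this repo's actual files.
./llama-quantize HyperNova-60B-F16.gguf HyperNova-60B-Q4_K_M.gguf Q4_K_M
```

If the resulting file is much larger than expected, pointers on which tensors a pruned MoE leaves behind (and whether they are being quantized at all) would be especially appreciated.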