Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Firworks
/
INTELLECT-3-nvfp4

Safetensors
glm4_moe
8-bit precision
compressed-tensors
Model card Files Files and versions
xet
Community
7
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Request quantization model

1
#6 opened 23 days ago by
win10

cost estimates?

4
#5 opened 23 days ago by
lightenup

is NVFP4 supported on sm120 (blackwell rtx pro 6000, rtx 5090 etc)?

10
#4 opened 24 days ago by
Fernanda24

Minimax

1
#3 opened 25 days ago by
jc2375

works with vLLM, with FLASHINFER_MOE_FP4

1
#2 opened 27 days ago by
bnjmnmarie

Can you redece the size from 62 GB to about 35-40 GB range in 4bit or lesser?

4
#1 opened 27 days ago by
Prompt48
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs