alphakek/GLM-4.7-Flash-heretic-NVFP4

Text Generation
Transformers
Safetensors
English
Chinese
glm4_moe_lite
quantized
modelopt
nvfp4
vllm
conversational
8-bit precision
GLM-4.7-Flash-heretic-NVFP4
17.8 GB
  • 1 contributor
History: 6 commits
chknlittle
docs: document validated serving runtime and fallback profile
2009e8a verified about 20 hours ago
  • .gitattributes
    227 Bytes
    Add files using upload-large-folder tool about 24 hours ago
  • LICENSE
    1.07 kB
    Add files using upload-large-folder tool about 24 hours ago
  • QUANTIZATION.md
    2.3 kB
    docs: document validated serving runtime and fallback profile about 20 hours ago
  • README.md
    3.18 kB
    docs: add runtime compatibility guidance for vllm 0.16 about 20 hours ago
  • chat_template.jinja
    3.12 kB
    Add files using upload-large-folder tool about 24 hours ago
  • config.json
    3.1 kB
    Add files using upload-large-folder tool about 24 hours ago
  • generation_config.json
    181 Bytes
    Add files using upload-large-folder tool about 24 hours ago
  • hf_quant_config.json
    266 Bytes
    Add files using upload-large-folder tool about 24 hours ago
  • model.safetensors
    17.8 GB
    xet
    Add files using upload-large-folder tool about 24 hours ago
  • tokenizer.json
    20.2 MB
    xet
    Add files using upload-large-folder tool about 24 hours ago
  • tokenizer_config.json
    1.78 kB
    Add files using upload-large-folder tool about 24 hours ago