Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

reyvan
/
Qwen-7B-8bit

Text Generation
Transformers
PyTorch
qwen
custom_code
4-bit precision
gptq
Model card Files Files and versions
xet
Community
Qwen-7B-8bit
24.1 GB
  • 1 contributor
History: 13 commits
reyvan's picture
reyvan
Upload pytorch_model.bin
0cf7fa4 verified almost 2 years ago
  • .gitattributes
    1.52 kB
    initial commit almost 2 years ago
  • README.md
    28 Bytes
    initial commit almost 2 years ago
  • cache_autogptq_cuda_256.cpp
    8.4 kB
    Upload 9 files almost 2 years ago
  • cache_autogptq_cuda_kernel_256.cu
    52 kB
    Upload 9 files almost 2 years ago
  • config.json
    1.2 kB
    Upload config.json almost 2 years ago
  • configuration_qwen.py
    2.35 kB
    Upload 9 files almost 2 years ago
  • cpp_kernels.py
    1.92 kB
    Upload 9 files almost 2 years ago
  • generation_config.json
    222 Bytes
    Upload 9 files almost 2 years ago
  • gptq_model-8bit-128g.bin
    9.12 GB
    xet
    AutoGPTQ model for Qwen/Qwen-7B: 8bits, gr128, desc_act=False almost 2 years ago
  • gptq_model-8bit-128g.safetensors
    9.12 GB
    xet
    AutoGPTQ model for Qwen/Qwen-7B: 8bits, gr128, desc_act=False almost 2 years ago
  • modeling_qwen.py
    55.6 kB
    Upload 9 files almost 2 years ago
  • pytorch_model.bin
    5.86 GB
    xet
    Upload pytorch_model.bin almost 2 years ago
  • quantize_config.json
    294 Bytes
    Upload quantize_config.json almost 2 years ago
  • qwen.tiktoken
    2.56 MB
    Upload 9 files almost 2 years ago
  • qwen_generation_utils.py
    14.6 kB
    Upload 9 files almost 2 years ago
  • tokenization_qwen.py
    9.62 kB
    Upload 9 files almost 2 years ago
  • tokenizer_config.json
    174 Bytes
    Upload tokenizer_config.json almost 2 years ago