Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

rayf-07
/
Ouro-2.6B_smoothquant_W8A8

Text Generation
Transformers
PyTorch
ouro
conversational
custom_code
Model card Files Files and versions
xet
Community
Ouro-2.6B_smoothquant_W8A8 / qouro_runtime /quantization
114 kB
  • 1 contributor
History: 2 commits
rayf-07's picture
rayf-07
Upload Ouro-2.6B_smoothquant_W8A8 with bundled source code
ae8294e verified 4 months ago
  • __pycache__
    Upload Ouro-2.6B_smoothquant_W8A8 with bundled source code 4 months ago
  • __init__.py
    190 Bytes
    Upload Ouro-2.6B_smoothquant_W8A8 with bundled source code 4 months ago
  • awq_core.py
    3.16 kB
    Upload Ouro-2.6B_smoothquant_W8A8 with bundled source code 4 months ago
  • calibration.py
    3.21 kB
    Upload Ouro-2.6B_smoothquant_W8A8 with bundled source code 4 months ago
  • config.py
    1.92 kB
    Upload Ouro-2.6B_smoothquant_W8A8 with bundled source code 4 months ago
  • modules.py
    8.47 kB
    Upload Ouro-2.6B_smoothquant_W8A8 with bundled source code 4 months ago
  • observers.py
    2.16 kB
    Upload Ouro-2.6B_smoothquant_W8A8 with bundled source code 4 months ago
  • pipeline.py
    6 kB
    Upload Ouro-2.6B_smoothquant_W8A8 with bundled source code 4 months ago
  • smoothquant.py
    3.01 kB
    Upload Ouro-2.6B_smoothquant_W8A8 with bundled source code 4 months ago