adriabama06/DeepCoder-1.5B-Preview-FP8-W8A8
Tags: Text Generation, Transformers, PyTorch, English, qwen2, vllm, fp8, w8a8, llmcompressor, smoothquant, conversational, text-generation-inference, smooth_quant
License: mit
Branch: main
Total size: 3.22 GB
1 contributor
History: 2 commits
Latest commit: 3b4b77f (verified), "Upload 11 files", by adriabama06, 10 months ago
File                               Size       Storage  Last commit      Updated
.gitattributes                     1.57 kB             Upload 11 files  10 months ago
README.md                          6.78 kB             Upload 11 files  10 months ago
config.json                        832 Bytes           Upload 11 files  10 months ago
generation_config.json             181 Bytes           Upload 11 files  10 months ago
inputs_stats.pth                   10.5 MB    xet      Upload 11 files  10 months ago
outputs_stats.pth                  15 MB      xet      Upload 11 files  10 months ago
pytorch_model-00001-of-00002.bin   2 GB       xet      Upload 11 files  10 months ago
pytorch_model-00002-of-00002.bin   1.18 GB    xet      Upload 11 files  10 months ago
pytorch_model.bin.index.json       43.4 kB             Upload 11 files  10 months ago
special_tokens_map.json            485 Bytes           Upload 11 files  10 months ago
tokenizer.json                     11.4 MB    xet      Upload 11 files  10 months ago
tokenizer_config.json              6.76 kB             Upload 11 files  10 months ago