CodeLlama-7b-Instruct-hf
GenAI WebGPU Model
Generated using onnxruntime_genai.models.builder
- onnxruntime-genai commit:
41c4ce18fec1240c2f848725e31fbe8854010188 - transformers version:
4.57.0 - Precision: int4
- Execution Provider: webgpu
python -m onnxruntime_genai.models.builder -p int4 -e webgpu -m codellama/CodeLlama-7b-Instruct-hf -o E:\ai-models\CodeLlama-7b-Instruct-hf\onnx-webgpu\ --extra_options int4_algo_config=rtn_last int4_is_symmetric=true prune_lm_head=true enable_webgpu_graph=true
GGUF Model
Downloaded from: TheBloke/CodeLlama-7B-Instruct-GGUF