This model was created using https://github.com/InternLM/lmdeploy by calling lmdeploy lite smooth_quant meta-llama/Meta-Llama-3-70B-Instruct --work-dir output/llama-3-70b-instruct-sq-w8. To use it, you must accept meta's llama3 license.

Downloads last month
1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support