Instructions to use maderix/llama-65b-4bit with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use maderix/llama-65b-4bit with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("maderix/llama-65b-4bit", dtype="auto") - Notebooks
- Google Colab
- Kaggle
ggml_alpaca_30b_q4 ?
#5
by DaveScream - opened
can you please make quantized 4bit version of llama30b + alpaca lora 30b?
sharing alpaca_lora 30b: https://github.com/tloen/alpaca-lora/issues/68
https://github.com/johnsmith0031/alpaca_lora_4bit
the 4bit llama 30b appears to be here, have you tried using it with alpaca lora?