EXL2 Quants of TareksTesting/Scripturient-V2.3-LLaMa-70B
EXL2 quants of TareksTesting/Scripturient-V2.3-LLaMa-70B, quantized with exllamav2.
Quants
| Quant (Revision) | Bits per Weight | Head Bits |
|---|---|---|
| 4.0_H6 | 4.0 | 6 |
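
As a rough guide to what the Bits per Weight column means for disk and VRAM use, the sketch below estimates the weight size from a parameter count and a bits-per-weight figure. It assumes a nominal 70e9 parameters for a "70B" model and ignores the head layer's separate bit width, the KV cache, and file overhead, so treat the result as a ballpark only. The function name `estimate_weight_size_gb` is illustrative and not part of exllamav2.

```python
def estimate_weight_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight size in decimal gigabytes (1 GB = 1e9 bytes)."""
    total_bits = n_params * bits_per_weight
    return total_bits / 8 / 1e9


if __name__ == "__main__":
    # Assumption: nominal 70e9 parameters; the real count differs slightly.
    size_gb = estimate_weight_size_gb(70e9, 4.0)
    print(f"Approximate weight size at 4.0 bpw: {size_gb:.1f} GB")
```

The actual download will differ somewhat, since the head layer is stored at its own bit width and the repository includes tokenizer and config files.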
Downloading quants with huggingface-cli
Install huggingface-cli:
pip install -U "huggingface_hub[cli]"
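Download a quant by targeting its specific revision (branch). The command below uses "5.0bpw_H6" as the revision; substitute the revision name of the quant you want: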
huggingface-cli download ArtusDev/TareksTesting_Scripturient-V2.3-LLaMa-70B-EXL2 --revision "5.0bpw_H6" --local-dir ./
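
The same download can be done from Python with the `huggingface_hub` library. The sketch below is a minimal example, assuming `huggingface_hub` is installed and that the revision you pass exists as a branch in the repository. The helper names (`build_download_args`, `list_branches`, `download_quant`) are hypothetical wrappers introduced here; only `snapshot_download` and `HfApi.list_repo_refs` are library calls.

```python
REPO_ID = "ArtusDev/TareksTesting_Scripturient-V2.3-LLaMa-70B-EXL2"


def build_download_args(revision: str, local_dir: str = "./") -> dict:
    """Build keyword arguments for snapshot_download for one quant revision."""
    if not revision:
        raise ValueError("revision must be a non-empty branch name")
    return {"repo_id": REPO_ID, "revision": revision, "local_dir": local_dir}


def list_branches(repo_id: str = REPO_ID) -> list:
    """Return the branch names of the repo (requires network access)."""
    from huggingface_hub import HfApi  # imported lazily; needs huggingface_hub

    refs = HfApi().list_repo_refs(repo_id)
    return [branch.name for branch in refs.branches]


def download_quant(revision: str, local_dir: str = "./") -> str:
    """Download one quant revision and return the local folder path."""
    from huggingface_hub import snapshot_download

    return snapshot_download(**build_download_args(revision, local_dir))


if __name__ == "__main__":
    # Check which revisions exist before starting a large download.
    print(list_branches())
    # Mirrors the CLI command above; swap in the revision you want.
    download_quant("5.0bpw_H6", local_dir="./")
```

Listing the branches first is a cheap way to confirm the exact revision name before starting a download of this size.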