How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="DanielClough/Candle_MistralLite")
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("DanielClough/Candle_MistralLite")
model = AutoModelForCausalLM.from_pretrained("DanielClough/Candle_MistralLite")
Quick Links

This repo includes .gguf built for HuggingFace/Candle. They will not work with llama.cpp.

This model should be used with the Config config_chat_ml.

Refer to the original repo for more details.

Downloads last month
59
GGUF
Model size
7B params
Architecture
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

8-bit

16-bit

Inference Providers NEW

Model tree for DanielClough/Candle_MistralLite

Quantizations
2 models