Text Generation
Transformers
Safetensors
English
mistral
unsloth
conversational
text-generation-inference
4-bit precision
bitsandbytes
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("Manal0809/Mistrial_instruct")
model = AutoModelForCausalLM.from_pretrained("Manal0809/Mistrial_instruct")
messages = [
{"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
messages,
add_generation_prompt=True,
tokenize=True,
return_dict=True,
return_tensors="pt",
).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))Quick Links
Finetune Mistral, Gemma, Llama 2-5x faster with 70% less memory via Unsloth!
We have a free Google Colab Tesla T4 notebook for Mistral Nemo 12b here: https://colab.research.google.com/drive/17d3U-CAIwzmbDRqbZ9NnpHxCkmXB6LZ0?usp=sharing
✨ Finetune for Free
All notebooks are beginner friendly! Add your dataset, click "Run All", and you'll get a 2x faster finetuned model which can be exported to GGUF, vLLM or uploaded to Hugging Face.
| Unsloth supports | Free Notebooks | Performance | Memory use |
|---|---|---|---|
| Llama-3 8b | ▶️ Start on Colab | 2.4x faster | 58% less |
| Gemma 7b | ▶️ Start on Colab | 2.4x faster | 58% less |
| Mistral 7b | ▶️ Start on Colab | 2.2x faster | 62% less |
| Llama-2 7b | ▶️ Start on Colab | 2.2x faster | 43% less |
| TinyLlama | ▶️ Start on Colab | 3.9x faster | 74% less |
| CodeLlama 34b A100 | ▶️ Start on Colab | 1.9x faster | 27% less |
| Mistral 7b 1xT4 | ▶️ Start on Kaggle | 5x faster* | 62% less |
| DPO - Zephyr | ▶️ Start on Colab | 1.9x faster | 19% less |
- This conversational notebook is useful for ShareGPT ChatML / Vicuna templates.
- This text completion notebook is for raw text. This DPO notebook replicates Zephyr.
- * Kaggle has 2x T4s, but we use 1. Due to overhead, 1x T4 is 5x faster.
- Downloads last month
- 1



# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="Manal0809/Mistrial_instruct") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)