HuggingFaceH4/ultrachat_200k
Viewer • Updated • 515k • 74.1k • 714
How to use terry69/tiny-llama-20p with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T")
model = PeftModel.from_pretrained(base_model, "terry69/tiny-llama-20p")This model is a fine-tuned version of TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T on the HuggingFaceH4/ultrachat_200k dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 1.0275 | 0.99 | 96 | nan |