Andyrasika/TweetSumm-tuned
Viewer • Updated • 1.1k • 60 • 1
How to use Andyrasika/fine-tuning-llama with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("Andyrasika/fine-tuning-llama", dtype="auto")This model is a fine-tuned version of TinyPixel/Llama-2-7B-bf16-sharded on the dialogstudio dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 1.9048 | 0.4 | 22 | 1.9220 |
| 1.824 | 0.8 | 44 | 1.8809 |
| 1.6784 | 1.2 | 66 | 1.8619 |
| 1.77 | 1.6 | 88 | 1.8537 |
| 1.6501 | 2.0 | 110 | 1.8522 |
from peft import AutoPeftModelForCausalLM
trained_model = AutoPeftModelForCausalLM.from_pretrained(
"Andyrasika/fine-tuning-llama",
low_cpu_mem_usage=True,
)
merged_model = model.merge_and_unload()
merged_model.save_pretrained("merged_model", safe_serialization=True)
tokenizer.save_pretrained("merged_model")
Base model
TinyPixel/Llama-2-7B-bf16-sharded