Llama-2-13B Lemon Fine-tuned Model

Model Image

This is a fine-tuned version of Llama-2-13B trained on the llama2-Lemon dataset to improve instruction-following capabilities. The model has been optimized using Unsloth for efficient training with LoRA adapters.

Model Details

  • Base Model: unsloth/llama-2-13b
  • Training Dataset: Stormtrooperaim/llama2-Lemon
  • Training Method: QLoRA with Unsloth
  • Model Size: 13B parameters

Usage

This model uses the Llama-2 prompt format:

<s>[INST] Your instruction here [/INST]

Training Details

  • Trained with 4-bit quantization using Unsloth
  • LoRA rank: 16
  • Learning rate: 2e-4
  • Optimizer: AdamW 8-bit
Downloads last month
35
Safetensors
Model size
13B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Stormtrooperaim/llama2-13b-lemon

Finetuned
(4)
this model

Dataset used to train Stormtrooperaim/llama2-13b-lemon

Collection including Stormtrooperaim/llama2-13b-lemon