ibivibiv/summary_instruct
Viewer • Updated • 357k • 10
How to use ibivibiv/llama3-8b-instruct-summary with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")
model = PeftModel.from_pretrained(base_model, "ibivibiv/llama3-8b-instruct-summary")axolotl version: 0.4.0
adapter: qlora
base_model: meta-llama/Meta-Llama-3-8B-Instruct
base_model_config: meta-llama/Meta-Llama-3-8B-Instruct
datasets:
- path: ibivibiv/summary_instruct
type: alpaca
flash_attention: true
gradient_accumulation_steps: 4
gradient_checkpointing: true
hf_use_auth_token: true
hub_model_id: ibivibiv/llama3-8b-instruct-summary
learning_rate: 0.0002
load_in_4bit: true
logging_steps: 1
lora_alpha: 16
lora_dropout: 0.05
lora_r: 32
lora_target_linear: true
lr_scheduler: cosine
micro_batch_size: 2
model_type: AutoModelForCausalLM
num_epochs: 3
optimizer: paged_adamw_32bit
output_dir: /job/out
sample_packing: true
save_safetensors: true
sequence_len: 4096
special_tokens:
pad_token: <|end_of_text|>
tokenizer_type: AutoTokenizer
wandb_project: TuneStudio
wandb_run_id: summllamma
wandb_watch: 'true'
warmup_steps: 10
This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B-Instruct on the ibivibiv/summary_instruct dataset.
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Base model
meta-llama/Meta-Llama-3-8B-Instruct