dsaunders23/ChessAlpacaPrediction
Viewer • Updated • 1.93k • 7
How to use dsaunders23/ChessPredictor with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("TinyLlama/TinyLlama-1.1B-Chat-v1.0")
model = PeftModel.from_pretrained(base_model, "dsaunders23/ChessPredictor")axolotl version: 0.8.0.dev0
base_model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
datasets:
- path: dsaunders23/ChessAlpacaPrediction
type: alpaca
output_dir: ./outputs/mymodel
sequence_len: 4096
adapter: lora
lora_r: 8
lora_alpha: 16
lora_dropout: 0.05
lora_target_modules:
- q_proj
- v_proj
- k_proj
- o_proj
- gate_proj
- down_proj
- up_proj
gradient_accumulation_steps: 1
micro_batch_size: 16
num_epochs: 1
optimizer: adamw_bnb_8bit
learning_rate: 0.0002
load_in_8bit: true
train_on_inputs: false
bf16: auto
This model is a fine-tuned version of TinyLlama/TinyLlama-1.1B-Chat-v1.0 on the dsaunders23/ChessAlpacaPrediction dataset.
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Base model
TinyLlama/TinyLlama-1.1B-Chat-v1.0
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("TinyLlama/TinyLlama-1.1B-Chat-v1.0") model = PeftModel.from_pretrained(base_model, "dsaunders23/ChessPredictor")