metadata
base_model: TheBloke/Llama-2-7B-fp16
tags:
  - LoRA
  - bittensor
  - gradients
license: apache-2.0

Submission InstructionTask (LoRA)

LoRA adapter (r=16) for TheBloke/Llama-2-7B-fp16, fine-tuned on 2500 examples from an Alpaca-like dataset on a GPU.

  • SHA256: 8635187ed87214f676aa2d7ef67488d7950e2aeea0b8d679b0df84ca7b921326
  • Training: 1 epoch, batch size 1, 4-bit quantization
  • Upload time: 2025-07-18T21:48:06+02:00
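The checksum listed above can be compared against a local download. The card does not say which file the hash covers, so the sketch below takes the path as a parameter; `sha256_of_file` and `matches_expected` are illustrative helper names, not part of any library.

```python
import hashlib

# Checksum copied from this card. Which file it covers is not stated,
# so the path passed in below is an assumption the caller has to make.
EXPECTED_SHA256 = "8635187ed87214f676aa2d7ef67488d7950e2aeea0b8d679b0df84ca7b921326"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """True if the file's digest equals the expected checksum (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_expected("adapter_model.safetensors")` would check a downloaded adapter file, assuming that is the file the hash refers to.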

Usage

from peft import PeftModel
from transformers import AutoModelForCausalLM

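# Load the base model named in the card metadata.
base = AutoModelForCausalLM.from_pretrained("TheBloke/Llama-2-7B-fp16")
# Attach the LoRA adapter weights from this repository to the base model.
model = PeftModel.from_pretrained(base, "raniero/submission_instruction_5000_gpu")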
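Once the adapter is loaded, prompts probably work best in the format used during fine-tuning. The card only says the dataset is "Alpaca-like", so the sketch below assumes the standard Stanford Alpaca prompt template; the actual training template may differ. `build_alpaca_prompt` and `generate_response` are hypothetical helper names, and the tokenizer is assumed to come from `AutoTokenizer.from_pretrained` on the base model.

```python
def build_alpaca_prompt(instruction, input_text=""):
    """Format an instruction (and optional input) with the standard Alpaca template.

    Assumption: the adapter was trained on this template; the card does not confirm it.
    """
    if input_text:
        return (
            "Below is an instruction that describes a task, paired with an input "
            "that provides further context. Write a response that appropriately "
            "completes the request.\n\n"
            f"### Instruction:\n{instruction}\n\n"
            f"### Input:\n{input_text}\n\n"
            "### Response:\n"
        )
    return (
        "Below is an instruction that describes a task. Write a response that "
        "appropriately completes the request.\n\n"
        f"### Instruction:\n{instruction}\n\n"
        "### Response:\n"
    )


def generate_response(model, tokenizer, prompt, max_new_tokens=128):
    """Tokenize the prompt, generate a continuation, and decode it to text."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

With `model` loaded as in the usage snippet, a call might look like `generate_response(model, tokenizer, build_alpaca_prompt("Summarize the text.", "LoRA adds low-rank update matrices to frozen weights."))`. The decoded string includes the prompt itself, so the answer is the part after `### Response:`.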