Tushar1K's picture
Update README.md
0f04eca verified
|
Raw
History Blame Contribute Delete
1.84 kB
---
license: apache-2.0
base_model: google/gemma-2b
tags:
- gemma
- lora
- qlora
- instruction-tuning
- unsloth
- transformers
- text-generation
library_name: transformers
---
- **Developed by:** Tushar Kamthe
- **License:** apache-2.0
- **Finetuned from model :** unsloth/gemma-2-2b-it-bnb-4bit
-
# Gemma-2B QLoRA Fine-tuned Model
## Model Description
This model is a fine-tuned version of **Google's Gemma-2B** using the **QLoRA (Quantized Low-Rank Adaptation)** technique. The model was trained using **parameter-efficient fine-tuning (PEFT)**, where only LoRA adapters were trained while keeping the base model weights frozen.
The model is designed for **instruction-following text generation tasks**.
Fine-tuning was performed using:
- QLoRA (4-bit quantization)
- LoRA adapters
- HuggingFace Transformers
- PEFT
- Unsloth for faster training
---
## Base Model
Base model used for training:
google/gemma-2b
---
## Training Details
### Training Method
The model was trained using **QLoRA**, which enables efficient training of large language models by:
- Loading the base model in **4-bit quantized format**
- Training **LoRA adapter weights only**
- Keeping base model weights frozen
This significantly reduces GPU memory requirements.
---
### Training Configuration
| Parameter | Value |
|--------|--------|
| Method | QLoRA |
| Quantization | 4-bit (NF4) |
| LoRA Rank (r) | 16 |
| LoRA Alpha | 64 |
| LoRA Dropout | 0.05 |
| Optimizer | AdamW |
| Precision | bfloat16 |
| Framework | HuggingFace Transformers |
---
### Hardware
Training was performed on:
- GPU: NVIDIA GPU (Colab / Local GPU)
- Framework: PyTorch
- Libraries:
- transformers
- peft
- datasets
- unsloth
---
## Dataset
The model was fine-tuned on a **custom instruction dataset** containing prompt-response pairs.
Dataset format: