vrs-gemma-2b-it

This is a vrs-gemma-2b-it model fine-tuned using LoRA (16-bit precision)

πŸ”§ Training

It was trained on a Google Colab notebook with a T4 GPU and high RAM.

πŸ’» Usage

# Running the model on a GPU using different precisions:- For Float16 -
#!pip install -U transformers
#!pip install "torch>=2.1.1" -U
#!pip install accelerate
#!huggingface-cli login
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

tokenizer = AutoTokenizer.from_pretrained("VijayShinde1996/vrs-gemma-2b-it")
model = AutoModelForCausalLM.from_pretrained("VijayShinde1996/vrs-gemma-2b-it", device_map="auto", torch_dtype=torch.float16)

input_text = "Write me a poem about Machine Learning."
input_ids = tokenizer(input_text, return_tensors="pt").to("cuda")

outputs = model.generate(**input_ids, max_length=500)
print(tokenizer.decode(outputs[0]))
Downloads last month
1
Safetensors
Model size
3B params
Tensor type
F16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support