How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="GreenBitAI/LLaMA-2-7B-4bit-groupsize32")
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("GreenBitAI/LLaMA-2-7B-4bit-groupsize32")
model = AutoModelForCausalLM.from_pretrained("GreenBitAI/LLaMA-2-7B-4bit-groupsize32")
Quick Links

GreenBit LLaMA

This is GreenBitAI's pretrained 4-bit LLaMA-2 7B model with advanced compression design and lossless performance to FP16 models.

Please refer to our Github page for the code to run the model and more information.

Model Description

Downloads last month
12
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support