agraj07 commited on
Commit
a1f0da9
·
verified ·
1 Parent(s): 2fcc977

Create README.md

Browse files

The model has been quantized using the below provided config-
BitsAndBytesConfig(
load_in_4bit=True,
bnb_4bit_quant_type="nf4",
bnb_4bit_use_double_quant=True,
bnb_4bit_compute_dtype=torch.bfloat16
)

Files changed (1) hide show
  1. README.md +4 -0
README.md ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - meta-llama/Llama-2-7b-chat-hf
4
+ ---