Update README.md
README.md
CHANGED

@@ -67,6 +67,22 @@ code = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(code)
```

### Size Comparison

The table below compares the VRAM required to load and train the FP16 base model against the 4-bit GPTQ-quantized model trained with PEFT. The base-model figures are taken from Hugging Face's [Model Memory Calculator](https://huggingface.co/docs/accelerate/main/en/usage_guides/model_size_estimator).
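As a quick sanity check on the base-model numbers, the calculator's rule of thumb for full fine-tuning with Adam is roughly 4x the model's size in memory (weights, gradients, and Adam's two moment buffers):

```python
# Adam training footprint ~= 4x model size:
# weights + gradients + Adam's first and second moment buffers.
model_size_gb = 12.37            # FP16 base model size from the calculator
training_gb = 4 * model_size_gb  # -> 49.48 GB, matching the table below
print(f"Training using Adam: {training_gb:.2f} GB")
```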
| Model              | Total Size | Training Using Adam |
|--------------------|------------|---------------------|
| Base Model         | 12.37 GB   | 49.48 GB            |
| 4bitQuantized+PEFT | 3.90 GB    | 11 GB               |
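For reference, here is a minimal sketch of what the 4bitQuantized+PEFT setup looks like with `transformers` and `peft`. The model ID and LoRA hyperparameters are placeholders, not the exact values behind the numbers above:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Placeholder checkpoint ID; loading GPTQ weights also requires
# the optimum and auto-gptq packages to be installed.
model_id = "your-org/your-model-4bit-gptq"

model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Freeze the quantized base weights and train only small LoRA adapters.
model = prepare_model_for_kbit_training(model)
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # assumption; depends on the architecture
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```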

## Training Details