postylem
/

Llama-2-70b-hf-4bit

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions

Jacob Louis Hoover commited on Jan 31, 2024

Commit

3a10f86

·

verified ·

1 Parent(s): aa658a5

Update README.md

Files changed (1) hide show

README.md +5 -6

README.md CHANGED Viewed

@@ -1,13 +1,13 @@
 ---
 library_name: transformers
-tags: []
 ---
 # Model Card for Model ID
-Just a 4bit quantized version of `meta-llama/Llama-2-70b-hf`.  Made as:
-```
 from transformers import AutoModelForCausalLM
 model = AutoModelForCausalLM.from_pretrained(
@@ -15,6 +15,7 @@ model = AutoModelForCausalLM.from_pretrained(
   device_map="auto",
   load_in_4bit=True
 )
 ```
 saved for later use (to save 30mins)
@@ -208,6 +209,4 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
 ## Model Card Contact
-[More Information Needed]

 ---
 library_name: transformers
+pipeline_tag: text-generation
 ---
 # Model Card for Model ID
+Just a 4bit quantized version of [`meta-llama/Llama-2-70b-hf`](https://huggingface.co/meta-llama/Llama-2-70b-hf).  Made as:
+```python
 from transformers import AutoModelForCausalLM
 model = AutoModelForCausalLM.from_pretrained(
   device_map="auto",
   load_in_4bit=True
 )
+model.push_to_hub('Llama-2-70b-hf-4bit')
 ```
 saved for later use (to save 30mins)
 ## Model Card Contact
+[More Information Needed]