Intel
/

phi-2-int4-inc

Text Generation

text-generation-inference

4-bit precision

intel/auto-round

Model card Files Files and versions

wenhuach commited on Feb 28, 2024

Commit

f01205c

·

verified ·

1 Parent(s): 5b294fa

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -50,11 +50,12 @@ Install [AutoGPTQ](https://github.com/AutoGPTQ/AutoGPTQ) from source first
 from transformers import AutoModelForCausalLM, AutoTokenizer
 quantized_model_dir = "Intel/phi-2-int4-inc"
 tokenizer = AutoTokenizer.from_pretrained(quantized_model_dir)
-model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", trust_remote_code=True)
 text = "There is a girl who likes adventure,"
 inputs = tokenizer(text, return_tensors="pt", return_attention_mask=False).to(model.device)
 outputs = model.generate(**inputs, max_new_tokens=50)
 text = tokenizer.batch_decode(outputs)[0]
 ```

 from transformers import AutoModelForCausalLM, AutoTokenizer
 quantized_model_dir = "Intel/phi-2-int4-inc"
 tokenizer = AutoTokenizer.from_pretrained(quantized_model_dir)
+model = AutoModelForCausalLM.from_pretrained(quantized_model_dir, device_map="auto", trust_remote_code=True)
 text = "There is a girl who likes adventure,"
 inputs = tokenizer(text, return_tensors="pt", return_attention_mask=False).to(model.device)
 outputs = model.generate(**inputs, max_new_tokens=50)
 text = tokenizer.batch_decode(outputs)[0]
+print(text)
 ```