Update README.md
Browse files
README.md
CHANGED
|
@@ -45,6 +45,8 @@ Cost for inference.
|
|
| 45 |
```python
|
| 46 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
| 47 |
|
|
|
|
|
|
|
| 48 |
model = AutoModelForCausalLM.from_pretrained(model_id, device_map={"": 0}, trust_remote_code=True)
|
| 49 |
tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=True, trust_remote_code=True)
|
| 50 |
|
|
@@ -63,4 +65,7 @@ with torch.no_grad():
|
|
| 63 |
# but it might continue generating irrelevant text. this way the model will stop at the right place
|
| 64 |
model_response = model.generate(**model_input, max_new_tokens=512, eos_token_id=tokenizer.eos_token_id, )
|
| 65 |
print(tokenizer.decode(model_response[0], skip_special_tokens=False))
|
| 66 |
-
```
|
|
|
|
|
|
|
|
|
|
|
|
| 45 |
```python
|
| 46 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
| 47 |
|
| 48 |
+
model_id = "WeeRobots/phi-2-chat-v05"
|
| 49 |
+
|
| 50 |
model = AutoModelForCausalLM.from_pretrained(model_id, device_map={"": 0}, trust_remote_code=True)
|
| 51 |
tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=True, trust_remote_code=True)
|
| 52 |
|
|
|
|
| 65 |
# but it might continue generating irrelevant text. this way the model will stop at the right place
|
| 66 |
model_response = model.generate(**model_input, max_new_tokens=512, eos_token_id=tokenizer.eos_token_id, )
|
| 67 |
print(tokenizer.decode(model_response[0], skip_special_tokens=False))
|
| 68 |
+
```
|
| 69 |
+
|
| 70 |
+
# Non production quality
|
| 71 |
+
Be aware that this model tuning wasn't thoroughly tested, and isn't meant to be used in production, only for experimentation or hobby projects.
|