Phi doesn't play well with `device_map="auto"`, so you should specify the device explicitly, as in one of the following:
1. FP16 / Flash-Attention / CUDA:
```python
import torch  # needed for the FP32 variants below
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("SE6446/Phasmid-2_v2", torch_dtype="auto", flash_attn=True, flash_rotary=True, fused_dense=True, device_map="cuda", trust_remote_code=True)
```
2. FP16 / CUDA:
```python
model = AutoModelForCausalLM.from_pretrained("SE6446/Phasmid-2_v2", torch_dtype="auto", device_map="cuda", trust_remote_code=True)
```
3. FP32 / CUDA:
```python
model = AutoModelForCausalLM.from_pretrained("SE6446/Phasmid-2_v2", torch_dtype=torch.float32, device_map="cuda", trust_remote_code=True)
```
4. FP32 / CPU:
```python
model = AutoModelForCausalLM.from_pretrained("SE6446/Phasmid-2_v2", torch_dtype=torch.float32, device_map="cpu", trust_remote_code=True)
```
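The four variants differ only in the keyword arguments passed to `from_pretrained`. As an illustration (this helper is not part of the model card; its name and structure are assumptions), the argument sets can be built programmatically:

```python
# Hypothetical helper: build the from_pretrained keyword arguments for the
# precision/device combinations listed above.
def phasmid_load_kwargs(dtype, device, flash=False):
    """dtype is "auto" for FP16 or torch.float32 for FP32 (torch import
    omitted here so the sketch stays self-contained)."""
    kwargs = {"torch_dtype": dtype, "device_map": device, "trust_remote_code": True}
    if flash:
        # Flash-Attention variant (option 1) adds the fused-kernel flags.
        kwargs.update(flash_attn=True, flash_rotary=True, fused_dense=True)
    return kwargs
```

For example, `phasmid_load_kwargs("auto", "cuda", flash=True)` reproduces the arguments of option 1, and the dict can be splatted into `AutoModelForCausalLM.from_pretrained("SE6446/Phasmid-2_v2", **kwargs)`.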
Then tokenize your prompt with the following snippet:
```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("SE6446/Phasmid-2_v2", trust_remote_code=True)
inputs = tokenizer('''SYSTEM: You are a helpful assistant. Please answer truthfully and politely. {custom_prompt}\n
USER: {{userinput}}\n
ASSISTANT: {{character name if applicable}}:''', return_tensors="pt", return_attention_mask=False)
```
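The placeholders in the template can be filled programmatically before tokenizing. A minimal sketch, assuming the function name, the separator convention, and the optional-character handling (none of which the model card specifies):

```python
# Hypothetical helper matching the SYSTEM/USER/ASSISTANT template above.
def build_phasmid_prompt(user_input, custom_prompt="", character=""):
    system = "SYSTEM: You are a helpful assistant. Please answer truthfully and politely."
    if custom_prompt:
        system += " " + custom_prompt
    # The character name is only included if one is applicable.
    assistant = f"ASSISTANT: {character}:" if character else "ASSISTANT:"
    return f"{system}\n\nUSER: {user_input}\n\n{assistant}"

prompt = build_phasmid_prompt("What is a phasmid?", character="Phasmid")
# pass the result to tokenizer(prompt, return_tensors="pt", return_attention_mask=False)
```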