second-state
/

EXAONE-3.0-7.8B-Instruct-GGUF

Text Generation

Model card Files Files and versions

apepkuss79 commited on Sep 30, 2024

Commit

f58ded6

·

verified ·

1 Parent(s): 55c1e54

Update README.md

Files changed (1) hide show

README.md +12 -16

README.md CHANGED Viewed

@@ -36,34 +36,30 @@ tags:
 - LlamaEdge version: coming soon
-<!-- - LlamaEdge version: [v0.12.4](https://github.com/LlamaEdge/LlamaEdge/releases/tag/0.12.4) and above
 - Prompt template
-  - Prompt type: `llama-3-chat`
   - Prompt string
     ```text
-    <|begin_of_text|><|start_header_id|>system<|end_header_id|>
-    {{ system_prompt }}<|eot_id|><|start_header_id|>user<|end_header_id|>
-    {{ user_message_1 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
-    {{ model_answer_1 }}<|eot_id|><|start_header_id|>user<|end_header_id|>
-    {{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
-    ``` -->
 - Context size: `4096`
-<!-- - Run as LlamaEdge service
   ```bash
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:EXAONE-3.0-7.8B-Instruct-Q5_K_M.gguf \
       llama-api-server.wasm \
-      --prompt-template llama-3-chat \
       --ctx-size 4096 \
       --model-name EXAONE-3.0-7.8B-Instruct
   ```
@@ -73,9 +69,9 @@ tags:
   ```bash
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:EXAONE-3.0-7.8B-Instruct-Q5_K_M.gguf \
     llama-chat.wasm \
-    --prompt-template llama-3-chat \
     --ctx-size 4096
-  ``` -->
 ## Quantized GGUF Models

 - LlamaEdge version: coming soon
+<!-- - LlamaEdge version: [v0.12.4](https://github.com/LlamaEdge/LlamaEdge/releases/tag/0.12.4) and above -->
 - Prompt template
+  - Prompt type: `exaone-chat`
   - Prompt string
     ```text
+    [|system|]system_prompt_text[|endofturn|]
+    [|user|]user_1st_turn_text
+    [|assistant|]assistant_1st_turn_text[|endofturn|]
+    [|user|]user_2nd_turn_text
+    [|assistant|]assistant_2nd_turn_text[|endofturn|]
+    ```
 - Context size: `4096`
+- Run as LlamaEdge service
   ```bash
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:EXAONE-3.0-7.8B-Instruct-Q5_K_M.gguf \
       llama-api-server.wasm \
+      --prompt-template exaone-chat \
       --ctx-size 4096 \
       --model-name EXAONE-3.0-7.8B-Instruct
   ```
   ```bash
   wasmedge --dir .:. --nn-preload default:GGML:AUTO:EXAONE-3.0-7.8B-Instruct-Q5_K_M.gguf \
     llama-chat.wasm \
+    --prompt-template exaone-chat \
     --ctx-size 4096
+  ```
 ## Quantized GGUF Models