Writer
/

palmyra-mini-thinking-a

Text Generation

text-generation-inference

Model card Files Files and versions

tperes commited on Sep 11, 2025

Commit

acb89ef

·

verified ·

1 Parent(s): 6883aa8

Update README.md

Files changed (1) hide show

README.md +4 -6

README.md CHANGED Viewed

@@ -106,7 +106,10 @@ output_text = tokenizer.decode(output_id[0][input_ids.shape[1] :])
 print(output_text)
 ```
-## curl Instructions
 ```py
 curl -X POST http://localhost:8000/v1/chat/completions \
   -H "Content-Type: application/json" \
@@ -123,11 +126,6 @@ curl -X POST http://localhost:8000/v1/chat/completions \
   }'
 ```
-## VLLM Inference
-```py
-vllm serve Writer/palmyra-mini-thinking-a
-```
 ## Ethical Considerations

 print(output_text)
 ```
+## Running with vLLM
+```py
+vllm serve Writer/palmyra-mini-thinking-a
+```
 ```py
 curl -X POST http://localhost:8000/v1/chat/completions \
   -H "Content-Type: application/json" \
   }'
 ```
 ## Ethical Considerations