Instructions for using prithivMLmods/Novaeus-Promptist-7B-Instruct with libraries, inference providers, notebooks, and local apps.

Libraries

How to use prithivMLmods/Novaeus-Promptist-7B-Instruct with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="prithivMLmods/Novaeus-Promptist-7B-Instruct")
messages = [
    {"role": "user", "content": "Who are you?"},
]
# The pipeline returns a list of results; print it to see the generated reply
print(pipe(messages))

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("prithivMLmods/Novaeus-Promptist-7B-Instruct")
model = AutoModelForCausalLM.from_pretrained("prithivMLmods/Novaeus-Promptist-7B-Instruct")
messages = [
    {"role": "user", "content": "Who are you?"},
]
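# Render the chat messages with the model's chat template and tokenize them
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

# Generate up to 40 new tokens, then decode only the tokens after the prompt
outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))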

Local Apps

vLLM

How to use prithivMLmods/Novaeus-Promptist-7B-Instruct with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "prithivMLmods/Novaeus-Promptist-7B-Instruct"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "prithivMLmods/Novaeus-Promptist-7B-Instruct",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
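
The same request can be sent from Python. The following is a minimal sketch using only the standard library, assuming the vLLM server started above is listening on `http://localhost:8000`; the helper names `build_chat_request` and `chat` are hypothetical and not part of vLLM.

```python
import json
import urllib.request


def build_chat_request(prompt,
                       base_url="http://localhost:8000",
                       model="prithivMLmods/Novaeus-Promptist-7B-Instruct"):
    """Build a POST request for the OpenAI-compatible chat completions route."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def chat(prompt, **kwargs):
    """Send the request and return the assistant's reply text."""
    with urllib.request.urlopen(build_chat_request(prompt, **kwargs)) as resp:
        body = json.load(resp)
    # OpenAI-style responses put the reply under choices[0].message.content
    return body["choices"][0]["message"]["content"]


# Usage (requires the server to be running):
# print(chat("What is the capital of France?"))
```

Passing `base_url="http://localhost:30000"` should target a server on a different port, such as the SGLang setup, since it exposes the same route.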

Use Docker

docker model run hf.co/prithivMLmods/Novaeus-Promptist-7B-Instruct

SGLang

How to use prithivMLmods/Novaeus-Promptist-7B-Instruct with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "prithivMLmods/Novaeus-Promptist-7B-Instruct" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "prithivMLmods/Novaeus-Promptist-7B-Instruct",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "prithivMLmods/Novaeus-Promptist-7B-Instruct" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "prithivMLmods/Novaeus-Promptist-7B-Instruct",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use prithivMLmods/Novaeus-Promptist-7B-Instruct with Docker Model Runner:
```
docker model run hf.co/prithivMLmods/Novaeus-Promptist-7B-Instruct
```

Improve language tag

by lbourdois - opened Apr 28, 2025

base: refs/heads/main ← from: refs/pr/2

Files changed (1)

README.md +107 -95

@@ -1,96 +1,108 @@
----
-license: creativeml-openrail-m
-datasets:
-- prithivMLmods/Prompt-Enhancement-Mini
-- gokaygokay/prompt-enhancement-75k
-- gokaygokay/prompt-enhancer-dataset
-language:
-- en
-base_model:
-- Qwen/Qwen2.5-7B-Instruct
-pipeline_tag: text-generation
-library_name: transformers
-tags:
-- Qwen2.5
-- Prompt_Enhance
-- 7B
-- Instruct
-- safetensors
-- pytorch
-- Promptist-Instruct
-- text-generation-inference
-- art
----
-### Novaeus-Promptist-7B-Instruct Uploaded Model Files
-The **Novaeus-Promptist-7B-Instruct** is a fine-tuned large language model derived from the **Qwen2.5-7B-Instruct** base model. It is optimized for **prompt enhancement, text generation**, and **instruction-following tasks**, providing high-quality outputs tailored to various applications.
-| **File Name [ Uploaded Files ]**                              | **Size**      | **Description**                          | **Upload Status** |
-|--------------------------------------------|---------------|------------------------------------------|-------------------|
-| `.gitattributes`                           | 1.57 kB       | Git attributes configuration for LFS.    | Uploaded          |
-| `README.md`                                | 400 Bytes     | Documentation about the model.           | Updated           |
-| `added_tokens.json`                        | 657 Bytes     | Custom tokens for tokenizer.             | Uploaded          |
-| `config.json`                              | 860 Bytes     | Configuration for the model.             | Uploaded          |
-| `generation_config.json`                   | 281 Bytes     | Configuration for text generation.       | Uploaded          |
-| `merges.txt`                               | 1.82 MB       | Byte-pair encoding (BPE) merge rules.    | Uploaded          |
-| `pytorch_model-00001-of-00004.bin`         | 4.88 GB       | Model weights (split part 1).            | Uploaded (LFS)    |
-| `pytorch_model-00002-of-00004.bin`         | 4.93 GB       | Model weights (split part 2).            | Uploaded (LFS)    |
-| `pytorch_model-00003-of-00004.bin`         | 4.33 GB       | Model weights (split part 3).            | Uploaded (LFS)    |
-| `pytorch_model-00004-of-00004.bin`         | 1.09 GB       | Model weights (split part 4).            | Uploaded (LFS)    |
-| `pytorch_model.bin.index.json`             | 28.1 kB       | Index file for model weights.            | Uploaded          |
-| `special_tokens_map.json`                  | 644 Bytes     | Map of special tokens for tokenizer.     | Uploaded          |
-| `tokenizer.json`                           | 11.4 MB       | Tokenizer data in JSON format.           | Uploaded (LFS)    |
-| `tokenizer_config.json`                    | 7.73 kB       | Tokenizer configuration file.            | Uploaded          |
-| `vocab.json`                               | 2.78 MB       | Vocabulary for tokenizer.                | Uploaded          |
----
-![Screenshot 2024-12-07 113150.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/pqFaT-78hssi106bfJwpN.png)
-### **Key Features:**
-1. **Prompt Refinement:**
-   Designed to enhance input prompts by rephrasing, clarifying, and optimizing for more precise outcomes.
-2. **Instruction Following:**
-   Accurately follows complex user instructions for various generation tasks, including creative writing, summarization, and question answering.
-3. **Customization and Fine-Tuning:**
-   Incorporates datasets specifically curated for prompt optimization, enabling seamless adaptation to specific user needs.
----
-### **Training Details:**
-- **Base Model:** [Qwen2.5-7B-Instruct](#)
-- **Datasets Used for Fine-Tuning:**
-  - **gokaygokay/prompt-enhancer-dataset:** Focuses on prompt engineering with 17.9k samples.
-  - **gokaygokay/prompt-enhancement-75k:** Encompasses a wider array of prompt styles with 73.2k samples.
-  - **prithivMLmods/Prompt-Enhancement-Mini:** A compact dataset (1.16k samples) for iterative refinement.
----
-### **Capabilities:**
-- **Prompt Optimization:**
-   Automatically refines and enhances user-input prompts for better generation results.
-- **Instruction-Based Text Generation:**
-   Supports diverse tasks, including:
-   - Creative writing (stories, poems, scripts).
-   - Summaries and paraphrasing.
-   - Custom Q&A systems.
-- **Efficient Fine-Tuning:**
-   Adaptable to additional fine-tuning tasks by leveraging the model's existing high-quality instruction-following capabilities.
----
-### **Usage Instructions:**
-1. **Setup:**
-   - Ensure all necessary model files, including shards, tokenizer configurations, and index files, are downloaded and placed in the correct directory.
-2. **Load Model:**
-   Use PyTorch or Hugging Face Transformers to load the model and tokenizer. Ensure `pytorch_model.bin.index.json` is correctly set for efficient shard-based loading.
-3. **Customize Generation:**
-   Adjust parameters in `generation_config.json` to control aspects such as temperature, top-p sampling, and maximum sequence length.
 ---

+---
+license: creativeml-openrail-m
+datasets:
+- prithivMLmods/Prompt-Enhancement-Mini
+- gokaygokay/prompt-enhancement-75k
+- gokaygokay/prompt-enhancer-dataset
+language:
+- zho
+- eng
+- fra
+- spa
+- por
+- deu
+- ita
+- rus
+- jpn
+- kor
+- vie
+- tha
+- ara
+base_model:
+- Qwen/Qwen2.5-7B-Instruct
+pipeline_tag: text-generation
+library_name: transformers
+tags:
+- Qwen2.5
+- Prompt_Enhance
+- 7B
+- Instruct
+- safetensors
+- pytorch
+- Promptist-Instruct
+- text-generation-inference
+- art
+---
+### Novaeus-Promptist-7B-Instruct Uploaded Model Files
+The **Novaeus-Promptist-7B-Instruct** is a fine-tuned large language model derived from the **Qwen2.5-7B-Instruct** base model. It is optimized for **prompt enhancement, text generation**, and **instruction-following tasks**, providing high-quality outputs tailored to various applications.
+| **File Name [ Uploaded Files ]**                              | **Size**      | **Description**                          | **Upload Status** |
+|--------------------------------------------|---------------|------------------------------------------|-------------------|
+| `.gitattributes`                           | 1.57 kB       | Git attributes configuration for LFS.    | Uploaded          |
+| `README.md`                                | 400 Bytes     | Documentation about the model.           | Updated           |
+| `added_tokens.json`                        | 657 Bytes     | Custom tokens for tokenizer.             | Uploaded          |
+| `config.json`                              | 860 Bytes     | Configuration for the model.             | Uploaded          |
+| `generation_config.json`                   | 281 Bytes     | Configuration for text generation.       | Uploaded          |
+| `merges.txt`                               | 1.82 MB       | Byte-pair encoding (BPE) merge rules.    | Uploaded          |
+| `pytorch_model-00001-of-00004.bin`         | 4.88 GB       | Model weights (split part 1).            | Uploaded (LFS)    |
+| `pytorch_model-00002-of-00004.bin`         | 4.93 GB       | Model weights (split part 2).            | Uploaded (LFS)    |
+| `pytorch_model-00003-of-00004.bin`         | 4.33 GB       | Model weights (split part 3).            | Uploaded (LFS)    |
+| `pytorch_model-00004-of-00004.bin`         | 1.09 GB       | Model weights (split part 4).            | Uploaded (LFS)    |
+| `pytorch_model.bin.index.json`             | 28.1 kB       | Index file for model weights.            | Uploaded          |
+| `special_tokens_map.json`                  | 644 Bytes     | Map of special tokens for tokenizer.     | Uploaded          |
+| `tokenizer.json`                           | 11.4 MB       | Tokenizer data in JSON format.           | Uploaded (LFS)    |
+| `tokenizer_config.json`                    | 7.73 kB       | Tokenizer configuration file.            | Uploaded          |
+| `vocab.json`                               | 2.78 MB       | Vocabulary for tokenizer.                | Uploaded          |
+---
+![Screenshot 2024-12-07 113150.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/pqFaT-78hssi106bfJwpN.png)
+### **Key Features:**
+1. **Prompt Refinement:**
+   Designed to enhance input prompts by rephrasing, clarifying, and optimizing for more precise outcomes.
+2. **Instruction Following:**
+   Accurately follows complex user instructions for various generation tasks, including creative writing, summarization, and question answering.
+3. **Customization and Fine-Tuning:**
+   Incorporates datasets specifically curated for prompt optimization, enabling seamless adaptation to specific user needs.
+---
+### **Training Details:**
+- **Base Model:** [Qwen2.5-7B-Instruct](#)
+- **Datasets Used for Fine-Tuning:**
+  - **gokaygokay/prompt-enhancer-dataset:** Focuses on prompt engineering with 17.9k samples.
+  - **gokaygokay/prompt-enhancement-75k:** Encompasses a wider array of prompt styles with 73.2k samples.
+  - **prithivMLmods/Prompt-Enhancement-Mini:** A compact dataset (1.16k samples) for iterative refinement.
+---
+### **Capabilities:**
+- **Prompt Optimization:**
+   Automatically refines and enhances user-input prompts for better generation results.
+- **Instruction-Based Text Generation:**
+   Supports diverse tasks, including:
+   - Creative writing (stories, poems, scripts).
+   - Summaries and paraphrasing.
+   - Custom Q&A systems.
+- **Efficient Fine-Tuning:**
+   Adaptable to additional fine-tuning tasks by leveraging the model's existing high-quality instruction-following capabilities.
+---
+### **Usage Instructions:**
+1. **Setup:**
+   - Ensure all necessary model files, including shards, tokenizer configurations, and index files, are downloaded and placed in the correct directory.
+2. **Load Model:**
+   Use PyTorch or Hugging Face Transformers to load the model and tokenizer. Ensure `pytorch_model.bin.index.json` is correctly set for efficient shard-based loading.
+3. **Customize Generation:**
+   Adjust parameters in `generation_config.json` to control aspects such as temperature, top-p sampling, and maximum sequence length.
 ---
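
The only change to the content in this diff is the `language:` list in the YAML front matter, which goes from a single `en` to thirteen three-letter codes. As a rough way to check such a change, the sketch below pulls the `language:` entries out of a model card's text. It is a hand-rolled reader for this simple list layout, not a full YAML parser, and the function name `front_matter_languages` is hypothetical.

```python
def front_matter_languages(readme_text):
    """Return the entries of the `language:` list in a model card's front matter."""
    lines = readme_text.splitlines()
    # Front matter must start on the first line with a `---` delimiter
    if not lines or lines[0].strip() != "---":
        return []
    languages = []
    in_language = False
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter: end of front matter
            break
        if in_language and stripped.startswith("- "):
            languages.append(stripped[2:].strip())
        else:
            # Any other key ends the list; `language:` starts it
            in_language = stripped == "language:"
    return languages


sample = """---
license: creativeml-openrail-m
language:
- zho
- eng
- fra
base_model:
- Qwen/Qwen2.5-7B-Instruct
---
### Novaeus-Promptist-7B-Instruct
"""
codes = front_matter_languages(sample)
# Check that every tag is a three-letter alphabetic code
all_three_letter = all(len(code) == 3 and code.isalpha() for code in codes)
```

For a real model card, a proper YAML library would be the safer choice, since this sketch does not handle inline lists or quoted values.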