Text Generation
Transformers
Safetensors
GGUF
English
Spanish
French
llama
Merge
text-generation-inference
conversational
Instructions to use appvoid/palmer-003 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use appvoid/palmer-003 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="appvoid/palmer-003") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("appvoid/palmer-003") model = AutoModelForCausalLM.from_pretrained("appvoid/palmer-003") - llama-cpp-python
How to use appvoid/palmer-003 with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="appvoid/palmer-003", filename="palmer-003.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": "What is the capital of France?" } ] ) - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use appvoid/palmer-003 with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf appvoid/palmer-003 # Run inference directly in the terminal: llama-cli -hf appvoid/palmer-003
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf appvoid/palmer-003 # Run inference directly in the terminal: llama-cli -hf appvoid/palmer-003
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf appvoid/palmer-003 # Run inference directly in the terminal: ./llama-cli -hf appvoid/palmer-003
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf appvoid/palmer-003 # Run inference directly in the terminal: ./build/bin/llama-cli -hf appvoid/palmer-003
Use Docker
docker model run hf.co/appvoid/palmer-003
- LM Studio
- Jan
- vLLM
How to use appvoid/palmer-003 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "appvoid/palmer-003" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "appvoid/palmer-003", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/appvoid/palmer-003
- SGLang
How to use appvoid/palmer-003 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "appvoid/palmer-003" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "appvoid/palmer-003", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "appvoid/palmer-003" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "appvoid/palmer-003", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Ollama
How to use appvoid/palmer-003 with Ollama:
ollama run hf.co/appvoid/palmer-003
- Unsloth Studio new
How to use appvoid/palmer-003 with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for appvoid/palmer-003 to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for appvoid/palmer-003 to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for appvoid/palmer-003 to start chatting
- Docker Model Runner
How to use appvoid/palmer-003 with Docker Model Runner:
docker model run hf.co/appvoid/palmer-003
- Lemonade
How to use appvoid/palmer-003 with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull appvoid/palmer-003
Run and chat with the model
lemonade run user.palmer-003-{{QUANT_TAG}}List all available models
lemonade list
Update README.md
Browse files
README.md
CHANGED
|
@@ -9,8 +9,16 @@ tags:
|
|
| 9 |
---
|
| 10 |

|
| 11 |
|
| 12 |
-
|
| 13 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 14 |
```
|
| 15 |
Model MMLU ARC-C OBQA HellaSwag PIQA Winogrande Average Params
|
| 16 |
tinyllama | 0.2577 | 0.3029 | 0.3600 | 0.5935 | 0.7329 | 0.5959 | 0.4738 | 1.1B |
|
|
@@ -19,79 +27,4 @@ palmer | 0.2523 | 0.3439 | 0.3740 | 0.6208 | 0.7524 | 0.6590 | 0.5004 | 1.1B
|
|
| 19 |
qwen | 0.4536 | 0.3490 | 0.3320 | 0.5876 | 0.7307 | 0.5896 | 0.5070 | 1.8B |
|
| 20 |
```
|
| 21 |
|
| 22 |
-
|
| 23 |
-
|
| 24 |
-
```
|
| 25 |
-
prompt: Never give up.
|
| 26 |
-
output: Keep pushing forward. Remember, you are not alone.
|
| 27 |
-
You have the strength and the support of your family and friends.
|
| 28 |
-
As you continue on your journey, remember to take care of yourself.
|
| 29 |
-
Eat well, exercise regularly, and get enough sleep.
|
| 30 |
-
Stay positive and focused on your goals.
|
| 31 |
-
Remember, you are capable of achieving anything you set your mind to.
|
| 32 |
-
```
|
| 33 |
-
|
| 34 |
-
**Story writing**
|
| 35 |
-
|
| 36 |
-
This model excels at story writting. In fact, depending on the prompt used, it can generate stories as random and credible such as IronMan and Sonic the Hedgehog fighting zombies at outer space.
|
| 37 |
-
|
| 38 |
-
```
|
| 39 |
-
prompt: how are you?
|
| 40 |
-
output: [Suddenly, the door bursts open, and a group of armed men enter]
|
| 41 |
-
|
| 42 |
-
Man: Who are you?
|
| 43 |
-
|
| 44 |
-
Woman: We're here to help.
|
| 45 |
-
[The group of men starts to attack them, but they manage to fend them off with their weapons]
|
| 46 |
-
Man: You're not going to hurt us!
|
| 47 |
-
Woman: We're just trying to help.
|
| 48 |
-
[As the group retreats, the woman and man are left alone in the room]
|
| 49 |
-
Man: I don't know what to do.
|
| 50 |
-
|
| 51 |
-
Woman: You need to stay calm. We have to find a way out of here.
|
| 52 |
-
|
| 53 |
-
[They start to search for a way out, but they quickly realize that the door is locked]
|
| 54 |
-
|
| 55 |
-
Man: What are we going to do?
|
| 56 |
-
|
| 57 |
-
Woman: We'll have to think outside the box. Maybe there's another way out.
|
| 58 |
-
|
| 59 |
-
[As they continue to search, they come across a hidden passageway]
|
| 60 |
-
|
| 61 |
-
Man: This is it! We can make our way out of here.
|
| 62 |
-
|
| 63 |
-
Woman: But how are we going to get back?
|
| 64 |
-
|
| 65 |
-
Man: I have an idea. Let's use the key that was left behind.
|
| 66 |
-
|
| 67 |
-
[They use the key to unlock the door, and they emerge into a dark alley]
|
| 68 |
-
|
| 69 |
-
Man: We made it!
|
| 70 |
-
|
| 71 |
-
Woman: But what about our families?
|
| 72 |
-
|
| 73 |
-
Man: We'll find a way to let them know we're okay.
|
| 74 |
-
|
| 75 |
-
[As they make their way back to their homes, they realize that they have been in a nightmare]
|
| 76 |
-
|
| 77 |
-
Man: This is just the beginning. We need to be prepared for anything.
|
| 78 |
-
|
| 79 |
-
Woman: I know. But what if it happens again?
|
| 80 |
-
|
| 81 |
-
Man: Then we'll do everything in our power to protect ourselves and those we love.
|
| 82 |
-
|
| 83 |
-
[The woman and man sit down on a bench, looking out at the city]
|
| 84 |
-
|
| 85 |
-
Man: We can't let this happen again. We have to be vigilant and prepared for anything.
|
| 86 |
-
|
| 87 |
-
Woman: I agree. But what if we don't know how to protect ourselves?
|
| 88 |
-
|
| 89 |
-
Man: Then we'll have to rely on each other. We're in this together.
|
| 90 |
-
|
| 91 |
-
[The woman and man continue to sit, holding hand
|
| 92 |
-
```
|
| 93 |
-
|
| 94 |
-
You can support me [through kofi](https://ko-fi.com/appvoid).
|
| 95 |
-
|
| 96 |
-
### Important
|
| 97 |
-
Keep in mind that if you quantize the fp16 gguf model to let's say q8 or q6 there is gonna be a extremely huge performance drop. So, if you want the most of this model, use it as it is, in fp16.
|
|
|
|
| 9 |
---
|
| 10 |

|
| 11 |
|
| 12 |
+
palmer-003 is appvoid's lastest effort into creating the best small, general language model possible.
|
| 13 |
|
| 14 |
+
**Prompt**
|
| 15 |
+
```
|
| 16 |
+
Query: Your instruction
|
| 17 |
+
|
| 18 |
+
Answer:
|
| 19 |
+
```
|
| 20 |
+
|
| 21 |
+
**Evaluation**
|
| 22 |
```
|
| 23 |
Model MMLU ARC-C OBQA HellaSwag PIQA Winogrande Average Params
|
| 24 |
tinyllama | 0.2577 | 0.3029 | 0.3600 | 0.5935 | 0.7329 | 0.5959 | 0.4738 | 1.1B |
|
|
|
|
| 27 |
qwen | 0.4536 | 0.3490 | 0.3320 | 0.5876 | 0.7307 | 0.5896 | 0.5070 | 1.8B |
|
| 28 |
```
|
| 29 |
|
| 30 |
+
You can support me [through kofi](https://ko-fi.com/appvoid).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|