How to use theNovaAI/Hypernova-experimental-GPTQ with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="theNovaAI/Hypernova-experimental-GPTQ")
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("theNovaAI/Hypernova-experimental-GPTQ")
model = AutoModelForCausalLM.from_pretrained("theNovaAI/Hypernova-experimental-GPTQ")
How to use theNovaAI/Hypernova-experimental-GPTQ with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "theNovaAI/Hypernova-experimental-GPTQ"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "theNovaAI/Hypernova-experimental-GPTQ",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'
# Or use Docker:
docker model run hf.co/theNovaAI/Hypernova-experimental-GPTQ
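The curl call to the vLLM server can also be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_completion_request` and `complete` are hypothetical, and the default `base_url` assumes the server is listening on port 8000 as in the curl example. Reading `choices[0].text` assumes the response follows the usual OpenAI-style completions shape.

```python
import json
import urllib.request

MODEL_ID = "theNovaAI/Hypernova-experimental-GPTQ"


def build_completion_request(prompt, base_url="http://localhost:8000",
                             max_tokens=512, temperature=0.5):
    """Build the same POST request as the curl example, without sending it."""
    payload = {
        "model": MODEL_ID,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, **kwargs):
    """Send the request to a running server and return the generated text."""
    request = build_completion_request(prompt, **kwargs)
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    # Assumption: OpenAI-style responses carry the text in choices[i].text.
    return body["choices"][0]["text"]
```

With a server running, `complete("Once upon a time,")` should return the continuation. Passing `base_url="http://localhost:30000"` targets a server on a different port, such as the SGLang setup.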
How to use theNovaAI/Hypernova-experimental-GPTQ with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "theNovaAI/Hypernova-experimental-GPTQ" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "theNovaAI/Hypernova-experimental-GPTQ",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'
# Or start the SGLang server with Docker:
docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "theNovaAI/Hypernova-experimental-GPTQ" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "theNovaAI/Hypernova-experimental-GPTQ",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'
How to use theNovaAI/Hypernova-experimental-GPTQ with Docker Model Runner:
docker model run hf.co/theNovaAI/Hypernova-experimental-GPTQ
Quantized to 4-bit with a group size of 128 (128g) using AutoGPTQ and 🤗 Optimum.
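As a rough illustration of what 4-bit, group size 128 means for storage, the sketch below estimates the size of the quantized weights. It rests on several assumptions that are not stated in this card: roughly 13 billion parameters (taken from the base model's name), one 16-bit scale and one 4-bit zero point stored per group of 128 weights, and every weight being quantized. Real checkpoints usually keep some layers, such as embeddings, in higher precision, so treat the result as a ballpark figure only.

```python
def gptq_effective_bits(bits=4, group_size=128, scale_bits=16, zero_bits=4):
    """Approximate bits stored per weight, including per-group metadata."""
    # Each group of `group_size` weights shares one scale and one zero point.
    return bits + (scale_bits + zero_bits) / group_size


def estimate_weight_gib(num_params, **kwargs):
    """Approximate size of the quantized weights in GiB."""
    total_bits = num_params * gptq_effective_bits(**kwargs)
    return total_bits / 8 / 2**30


# Assumption: ~13e9 parameters, inferred from "13b" in the base model name.
approx_gib = estimate_weight_gib(13e9)
print(f"{gptq_effective_bits():.3f} bits per weight, ~{approx_gib:.1f} GiB")
```

Under these assumptions the per-group metadata adds only a small fraction of a bit per weight, which is why larger group sizes cost little extra storage.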
Tried some new stuff this time around, and the outcome was very different from what I expected. This is an experimental model created for the development of NovaAI.
Good at chatting and some RP (role-play). Sometimes gets characters mixed up, and can occasionally struggle with context.
Prompt template (Alpaca format):
Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
{prompt}
### Response:
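A small helper can fill this template before sending text to the model. This is a minimal sketch; `format_prompt` and `ALPACA_TEMPLATE` are hypothetical names, and the single newlines between sections follow the template as it appears here (the original formatting may include blank lines between sections, so adjust if outputs look off).

```python
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n"
    "### Instruction:\n"
    "{prompt}\n"
    "### Response:\n"
)


def format_prompt(instruction):
    """Wrap a user instruction in the template shown above."""
    # Strip stray whitespace so the section markers stay on their own lines.
    return ALPACA_TEMPLATE.format(prompt=instruction.strip())


print(format_prompt("Introduce yourself in one sentence."))
```

The returned string can then be used as the `prompt` field in the server requests, or passed to the Transformers pipeline.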
The following models were included in the merge:
Some finetuning done as well
Base model: Gryphe/MythoMax-L2-13b