Hypernova
Collection
Experimental series of models, largely untested β’ 5 items β’ Updated
How to use theNovaAI/Hypernova-experimental with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="theNovaAI/Hypernova-experimental") # Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM
tokenizer = AutoTokenizer.from_pretrained("theNovaAI/Hypernova-experimental")
model = AutoModelForMultimodalLM.from_pretrained("theNovaAI/Hypernova-experimental")How to use theNovaAI/Hypernova-experimental with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "theNovaAI/Hypernova-experimental"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "theNovaAI/Hypernova-experimental",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker model run hf.co/theNovaAI/Hypernova-experimental
How to use theNovaAI/Hypernova-experimental with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "theNovaAI/Hypernova-experimental" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "theNovaAI/Hypernova-experimental",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "theNovaAI/Hypernova-experimental" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "theNovaAI/Hypernova-experimental",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'How to use theNovaAI/Hypernova-experimental with Docker Model Runner:
docker model run hf.co/theNovaAI/Hypernova-experimental
# Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM
tokenizer = AutoTokenizer.from_pretrained("theNovaAI/Hypernova-experimental")
model = AutoModelForMultimodalLM.from_pretrained("theNovaAI/Hypernova-experimental")Tried some new stuff this time around. Very different outcome than I expected. This is an experimental model that was created for the development of NovaAI.
Good at chatting and some RP. Sometimes gets characters mixed up. Can occasionally struggle with context.
Quantized model here: theNovaAI/Hypernova-experimental-GPTQ
Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
{prompt}
### Response:
The following models were included in the merge:
Some finetuning done as well
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="theNovaAI/Hypernova-experimental")