Instructions for using jmtss/Nyx-35B with libraries and local apps.

Libraries

How to use jmtss/Nyx-35B with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

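# The message below contains an image and the call uses text=..., so the
# image-text-to-text task is used here (it matches AutoModelForImageTextToText).
pipe = pipeline("image-text-to-text", model="jmtss/Nyx-35B")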
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("jmtss/Nyx-35B")
model = AutoModelForImageTextToText.from_pretrained("jmtss/Nyx-35B")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Local Apps

vLLM

How to use jmtss/Nyx-35B with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "jmtss/Nyx-35B"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "jmtss/Nyx-35B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/jmtss/Nyx-35B

SGLang

How to use jmtss/Nyx-35B with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "jmtss/Nyx-35B" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "jmtss/Nyx-35B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "jmtss/Nyx-35B" \
        --host 0.0.0.0 \
        --port 30000
# Call the server with the same curl request shown for the pip-based setup above.

Docker Model Runner
How to use jmtss/Nyx-35B with Docker Model Runner:
```
docker model run hf.co/jmtss/Nyx-35B
```

Nyx-35B

Nyx-35B is a full-weight instruction model produced by merging two sequentially trained LoRA adapters into the base model. The first adapter targets coding and reasoning; the second targets cybersecurity instruction following.

Model Summary

Field	Value
Architecture	Qwen3.6 35B-A3B MoE text-generation model
Format	Merged safetensors full model
Precision	BF16/FP16 weights
Size	~70.24 GB decimal, ~65.41 GiB
Shards	21 safetensors shards
Primary focus	Python/coding reasoning + cybersecurity instruction response
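
The size figures in the table can be cross-checked with a little arithmetic. The sketch below converts the stated decimal size to GiB and estimates the parameter count. It assumes 2 bytes per parameter (BF16 or FP16) and ignores any tensors stored at another precision; the variable names are illustrative.

```python
# Cross-check the sizes listed in the Model Summary table.
SIZE_GB_DECIMAL = 70.24   # from the table, decimal gigabytes (10**9 bytes)
BYTES_PER_PARAM = 2       # assumption: every weight is stored as BF16/FP16

size_bytes = SIZE_GB_DECIMAL * 10**9
size_gib = size_bytes / 2**30                      # binary gibibytes
approx_params_billion = size_bytes / BYTES_PER_PARAM / 10**9

print(f"{size_gib:.1f} GiB, about {approx_params_billion:.1f}B parameters")
```

Both results land close to the table's ~65.41 GiB and the 35B parameter count in the model name.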

Training

Nyx-35B was trained with a two-stage sequential LoRA workflow:

Stage 1: CodeX pilot
- Dataset: Modotte/CodeX-2M-Thinking
- Rows: 20,000
- Goal: improve coding and Python reasoning behavior

Stage 2: Cyber specialization
- Dataset: jmtss/cyber-security-instruct-3k
- Rows: 3,678
- Effective batch size: 32
- Steps: 115
- Learning rate: 5e-5
- Final train loss: 1.511
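
The Stage 2 numbers are consistent with roughly one pass over the dataset. A quick check, assuming a single epoch and that the final partial batch is kept (neither is stated here):

```python
import math

rows = 3_678                 # Stage 2 dataset rows
effective_batch_size = 32    # samples consumed per optimizer step

# Optimizer steps needed for one full pass when the last partial batch is kept.
steps_per_epoch = math.ceil(rows / effective_batch_size)
print(steps_per_epoch)  # 115
```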

The final uploaded model is a merged model:

base model + Stage 1 CodeX adapter + Stage 2 Cyber adapter
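
A merge of this shape is commonly done with the PEFT library. The following is a minimal sketch, not the author's actual script. It assumes both adapters are standard PEFT LoRA checkpoints and that Stage 2 was trained on top of the Stage 1 result. The arguments `base_model_id`, `stage1_adapter`, `stage2_adapter`, and `output_dir` are hypothetical values you would supply yourself.

```python
def merge_sequential_adapters(base_model_id, stage1_adapter, stage2_adapter, output_dir):
    """Fold two LoRA adapters into the base weights, one after the other."""
    # Imported lazily so that defining this function does not require the libraries.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained(base_model_id, torch_dtype=torch.bfloat16)

    # Apply each adapter on top of the result of the previous merge.
    for adapter_path in (stage1_adapter, stage2_adapter):
        model = PeftModel.from_pretrained(model, adapter_path)
        model = model.merge_and_unload()  # returns the model with LoRA deltas folded in

    # Write the merged full-weight model as safetensors shards.
    model.save_pretrained(output_dir, safe_serialization=True)
    return model
```

Merging in training order matters under this assumption, because the Stage 2 adapter was fitted against weights that already included Stage 1.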

Recommended Hardware

For unquantized (BF16) inference, the model needs more GPU memory than the raw ~70 GB of weights, because serving also requires memory for the KV cache, activations, and runtime buffers.

Hardware	Recommendation
NVIDIA H200 141GB	Recommended single-GPU deployment
NVIDIA B200 / B300	Best high-end option with more headroom
RTX PRO 6000 Blackwell 96GB	Workstation/single-user option
H100 80GB	Tight; use small context/batch or quantization
Consumer 24GB/32GB GPUs	Use quantized variants only
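
To see why the 80 GB card is marked as tight, subtract the weight size from each card's memory. This is a rough sketch: it treats the advertised capacities as decimal gigabytes and ignores framework overhead, so real headroom will be somewhat lower.

```python
WEIGHTS_GB = 70.24  # merged BF16 weights, from the Model Summary

# Advertised memory per GPU in GB, taken from the hardware table.
gpu_memory_gb = {
    "H200 141GB": 141,
    "RTX PRO 6000 Blackwell 96GB": 96,
    "H100 80GB": 80,
}

# Memory left for KV cache, activations, and runtime buffers after loading weights.
headroom_gb = {name: round(total - WEIGHTS_GB, 2) for name, total in gpu_memory_gb.items()}

for name, free in headroom_gb.items():
    print(f"{name}: ~{free} GB left after weights")
```

On these assumptions the H100 has under 10 GB left, which is why a small context length, a small batch, or quantization is suggested for it.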

Quick Start

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_id = "jmtss/Nyx-35B"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)

messages = [
    {"role": "user", "content": "Write a short Python function that checks if a URL uses HTTPS."}
]

prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=256,
    temperature=0.2,
    do_sample=True,
)

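# Decode only the newly generated tokens so the prompt is not echoed back.
new_tokens = outputs[0][inputs["input_ids"].shape[-1]:]
print(tokenizer.decode(new_tokens, skip_special_tokens=True))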

vLLM

For production serving, use vLLM if your environment supports this Qwen3.6 MoE architecture:

vllm serve jmtss/Nyx-35B \
  --trust-remote-code \
  --dtype bfloat16 \
  --max-model-len 4096

Increase --max-model-len only if your GPU has enough free memory for KV cache.
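
Once the server is running, it can be called from Python as well as from curl. The sketch below uses only the standard library and assumes the server listens on `http://localhost:8000`, as in the curl example earlier in this card. The helper names `build_chat_payload` and `chat` are illustrative, not part of vLLM.

```python
import json
import urllib.request


def build_chat_payload(prompt, model="jmtss/Nyx-35B", max_tokens=256):
    """Build the JSON body for an OpenAI-compatible /v1/chat/completions call."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }


def chat(prompt, base_url="http://localhost:8000"):
    """Send one prompt to the server and return the assistant's reply text."""
    request = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(build_chat_payload(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

For example, `chat("Explain what a SYN flood is.")` returns the reply as a string. Keep `max_tokens` plus the prompt length within the `--max-model-len` passed to the server.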

Intended Use

Nyx-35B is intended for:

- Python and software engineering assistance
- Defensive cybersecurity education
- MITRE ATT&CK-style concept explanation
- Security documentation and analysis support
- General technical instruction following

Safety and Limitations

- This model has not been formally benchmarked beyond training loss and basic sanity prompts.
- Cybersecurity outputs should be reviewed by a qualified human before operational use.
- The model may produce incorrect, outdated, or incomplete security guidance.
- The cybersecurity tuning is intended for defensive, educational, and authorized research contexts.
- Do not use this model for unauthorized access, credential theft, malware deployment, evasion, or other harmful activity.
- Responses may begin with thinking-style prefaces, a style inherited from the base model and the training data.
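
If the thinking-style prefaces get in the way downstream, they can be filtered out after generation. The sketch below assumes the reasoning is wrapped in `<think>...</think>` tags, which is common for Qwen-style models but is not confirmed for this model; check a few raw outputs first and adjust the pattern if needed. The helper name `strip_thinking` is illustrative.

```python
import re

# Assumption: reasoning is wrapped in <think>...</think> tags.
_THINK_BLOCK = re.compile(r"<think>.*?</think>\s*", flags=re.DOTALL)


def strip_thinking(text):
    """Remove <think>...</think> blocks and return the remaining answer text."""
    return _THINK_BLOCK.sub("", text).strip()


print(strip_thinking("<think>Check the scheme.</think>\nUse urlparse(url).scheme == 'https'."))
```

Text without such tags passes through unchanged, so the filter is safe to apply to every response.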

Training Artifacts

The uploaded repository contains the merged full model only. Intermediate LoRA adapters and training checkpoints were not included in this repository.

License

This model is a derivative of the listed base model and datasets. Use is subject to the terms of the base model, datasets, and any applicable licenses. Verify compatibility for your use case before commercial or production deployment.

Safetensors

Model size: 35B params
Tensor type: F32, BF16
