Instructions to use oxdev/security-auditor-grpo with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use oxdev/security-auditor-grpo with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="oxdev/security-auditor-grpo")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("oxdev/security-auditor-grpo")
model = AutoModelForCausalLM.from_pretrained("oxdev/security-auditor-grpo")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use oxdev/security-auditor-grpo with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "oxdev/security-auditor-grpo"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "oxdev/security-auditor-grpo",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/oxdev/security-auditor-grpo

SGLang

How to use oxdev/security-auditor-grpo with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "oxdev/security-auditor-grpo" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "oxdev/security-auditor-grpo",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "oxdev/security-auditor-grpo" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "oxdev/security-auditor-grpo",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use oxdev/security-auditor-grpo with Docker Model Runner:
```
docker model run hf.co/oxdev/security-auditor-grpo
```

security-auditor-grpo

Commit History

Update README with comprehensive model card

df8a81e
verified

oxdev commited on Apr 27

Add Google Colab training notebook for V2 GRPO training (free T4 path)

55ef8ec
verified

oxdev commited on Apr 27

v2: 5K subset for A10G, fix escaping

3c818d7
verified

oxdev commited on Apr 25

fix: escape syntax in quality_reward

75be256
verified

oxdev commited on Apr 25

GRPO training complete — smart contract security auditor

93b6a9a
verified

oxdev commited on Apr 25

add: GRPO v2 training script with 4 reward functions + dataset builder

9ab390c
verified

oxdev commited on Apr 25

fix: total_mem -> total_memory for PyTorch compat

39535c8
verified

oxdev commited on Apr 25

fix: disable all Hub calls during trainer init to prevent 401

3716618
verified

oxdev commited on Apr 25

Upload tokenizer

1fb5e81
verified

oxdev commited on Apr 25

Upload Qwen2ForCausalLM

044e65a
verified

oxdev commited on Apr 25

Upload train_grpo_job.py with huggingface_hub

df05b8e
verified

oxdev commited on Apr 25

Upload train_grpo_job.py with huggingface_hub

0ee8b77
verified

oxdev commited on Apr 25

Upload train_grpo_job.py with huggingface_hub

eac5c9b
verified

oxdev commited on Apr 25

Upload train_grpo_job.py with huggingface_hub

7168e35
verified

oxdev commited on Apr 25

Upload train_grpo_job.py with huggingface_hub

74022f8
verified

oxdev commited on Apr 24

initial commit

c4b5a68
verified

oxdev commited on Apr 24

Commit History

Update README with comprehensive model card df8a81e verified

Add Google Colab training notebook for V2 GRPO training (free T4 path) 55ef8ec verified

v2: 5K subset for A10G, fix escaping 3c818d7 verified

fix: escape syntax in quality_reward 75be256 verified

GRPO training complete — smart contract security auditor 93b6a9a verified

add: GRPO v2 training script with 4 reward functions + dataset builder 9ab390c verified

fix: total_mem -> total_memory for PyTorch compat 39535c8 verified

fix: disable all Hub calls during trainer init to prevent 401 3716618 verified

Upload tokenizer 1fb5e81 verified

Upload Qwen2ForCausalLM 044e65a verified

Upload train_grpo_job.py with huggingface_hub df05b8e verified

Upload train_grpo_job.py with huggingface_hub 0ee8b77 verified

Upload train_grpo_job.py with huggingface_hub eac5c9b verified

Upload train_grpo_job.py with huggingface_hub 7168e35 verified

Upload train_grpo_job.py with huggingface_hub 74022f8 verified

initial commit c4b5a68 verified

Update README with comprehensive model card

df8a81e
verified

Add Google Colab training notebook for V2 GRPO training (free T4 path)

55ef8ec
verified

v2: 5K subset for A10G, fix escaping

3c818d7
verified

fix: escape syntax in quality_reward

75be256
verified

GRPO training complete — smart contract security auditor

93b6a9a
verified

add: GRPO v2 training script with 4 reward functions + dataset builder

9ab390c
verified

fix: total_mem -> total_memory for PyTorch compat

39535c8
verified

fix: disable all Hub calls during trainer init to prevent 401

3716618
verified

Upload tokenizer

1fb5e81
verified

Upload Qwen2ForCausalLM

044e65a
verified

Upload train_grpo_job.py with huggingface_hub

df05b8e
verified

Upload train_grpo_job.py with huggingface_hub

0ee8b77
verified

Upload train_grpo_job.py with huggingface_hub

eac5c9b
verified

Upload train_grpo_job.py with huggingface_hub

7168e35
verified

Upload train_grpo_job.py with huggingface_hub

74022f8
verified

initial commit

c4b5a68
verified