Rayugacodes/Breach-OS

Naman Gupta committed on Apr 1

Commit e092a4c (1 parent: 39ae0cb)

update env example to use Groq instead of HuggingFace

Switched the primary LLM provider to Groq with clear comments
on which model to pick, what each variable does, and when the
HF vars are actually needed (only for deployment to HF Spaces).

Files changed (1)

.env.example +28 -11


@@ -1,19 +1,36 @@
-# Copy this to .env and fill in your values
-# Never commit .env to git
-# HuggingFace
-HF_TOKEN=hf_your_token_here
-API_BASE_URL=https://api-inference.huggingface.co/models
-MODEL_NAME=mistralai/Mistral-7B-Instruct-v0.3
-# Anthropic (used by P3's attack classifier as fallback)
-ANTHROPIC_API_KEY=sk-ant-your_key_here
-# Server
 MAX_TURNS=10
 DEBUG=false
-DEFAULT_LLM_PROVIDER=huggingface
-# Timeouts
 LLM_TIMEOUT=30
 LLM_MAX_RETRIES=3

+# Copy this file to .env and fill in your values.
+# Never commit .env to git — it's already in .gitignore.
+# ------------------------------------------------------------------
+# Groq (required — Person 3's LLM pipeline uses this)
+# Get your key at: https://console.groq.com → API Keys
+# ------------------------------------------------------------------
+GROQ_API_KEY=gsk_your_key_here
+# Which Groq model to use.
+# Fast + free options: llama-3.1-8b-instant, mixtral-8x7b-32768
+# Smarter but slower: llama-3.3-70b-versatile
+MODEL_NAME=llama-3.1-8b-instant
+# ------------------------------------------------------------------
+# Server settings
+# ------------------------------------------------------------------
+# Maximum number of attack turns per episode
 MAX_TURNS=10
+# Set to true to enable FastAPI debug mode and verbose logging
 DEBUG=false
+# How long to wait for a single Groq API call (seconds)
 LLM_TIMEOUT=30
+# How many times to retry a failed Groq call before giving up
 LLM_MAX_RETRIES=3
+# ------------------------------------------------------------------
+# HuggingFace (only needed if deploying to HF Spaces)
+# The inference.py attacker script uses this to call the HF API
+# ------------------------------------------------------------------
+HF_TOKEN=hf_your_token_here
+API_BASE_URL=https://api-inference.huggingface.co/models
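For reviewers, here is a minimal sketch of how application code might consume the variables documented in this file. It is not taken from the repository: `Settings`, `load_settings`, and `call_with_retries` are hypothetical names, the defaults simply mirror the example values above, and the sketch assumes the variables are already present in the process environment (for instance, loaded from `.env` by a tool such as python-dotenv). It also assumes `LLM_MAX_RETRIES` counts retries after the first attempt, which the comment in the file suggests but does not state outright.

```python
import os
import time
from dataclasses import dataclass
from typing import Callable, Mapping, TypeVar

T = TypeVar("T")


@dataclass(frozen=True)
class Settings:
    """Hypothetical typed view of the variables in .env.example."""
    groq_api_key: str
    model_name: str
    max_turns: int
    debug: bool
    llm_timeout: float
    llm_max_retries: int


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Parse settings from a mapping, falling back to the example defaults."""
    key = env.get("GROQ_API_KEY", "")
    # Reject a missing key or the untouched placeholder from .env.example.
    if not key or key == "gsk_your_key_here":
        raise ValueError("GROQ_API_KEY is missing or still the placeholder")
    return Settings(
        groq_api_key=key,
        model_name=env.get("MODEL_NAME", "llama-3.1-8b-instant"),
        max_turns=int(env.get("MAX_TURNS", "10")),
        debug=env.get("DEBUG", "false").strip().lower() == "true",
        llm_timeout=float(env.get("LLM_TIMEOUT", "30")),
        llm_max_retries=int(env.get("LLM_MAX_RETRIES", "3")),
    )


def call_with_retries(
    fn: Callable[[], T], max_retries: int, base_delay: float = 0.5
) -> T:
    """Call fn once, then retry up to max_retries times with exponential backoff."""
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except Exception:
            if attempt == max_retries:
                raise  # Out of retries: surface the last error to the caller.
            time.sleep(base_delay * 2 ** attempt)
    raise AssertionError("unreachable")
```

A caller would wrap its own LLM request function, for example `call_with_retries(make_request, settings.llm_max_retries)`, and pass `settings.llm_timeout` to whichever HTTP client or SDK it uses; `make_request` is likewise a placeholder.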