Naman Gupta commited on
Commit
e092a4c
·
1 Parent(s): 39ae0cb

update env example to use Groq instead of HuggingFace

Browse files

Switched the primary LLM provider to Groq with clear comments
on which model to pick, what each variable does, and when the
HF vars are actually needed (only for deployment to HF Spaces).

Files changed (1) hide show
  1. .env.example +28 -11
.env.example CHANGED
@@ -1,19 +1,36 @@
1
- # Copy this to .env and fill in your values
2
- # Never commit .env to git
3
 
4
- # HuggingFace
5
- HF_TOKEN=hf_your_token_here
6
- API_BASE_URL=https://api-inference.huggingface.co/models
7
- MODEL_NAME=mistralai/Mistral-7B-Instruct-v0.3
 
8
 
9
- # Anthropic (used by P3's attack classifier as fallback)
10
- ANTHROPIC_API_KEY=sk-ant-your_key_here
 
 
11
 
12
- # Server
 
 
 
 
13
  MAX_TURNS=10
 
 
14
  DEBUG=false
15
- DEFAULT_LLM_PROVIDER=huggingface
16
 
17
- # Timeouts
18
  LLM_TIMEOUT=30
 
 
19
  LLM_MAX_RETRIES=3
 
 
 
 
 
 
 
 
1
+ # Copy this file to .env and fill in your values.
2
+ # Never commit .env to git — it's already in .gitignore.
3
 
4
+ # ------------------------------------------------------------------
5
+ # Groq (required — Person 3's LLM pipeline uses this)
6
+ # Get your key at: https://console.groq.com → API Keys
7
+ # ------------------------------------------------------------------
8
+ GROQ_API_KEY=gsk_your_key_here
9
 
10
+ # Which Groq model to use.
11
+ # Fast + free options: llama-3.1-8b-instant, mixtral-8x7b-32768
12
+ # Smarter but slower: llama-3.3-70b-versatile
13
+ MODEL_NAME=llama-3.1-8b-instant
14
 
15
+ # ------------------------------------------------------------------
16
+ # Server settings
17
+ # ------------------------------------------------------------------
18
+
19
+ # Maximum number of attack turns per episode
20
  MAX_TURNS=10
21
+
22
+ # Set to true to enable FastAPI debug mode and verbose logging
23
  DEBUG=false
 
24
 
25
+ # How long to wait for a single Groq API call (seconds)
26
  LLM_TIMEOUT=30
27
+
28
+ # How many times to retry a failed Groq call before giving up
29
  LLM_MAX_RETRIES=3
30
+
31
+ # ------------------------------------------------------------------
32
+ # HuggingFace (only needed if deploying to HF Spaces)
33
+ # The inference.py attacker script uses this to call the HF API
34
+ # ------------------------------------------------------------------
35
+ HF_TOKEN=hf_your_token_here
36
+ API_BASE_URL=https://api-inference.huggingface.co/models