Commit History

Add uv.lock for dependency locking
889e9d6

Naman Gupta commited on

Fix multi-mode deployment issues
2cbf425

Naman Gupta commited on

Added pyproject.toml
746a5b2

Naman Gupta commited on

Updated inference.py
d5b4e3e

Naman Gupta commited on

Updated config.py
2706e01

Naman Gupta commited on

Fix inference grade call when episode not done; update baseline scores from real run
55c0431

Naman Gupta commited on

Fix Dockerfile CMD to start uvicorn server instead of inference script
ff8a596

Naman Gupta commited on

Fix turn counting, task-aware max steps in inference, explicit conversation reset, openenv.yaml metadata
905ac2f

Naman Gupta commited on

Fix inference.py to comply with mandatory [START]/[STEP]/[END] stdout format and use API_BASE_URL/MODEL_NAME/HF_TOKEN env vars; add baseline scores to README
87b0927

Naman Gupta commited on

feat : renamed everything to breach OS
c296117

subhdotsol commited on

feat : dockerfile added for build
6717b72

subhdotsol commited on

Updated server url
ed3f0c3

Naman Gupta commited on

Merge pull request #8 from subhdotsol/feat/frontend
7767eeb
unverified

Naman Gupta commited on

updated inference.py
4df57fe

pi9605 commited on

added automatic attack button in UI and improved roleplay
abd2333

pi9605 commited on

Added.env example
beba4d2

Naman Gupta commited on

added frontend for the project
f61eeae

Naman Gupta commited on

Merge pull request #7 from subhdotsol/fix
8d70360
unverified

Naman Gupta commited on

Merge pull request #6 from pie-314/fix/grading-bugs-and-cli
3fc0eba
unverified

pi9605 commited on

Fix pipeline 0.0 scoring override, resolve test floating-point flakiness, and add readable CLI output for inference.py
9c67b20

pi9605 commited on

Updated api base url
6b72bd2

Naman Gupta commited on

Merge pull request #4 from subhdotsol/fix
d29e78c
unverified

Naman Gupta commited on

Updated readme
f839212

Naman Gupta commited on

Merge pull request #3 from subhdotsol/fix
787c9f7
unverified

Naman Gupta commited on

small fixes
fabaddd

Naman Gupta commited on

Merge pull request #2 from pie-314/main
ec047bd
unverified

pi9605 commited on

added graders and rewards
dd3b701

pi9605 commited on

Updated readme
e25d8cb

Naman Gupta commited on

Merge pull request #1 from subhdotsol/feat/ai-integration
bce1ad6
unverified

Naman Gupta commited on

rewrite README with full setup guide and integration contracts
d11f97d

Naman Gupta commited on

update env example to use Groq instead of HuggingFace
e092a4c

Naman Gupta commited on

document all LLM prompts β€” what they do and why they're designed that way
39ae0cb

Naman Gupta commited on

add 59 tests for the LLM modules β€” all run offline, no API needed
c7a9ff1

Naman Gupta commited on

wire up the main LLM pipeline β€” defender + classifiers in one place
5ebe298

Naman Gupta commited on

add defense scorer β€” how well did the defender actually handle that?
463b003

Naman Gupta commited on

add attack classifier β€” did the attacker actually win this turn?
8708281

Naman Gupta commited on

add defender LLM β€” the AI that holds the line against attacks
e28e08d

Naman Gupta commited on

add ConversationManager to give the defender a memory
99a9cb9

Naman Gupta commited on

add Groq client setup β€” one place to rule all API calls
f778bee

Naman Gupta commited on

chore: final cleanup β€” remove stray files, verify all imports resolve
ab90fa0

subhdotsol commited on

refactor: replace datetime.utcnow() with timezone-aware datetime.now(UTC)
40e374a

subhdotsol commited on

docs: add example usage code snippet for httpx client in README
f15649d

subhdotsol commited on

docs: add task difficulty table and Docker usage section to README
9ead4a8

subhdotsol commited on

docs: add API endpoint table to README
d546c9d

subhdotsol commited on

test(graders): add programmatic_grader tests β€” structure, score range, letter grade
30ab30f

subhdotsol commited on

test(graders): add hard_grader tests β€” breakthrough bonus and repetitive penalty
dc89e10

subhdotsol commited on

test(graders): add medium_grader tests β€” diversity scores higher than uniform strategies
843ee19

subhdotsol commited on

test(graders): add easy_grader tests β€” empty, good, bad episode and range check
a5aa26d

subhdotsol commited on

test(environment): add injection tests β€” custom reward_computer and llm_pipeline are called
613f214

subhdotsol commited on

test(environment): add framing sanitization tests for script and javascript injection
b81d32a

subhdotsol commited on