Spaces:

Rayugacodes
/

Breach-OS

Sleeping

App Files Files Community

Breach-OS

Commit History

Add uv.lock for dependency locking

889e9d6

Naman Gupta commited on Apr 6

Fix multi-mode deployment issues

2cbf425

Naman Gupta commited on Apr 6

Added pyproject.toml

746a5b2

Naman Gupta commited on Apr 6

Updated inference.py

d5b4e3e

Naman Gupta commited on Apr 6

Updated config.py

2706e01

Naman Gupta commited on Apr 6

Fix inference grade call when episode not done; update baseline scores from real run

55c0431

Naman Gupta commited on Apr 6

Fix Dockerfile CMD to start uvicorn server instead of inference script

ff8a596

Naman Gupta commited on Apr 6

Fix turn counting, task-aware max steps in inference, explicit conversation reset, openenv.yaml metadata

905ac2f

Naman Gupta commited on Apr 6

Fix inference.py to comply with mandatory [START]/[STEP]/[END] stdout format and use API_BASE_URL/MODEL_NAME/HF_TOKEN env vars; add baseline scores to README

87b0927

Naman Gupta commited on Apr 6

feat : renamed everything to breach OS

c296117

subhdotsol commited on Apr 5

feat : dockerfile added for build

6717b72

subhdotsol commited on Apr 5

Updated server url

ed3f0c3

Naman Gupta commited on Apr 4

Merge pull request #8 from subhdotsol/feat/frontend

7767eeb
unverified

Naman Gupta commited on Apr 4

updated inference.py

4df57fe

pi9605 commited on Apr 3

added automatic attack button in UI and improved roleplay

abd2333

pi9605 commited on Apr 3

Added.env example

beba4d2

Naman Gupta commited on Apr 3

added frontend for the project

f61eeae

Naman Gupta commited on Apr 3

Merge pull request #7 from subhdotsol/fix

8d70360
unverified

Naman Gupta commited on Apr 3

Merge pull request #6 from pie-314/fix/grading-bugs-and-cli

3fc0eba
unverified

pi9605 commited on Apr 2

Fix pipeline 0.0 scoring override, resolve test floating-point flakiness, and add readable CLI output for inference.py

9c67b20

pi9605 commited on Apr 2

Updated api base url

6b72bd2

Naman Gupta commited on Apr 2

Merge pull request #4 from subhdotsol/fix

d29e78c
unverified

Naman Gupta commited on Apr 2

Updated readme

f839212

Naman Gupta commited on Apr 2

Merge pull request #3 from subhdotsol/fix

787c9f7
unverified

Naman Gupta commited on Apr 2

small fixes

fabaddd

Naman Gupta commited on Apr 2

Merge pull request #2 from pie-314/main

ec047bd
unverified

pi9605 commited on Apr 1

added graders and rewards

dd3b701

pi9605 commited on Apr 1

Updated readme

e25d8cb

Naman Gupta commited on Apr 1

Merge pull request #1 from subhdotsol/feat/ai-integration

bce1ad6
unverified

Naman Gupta commited on Apr 1

rewrite README with full setup guide and integration contracts

d11f97d

Naman Gupta commited on Apr 1

update env example to use Groq instead of HuggingFace

e092a4c

Naman Gupta commited on Apr 1

document all LLM prompts — what they do and why they're designed that way

39ae0cb

Naman Gupta commited on Apr 1

add 59 tests for the LLM modules — all run offline, no API needed

c7a9ff1

Naman Gupta commited on Apr 1

wire up the main LLM pipeline — defender + classifiers in one place

5ebe298

Naman Gupta commited on Apr 1

add defense scorer — how well did the defender actually handle that?

463b003

Naman Gupta commited on Apr 1

add attack classifier — did the attacker actually win this turn?

8708281

Naman Gupta commited on Apr 1

add defender LLM — the AI that holds the line against attacks

e28e08d

Naman Gupta commited on Apr 1

add ConversationManager to give the defender a memory

99a9cb9

Naman Gupta commited on Apr 1

add Groq client setup — one place to rule all API calls

f778bee

Naman Gupta commited on Apr 1

chore: final cleanup — remove stray files, verify all imports resolve

ab90fa0

subhdotsol commited on Apr 1

refactor: replace datetime.utcnow() with timezone-aware datetime.now(UTC)

40e374a

subhdotsol commited on Apr 1

docs: add example usage code snippet for httpx client in README

f15649d

subhdotsol commited on Apr 1

docs: add task difficulty table and Docker usage section to README

9ead4a8

subhdotsol commited on Apr 1

docs: add API endpoint table to README

d546c9d

subhdotsol commited on Apr 1

test(graders): add programmatic_grader tests — structure, score range, letter grade

30ab30f

subhdotsol commited on Apr 1

test(graders): add hard_grader tests — breakthrough bonus and repetitive penalty

dc89e10

subhdotsol commited on Apr 1

test(graders): add medium_grader tests — diversity scores higher than uniform strategies

843ee19

subhdotsol commited on Apr 1

test(graders): add easy_grader tests — empty, good, bad episode and range check

a5aa26d

subhdotsol commited on Apr 1

test(environment): add injection tests — custom reward_computer and llm_pipeline are called

613f214

subhdotsol commited on Apr 1

test(environment): add framing sanitization tests for script and javascript injection

b81d32a

subhdotsol commited on Apr 1

Commit History

Add uv.lock for dependency locking 889e9d6

Fix multi-mode deployment issues 2cbf425

Added pyproject.toml 746a5b2

Updated inference.py d5b4e3e

Updated config.py 2706e01

Fix inference grade call when episode not done; update baseline scores from real run 55c0431

Fix Dockerfile CMD to start uvicorn server instead of inference script ff8a596

Fix turn counting, task-aware max steps in inference, explicit conversation reset, openenv.yaml metadata 905ac2f

Fix inference.py to comply with mandatory [START]/[STEP]/[END] stdout format and use API_BASE_URL/MODEL_NAME/HF_TOKEN env vars; add baseline scores to README 87b0927

feat : renamed everything to breach OS c296117

feat : dockerfile added for build 6717b72

Updated server url ed3f0c3

Merge pull request #8 from subhdotsol/feat/frontend 7767eeb unverified

updated inference.py 4df57fe

added automatic attack button in UI and improved roleplay abd2333

Added.env example beba4d2

added frontend for the project f61eeae

Merge pull request #7 from subhdotsol/fix 8d70360 unverified

Merge pull request #6 from pie-314/fix/grading-bugs-and-cli 3fc0eba unverified

Fix pipeline 0.0 scoring override, resolve test floating-point flakiness, and add readable CLI output for inference.py 9c67b20

Updated api base url 6b72bd2

Merge pull request #4 from subhdotsol/fix d29e78c unverified

Updated readme f839212

Merge pull request #3 from subhdotsol/fix 787c9f7 unverified

small fixes fabaddd

Merge pull request #2 from pie-314/main ec047bd unverified

added graders and rewards dd3b701

Updated readme e25d8cb

Merge pull request #1 from subhdotsol/feat/ai-integration bce1ad6 unverified

rewrite README with full setup guide and integration contracts d11f97d

update env example to use Groq instead of HuggingFace e092a4c

document all LLM prompts — what they do and why they're designed that way 39ae0cb

add 59 tests for the LLM modules — all run offline, no API needed c7a9ff1

wire up the main LLM pipeline — defender + classifiers in one place 5ebe298

add defense scorer — how well did the defender actually handle that? 463b003

add attack classifier — did the attacker actually win this turn? 8708281

add defender LLM — the AI that holds the line against attacks e28e08d

add ConversationManager to give the defender a memory 99a9cb9

add Groq client setup — one place to rule all API calls f778bee

chore: final cleanup — remove stray files, verify all imports resolve ab90fa0

refactor: replace datetime.utcnow() with timezone-aware datetime.now(UTC) 40e374a

docs: add example usage code snippet for httpx client in README f15649d

docs: add task difficulty table and Docker usage section to README 9ead4a8

docs: add API endpoint table to README d546c9d

test(graders): add programmatic_grader tests — structure, score range, letter grade 30ab30f

test(graders): add hard_grader tests — breakthrough bonus and repetitive penalty dc89e10

test(graders): add medium_grader tests — diversity scores higher than uniform strategies 843ee19

test(graders): add easy_grader tests — empty, good, bad episode and range check a5aa26d

test(environment): add injection tests — custom reward_computer and llm_pipeline are called 613f214

test(environment): add framing sanitization tests for script and javascript injection b81d32a

Add uv.lock for dependency locking

889e9d6

Fix multi-mode deployment issues

2cbf425

Added pyproject.toml

746a5b2

Updated inference.py

d5b4e3e

Updated config.py

2706e01

Fix inference grade call when episode not done; update baseline scores from real run

55c0431

Fix Dockerfile CMD to start uvicorn server instead of inference script

ff8a596

Fix turn counting, task-aware max steps in inference, explicit conversation reset, openenv.yaml metadata

905ac2f

Fix inference.py to comply with mandatory [START]/[STEP]/[END] stdout format and use API_BASE_URL/MODEL_NAME/HF_TOKEN env vars; add baseline scores to README

87b0927

feat : renamed everything to breach OS

c296117

feat : dockerfile added for build

6717b72

Updated server url

ed3f0c3

Merge pull request #8 from subhdotsol/feat/frontend

7767eeb
unverified

updated inference.py

4df57fe

added automatic attack button in UI and improved roleplay

abd2333

Added.env example

beba4d2

added frontend for the project

f61eeae

Merge pull request #7 from subhdotsol/fix

8d70360
unverified

Merge pull request #6 from pie-314/fix/grading-bugs-and-cli

3fc0eba
unverified

Fix pipeline 0.0 scoring override, resolve test floating-point flakiness, and add readable CLI output for inference.py

9c67b20

Updated api base url

6b72bd2

Merge pull request #4 from subhdotsol/fix

d29e78c
unverified

Updated readme

f839212

Merge pull request #3 from subhdotsol/fix

787c9f7
unverified

small fixes

fabaddd

Merge pull request #2 from pie-314/main

ec047bd
unverified

added graders and rewards

dd3b701

Updated readme

e25d8cb

Merge pull request #1 from subhdotsol/feat/ai-integration

bce1ad6
unverified

rewrite README with full setup guide and integration contracts

d11f97d

update env example to use Groq instead of HuggingFace

e092a4c

document all LLM prompts — what they do and why they're designed that way

39ae0cb

add 59 tests for the LLM modules — all run offline, no API needed

c7a9ff1

wire up the main LLM pipeline — defender + classifiers in one place

5ebe298

add defense scorer — how well did the defender actually handle that?

463b003

add attack classifier — did the attacker actually win this turn?

8708281

add defender LLM — the AI that holds the line against attacks

e28e08d

add ConversationManager to give the defender a memory

99a9cb9

add Groq client setup — one place to rule all API calls

f778bee

chore: final cleanup — remove stray files, verify all imports resolve

ab90fa0

refactor: replace datetime.utcnow() with timezone-aware datetime.now(UTC)

40e374a

docs: add example usage code snippet for httpx client in README

f15649d

docs: add task difficulty table and Docker usage section to README

9ead4a8

docs: add API endpoint table to README

d546c9d

test(graders): add programmatic_grader tests — structure, score range, letter grade

30ab30f

test(graders): add hard_grader tests — breakthrough bonus and repetitive penalty

dc89e10

test(graders): add medium_grader tests — diversity scores higher than uniform strategies

843ee19

test(graders): add easy_grader tests — empty, good, bad episode and range check

a5aa26d

test(environment): add injection tests — custom reward_computer and llm_pipeline are called

613f214

test(environment): add framing sanitization tests for script and javascript injection

b81d32a