Merge branch 'main' of https://github.com/razancodes/Meta-Pytorch-Hackathon 052e052 MuazTPM commited on Apr 25
perf: A100 optimizations β BF16, dual-model loading, larger batches 9501940 razancodes commited on Apr 25
fix: add root / β /web/ redirect for HF Spaces (OpenEnv SDK has no root route) 8d98464 MuazTPM commited on Apr 25
feat: AGUI live state polling, rebuild static frontend with API fix 29d60d6 MuazTPM commited on Apr 25
fix: batch normalization, EMA baseline, investigation bonuses, entity regex 7e494da razancodes commited on Apr 25
fix(launderer): stop truncating response_text so scoring loop can parse JSON 91d42f8 razancodes commited on Apr 25
fix: PPO stagnation - launderer diversity, reward noise, KL logging, terminal dedup 77666fa razancodes commited on Apr 25
fix: final audit β orchestrator scores, KL direction, grader typology alias 11a0963 MuazTPM commited on Apr 25
fix: get_scenario() always produces suspicious scenarios (force_clean=False) 95b1622 MuazTPM commited on Apr 24
debug: flush __pycache__ at test start, add grader breakdown output 388d55f MuazTPM commited on Apr 24
fix: terminal actions skip grade_step to prevent reward double-count 8db62c9 MuazTPM commited on Apr 24
feat: self-play AML simulator β clean scenarios, reward composition, OS metrics d5b93f2 MuazTPM commited on Apr 24
audit: fix tool roster (10+3+5=18), align adversary default to local Llama, add accelerate/bitsandbytes to README deps 86d777f MuazTPM commited on Apr 24
docs: rewrite PROJECT_CONTEXT, TRAINING, README β correct 18-tool count, add Phase 3 FinCEN tools, verified terminal weights, unified GRPO/PLR/adversarial coverage 8c23772 MuazTPM commited on Apr 24
feat: Replace GPT-4o-mini with local Llama-3.1 8B in Adversary Agent 3b66efa razancodes commited on Apr 23
feat: Add PLR curriculum engine, GRPO trainer, and 5th AGUI panel; merge frontend updates dd6813e razancodes commited on Apr 23
fix(ui): patch trim typeerror, globe invisible arcs, and leaflet latency c865cd8 MuazTPM commited on Apr 23
feat(ui): phase 4 ui polish - fix cytoscape, globe, leaflet rendering and unify command center layout f37e9bf MuazTPM commited on Apr 23
chore: Finalize Phase 1-3 upgrades (Memex UI, 3D Globe, Adversarial Training, Reward Caps) e11f6a9 razancodes commited on Apr 23
feat: Phase 3 FinCEN 4-pillar expansion β industrial-grade AML environment c1991bd MuazTPM commited on Apr 23
docs: update PROJECT_CONTEXT.md and README.md for OpenEnv deployment b3cee40 MuazTPM commited on Apr 23