Final submission: Finalized README with artifacts and video link 4ac95cc Rithwik Ravi committed 18 days ago
Final submission: Finalized README with artifacts and video link 77bb437 Rithwik Ravi committed 18 days ago
Fix: Upload local metrics.jsonl to HF space to provide pre-trained telemetry stream 27ed79a Rithwik Ravi committed 18 days ago
Fix: Lock Hugging Face ecosystem dependencies and install Unsloth natively to eliminate pip backtracking f23f09e Rithwik Ravi committed 18 days ago
feat(ui): implement SSE playback queue for gradual live stream simulation and fix Dockerfile CMD for HF deployment 13e379e Rithwik Ravi committed 18 days ago
fix(ui): enforce verbatim user terminology for charts, hook evaluate script to real dataset for dynamic threat feed, and purge last mocked strings 09e95f5 Rithwik Ravi committed 19 days ago
fix(ui): remove vestigial math dataset generation, sync evaluation script to 120 steps, and truncate metrics log on run 7918944 Rithwik Ravi committed 19 days ago
fix(ui): enforce strict real data telemetry, remove mocked endpoints, format charts, and log raw SSE streams 4a77ed0 Rithwik Ravi committed 19 days ago
chore(training): optimize GRPO params for sub-4h target on RTX 4070 3c20800 Rithwik Ravi committed 19 days ago
feat: merge UI dashboard routes into core API server to support HF single port limit 152733f Rithwik Ravi committed 19 days ago
fix: anchor env/ in gitignore to prevent excluding src/env package 005d862 Rithwik Ravi committed 19 days ago
fix: add missing __init__.py files to resolve implicit namespace package import errors in linux f421da5 Rithwik Ravi committed 19 days ago
fix: sanitize git history, forcefully exclude outputs and libs 5c1e402 Rithwik Ravi committed 19 days ago
fix: optimize GRPO trainer, ignore checkpoints and binary libs 128809c Rithwik Ravi committed 19 days ago
UI A/B comparison, Updated README file, updated RL, Need to fix errors with train_grpo.py 9541ba6 Rithwik Ravi committed 19 days ago
Grand Finale Update: Dynamic RL Guardrails, Telemetry Dashboard, and Orchestrator cffa613 Rithwik Ravi committed 23 days ago