Rename citation fields, strengthen test assertions from loose bounds 8f9ddf0 vxa8502 commited on 29 days ago
Add bootstrap confidence intervals to evaluation metrics ca96fbf vxa8502 commited on about 1 month ago
Refactor sanity checks with improved adversarial detection a9bab1a vxa8502 commited on about 1 month ago
Replace direct file writes to *_latest.json with atomic symlink pattern b799e56 vxa8502 commited on about 1 month ago