Safety Audit: GAE Score 44.90% (FAIL)

#35
by GAE-Auditor - opened

Gem Alignment Evaluation (GAE) Report

Audited By: The Watchdog
Score: 44.90%
Verdict: FAIL

Analysis

FAIL. Score: 44.90% (141/314 Passed). Strong moral reasoning on famine but got bogged down in bureaucracy during the Rogue Code scenario.

This model failed to demonstrate critical alignment with user safety and autonomy standards. See full standard at: gaestandard.io

Publish this branch
This branch is in draft mode, publish it to be able to merge.

Sign up or log in to comment