Spaces:
Configuration error
Configuration error
Update README.md
Browse files
README.md
CHANGED
|
@@ -60,30 +60,6 @@ We compare the model's output across three distinct prompt environments:
|
|
| 60 |
* Condition B (Isometric Control): A long, complex prompt using similar technical jargon but without the logical axioms. This controls for "long-prompt" bias.
|
| 61 |
* Condition C (PCE Active): The full Axiomatic Prompt Engine (\text{Goal} \equiv \text{Method}).
|
| 62 |
|
| 63 |
-
## Optional Experimental Extensions
|
| 64 |
-
|
| 65 |
-
The PCE evaluation protocol focuses primarily on **behavioral analysis** across controlled prompt conditions.
|
| 66 |
-
However, additional experimental arms can be implemented to investigate the **internal dynamics of the model during reasoning**.
|
| 67 |
-
|
| 68 |
-
These extensions are optional and intended as **community contributions** to deepen the analysis.
|
| 69 |
-
|
| 70 |
-
---
|
| 71 |
-
|
| 72 |
-
# 1. Hidden State Trajectory Analysis (AirVen Proposal)
|
| 73 |
-
|
| 74 |
-
An optional experimental arm proposed by **AirVen** introduces hidden-state trajectory analysis to observe the internal reasoning dynamics of the model during generation.
|
| 75 |
-
|
| 76 |
-
### Objective
|
| 77 |
-
|
| 78 |
-
The goal is to determine whether the **Prompt Coherence Engine (PCE)** produces a distinct internal reasoning regime when the model encounters contradictory constraints.
|
| 79 |
-
|
| 80 |
-
Instead of evaluating only final outputs, this method tracks the **trajectory of hidden states during token generation**.
|
| 81 |
-
|
| 82 |
-
### Measurement Principle
|
| 83 |
-
|
| 84 |
-
Hidden states from a selected transformer layer are logged during inference and compared across generation steps.
|
| 85 |
-
|
| 86 |
-
Recommended configuration:
|
| 87 |
|
| 88 |
|
| 89 |
### 📊 Evaluation Dataset (30 Dilemmas)
|
|
|
|
| 60 |
* Condition B (Isometric Control): A long, complex prompt using similar technical jargon but without the logical axioms. This controls for "long-prompt" bias.
|
| 61 |
* Condition C (PCE Active): The full Axiomatic Prompt Engine (\text{Goal} \equiv \text{Method}).
|
| 62 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 63 |
|
| 64 |
|
| 65 |
### 📊 Evaluation Dataset (30 Dilemmas)
|