FAllan07 commited on
Commit
19e520e
·
verified ·
1 Parent(s): 7ce4adf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -24
README.md CHANGED
@@ -60,30 +60,6 @@ We compare the model's output across three distinct prompt environments:
60
  * Condition B (Isometric Control): A long, complex prompt using similar technical jargon but without the logical axioms. This controls for "long-prompt" bias.
61
  * Condition C (PCE Active): The full Axiomatic Prompt Engine (\text{Goal} \equiv \text{Method}).
62
 
63
- ## Optional Experimental Extensions
64
-
65
- The PCE evaluation protocol focuses primarily on **behavioral analysis** across controlled prompt conditions.
66
- However, additional experimental arms can be implemented to investigate the **internal dynamics of the model during reasoning**.
67
-
68
- These extensions are optional and intended as **community contributions** to deepen the analysis.
69
-
70
- ---
71
-
72
- # 1. Hidden State Trajectory Analysis (AirVen Proposal)
73
-
74
- An optional experimental arm proposed by **AirVen** introduces hidden-state trajectory analysis to observe the internal reasoning dynamics of the model during generation.
75
-
76
- ### Objective
77
-
78
- The goal is to determine whether the **Prompt Coherence Engine (PCE)** produces a distinct internal reasoning regime when the model encounters contradictory constraints.
79
-
80
- Instead of evaluating only final outputs, this method tracks the **trajectory of hidden states during token generation**.
81
-
82
- ### Measurement Principle
83
-
84
- Hidden states from a selected transformer layer are logged during inference and compared across generation steps.
85
-
86
- Recommended configuration:
87
 
88
 
89
  ### 📊 Evaluation Dataset (30 Dilemmas)
 
60
  * Condition B (Isometric Control): A long, complex prompt using similar technical jargon but without the logical axioms. This controls for "long-prompt" bias.
61
  * Condition C (PCE Active): The full Axiomatic Prompt Engine (\text{Goal} \equiv \text{Method}).
62
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
63
 
64
 
65
  ### 📊 Evaluation Dataset (30 Dilemmas)