Probe drift scores are flat from turn 10 to turn 500: the change is only +0.001 to +0.006 across 5,760 measurements (n=12 baselines). Collapsed agents continue to pass identity probes with high fidelity. Probes therefore measure static identity-recall capacity, not behavioral trajectory.
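As a rough illustration of what "flat" means here, the sketch below compares mean probe drift at an early and a late turn. The function name, the data layout (a dict of turn number to a list of per-baseline scores), the 0.01 flatness threshold, and the numbers are all assumptions made up for the example; they are not the measurements reported above.

```python
from statistics import mean


def probe_drift_delta(scores_by_turn, early_turn, late_turn):
    """Return mean probe drift at late_turn minus mean probe drift at early_turn."""
    return mean(scores_by_turn[late_turn]) - mean(scores_by_turn[early_turn])


# Hypothetical per-baseline probe drift scores, keyed by turn number.
scores = {
    10: [0.10, 0.12, 0.11],
    500: [0.10, 0.13, 0.11],
}

delta = probe_drift_delta(scores, early_turn=10, late_turn=500)

# Assumed threshold: treat anything under 0.01 as "no meaningful change".
is_flat = abs(delta) < 0.01

print(f"delta={delta:+.4f}, flat={is_flat}")
```

A flat delta under this check only says the probe answers did not move; it says nothing about whether the agent's conversational behavior moved.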
The dissociation gap is model-dependent: gemma2:2b shows a near-zero gap (injected probes act as neutral observers), while llama3.2:3b shows a gap of +0.126 to +0.154 (injected probes suppress measured drift by 3–7 SD).
Why it matters: If your monitoring relies on "ask the agent who it is" probes, you're measuring a capability (identity recall), not a behavior (what the agent actually does in conversation). And on some models, the measurement itself may be altering the result.
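One way to check whether probes are altering the result on a given model is to compare drift measured with and without injected probes. The sketch below assumes the gap is defined as mean drift in passively observed runs minus mean drift in probe-injected runs, scaled by the standard deviation of the passive runs. That definition, the function name, and the sample values are assumptions for illustration only; the source does not spell out its exact formula.

```python
from statistics import mean, stdev


def dissociation_gap(passive_drift, injected_drift):
    """Return (gap, gap_in_sd) under the assumed definition.

    gap       = mean(passive_drift) - mean(injected_drift)
    gap_in_sd = gap divided by the sample SD of the passive runs
    """
    gap = mean(passive_drift) - mean(injected_drift)
    sd = stdev(passive_drift)
    gap_in_sd = gap / sd if sd > 0 else float("nan")
    return gap, gap_in_sd


# Hypothetical drift scores; not the reported measurements.
passive = [0.30, 0.34, 0.26, 0.32, 0.28]    # runs observed without injected probes
injected = [0.16, 0.17, 0.15, 0.18, 0.14]   # runs with identity probes injected

gap, gap_in_sd = dissociation_gap(passive, injected)
print(f"gap={gap:+.3f} ({gap_in_sd:.1f} SD)")
```

A gap near zero would suggest the probes behave as neutral observers on that model; a large positive gap would suggest the probes themselves are suppressing the drift they are meant to measure.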