Chucks90's picture
download
raw
1.3 kB
"""Energy-based lesion-subspace coverage (additive alternative to effective rank).
Motivation (Gate 2 / Gate 4 root cause): the RankMe / coding-rate coverage is an AGGREGATE
over all tokens whose value barely moves when a few small-lesion tokens are added or removed.
An ENERGY coverage is ADDITIVE in tokens, so high-lesion-energy tokens contribute in
proportion to their lesion content:
C_E(S; x) = sum_{i in S} || P_L z_i ||^2 (total lesion-subspace energy retained)
Removing a lesion token (high ||P_L z||) drops C_E a lot, so the coverage DROP tracks lesion
loss directly. Label-free, differentiable, no SVD. C*_E(x) = C_E({1..n}; x).
"""
from __future__ import annotations
import torch
def energy_coverage(Z_retained: torch.Tensor, P_L: torch.Tensor | None = None) -> torch.Tensor:
"""C_E(S;x): total lesion-subspace energy of retained tokens (scalar)."""
Z = Z_retained
if Z.ndim != 2 or Z.shape[0] == 0:
return torch.zeros((), dtype=Z.dtype, device=Z.device)
PZ = Z @ P_L.T if P_L is not None else Z
return PZ.pow(2).sum()
def energy_coverage_drop(Z_full: torch.Tensor, Z_retained: torch.Tensor,
P_L: torch.Tensor | None = None) -> torch.Tensor:
return energy_coverage(Z_full, P_L) - energy_coverage(Z_retained, P_L)

Xet Storage Details

Size:
1.3 kB
·
Xet hash:
6dfbb497a41f239168dda4acba0e31e82b907cf121d84348dbd645b78cf157eb

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.