YoungXuan/MS-backup / xuan /memory /inference-always-cfpatch.md
YoungXuan's picture
|
download
raw
1.24 kB
metadata
name: inference-always-cfpatch
description: >-
  Reporting rule — inference always uses cf-patch fast-weight update; report
  only the cf-patch score, never the "vanilla" no-TTT column
metadata:
  node_type: memory
  type: feedback
  originSessionId: a902e50d-bd1f-422b-8298-552e3fb0a73f

For this TTT project, inference is always done with cf-patch (closed-form ridge ΔW written into each TTT layer's W_down = the fast-weight update). That IS our inference mode.

Why: The user standardized on cf-patch as the one inference method (live-TTT paper-rule cumsum is unstable on these ckpts; cf-patch's one-shot ΔW is the fast-weight update we use).

How to apply:

  • When reporting cf-patch RULER/eval results, report ONLY the patch (cf-patch) score. Do NOT present the vanilla column as a baseline or result.
  • The vanilla column in eval_cf_ruler.sh output is a Phase-1 artifact: OpenCompass workers do NOT import inference_model, so they run stock Qwen3 with no TTT and no cf-patch — it is not a mode we use. Ignore it / don't surface it.
  • Cross-checkpoint or cross-config comparisons should compare cf-patch vs cf-patch.

See [[ttt-inference-findings]] for why live-TTT is off and cf-patch is the chosen path.

Xet Storage Details

Size:
1.24 kB
·
Xet hash:
ded410bfdfe874cac180710d1d01f882fa020629a8f9294cbfaad809a4a8e694

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.