metadata
name: inference-always-cfpatch
description: >-
Reporting rule — inference always uses cf-patch fast-weight update; report
only the cf-patch score, never the "vanilla" no-TTT column
metadata:
node_type: memory
type: feedback
originSessionId: a902e50d-bd1f-422b-8298-552e3fb0a73f
For this TTT project, inference is always done with cf-patch (closed-form ridge ΔW written into each TTT layer's W_down = the fast-weight update). That IS our inference mode.
Why: The user standardized on cf-patch as the one inference method (live-TTT paper-rule cumsum is unstable on these ckpts; cf-patch's one-shot ΔW is the fast-weight update we use).
How to apply:
- When reporting cf-patch RULER/eval results, report ONLY the
patch(cf-patch) score. Do NOT present thevanillacolumn as a baseline or result. - The
vanillacolumn ineval_cf_ruler.shoutput is a Phase-1 artifact: OpenCompass workers do NOT importinference_model, so they run stock Qwen3 with no TTT and no cf-patch — it is not a mode we use. Ignore it / don't surface it. - Cross-checkpoint or cross-config comparisons should compare cf-patch vs cf-patch.
See [[ttt-inference-findings]] for why live-TTT is off and cf-patch is the chosen path.
Xet Storage Details
- Size:
- 1.24 kB
- Xet hash:
- ded410bfdfe874cac180710d1d01f882fa020629a8f9294cbfaad809a4a8e694
·
Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.