Add ConvMemory CCGE-LA alpha checkpoint


Files changed (4)

LICENSE +21 -0
README.md +84 -0
ccge_la.pt +3 -0
manifest.json +244 -0

LICENSE ADDED

	@@ -0,0 +1,21 @@

+MIT License
+
+Copyright (c) 2026 ConvMemory contributors
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.

README.md ADDED

	@@ -0,0 +1,84 @@

+---
+license: mit
+library_name: convmemory
+tags:
+- retrieval
+- memory
+- reranking
+- agents
+- convmemory
+- ccge-la
+pipeline_tag: feature-extraction
+---
+# ConvMemory CCGE-LA LoCoMo MPNet Seed-23 Alpha
+This repository contains an alpha CCGE-LA conflict editor checkpoint for the public ConvMemory API.
+CCGE-LA stands for **Low-Amplitude Counterfactual Conflict Graph Editor**. It is a lightweight editor that runs after ConvMemory and adjusts retrieval scores when stale and current memories conflict:
+```text
+vector search -> ConvMemory -> CCGE-LA conflict-aware score edit -> memory context
+```
+## Files
+- `ccge_la.pt`: CCGE-LA editor checkpoint.
+- `manifest.json`: training configuration and seed-23 test metrics.
+- `LICENSE`: MIT license.
+## Usage
+Install ConvMemory from GitHub, or from PyPI once a compatible package release is available:
+```bash
+pip install git+https://github.com/pth2002/ConvMemory.git
+```
+Load the base ConvMemory checkpoint and then attach this editor:
+```python
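+from convmemory import ConvMemory
+# Load the base ConvMemory checkpoint named in this repo's manifest.
+model = ConvMemory.from_pretrained("checkpoints/convmemory-locomo-mpnet")
+# Attach the CCGE-LA editor from a local copy of this repository.
+model.load_ccge_editor("path/to/this/repo")
+# `query` and `memories` must already be defined by the caller.
+results = model.retrieve(
+    query=query,
+    memories=memories,
+    editor="ccge_la",  # apply the CCGE-LA score edit after ConvMemory
+    top_k=10,
+)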
+```
+You can also download from the Hub with `huggingface_hub.snapshot_download` and pass the local folder to `load_ccge_editor`.
+## Metrics
+These are seed-23 test metrics from the release manifest. This is an alpha checkpoint, not a final benchmark release.
+| Subset | MRR | R@10 | Gate |
+|---|---:|---:|---:|
+| FULL | 0.5638 | 0.7725 | 0.0995 |
+| T_SUP_auto | 0.5508 | 0.7138 | 0.0995 |
+| CONV_TOP1_WRONG_GOLD_IN_POOL | 0.2994 | 0.6822 | 0.0995 |
+| RESCUABLE_STALE_TOP1 | 0.3093 | 0.6877 | 0.0995 |
+## Training Notes
+- Base checkpoint: `convmemory-locomo-mpnet`.
+- Training split seed: `23`.
+- Candidate top-n: `192`.
+- Objective: retrieval cross-entropy plus a low-amplitude gate budget penalty.
+- The editor uses no current/stale labels, no gold-defined features, and no distillation objective.
+## Limitations
+- This is a public alpha checkpoint trained on a single LoCoMo-style seed-23 split.
+- It is intended for API trials and early integration, not as a final benchmark claim.
+- It should be used with the matching MPNet-family ConvMemory checkpoint.
+## Links
+- GitHub: https://github.com/pth2002/ConvMemory
+- CCGE-LA docs: https://github.com/pth2002/ConvMemory/blob/main/docs/CCGE_LA.md
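
Reviewer note on the Metrics table: the README reports MRR and R@10, and the manifest additionally lists `hit_at_1` and `recall_at_1`, which differ (for example 0.4184 vs 0.3818 on FULL). One reading that is consistent with this, though not confirmed by the source, is that some questions have more than one gold memory. The sketch below uses the standard textbook definitions of these metrics; the function names are illustrative and the repository's evaluation code may differ.

```python
def reciprocal_rank(ranked_ids, gold_ids):
    """Return 1/rank of the first gold item in the ranking, or 0.0 if none appears."""
    for rank, memory_id in enumerate(ranked_ids, start=1):
        if memory_id in gold_ids:
            return 1.0 / rank
    return 0.0


def hit_at_k(ranked_ids, gold_ids, k):
    """Return 1.0 if at least one gold item is in the top k, else 0.0."""
    return 1.0 if any(m in gold_ids for m in ranked_ids[:k]) else 0.0


def recall_at_k(ranked_ids, gold_ids, k):
    """Return the fraction of gold items that appear in the top k."""
    if not gold_ids:
        return 0.0
    found = set(ranked_ids[:k]) & set(gold_ids)
    return len(found) / len(gold_ids)


# Toy question with two gold memories (IDs are made up for illustration).
ranked = ["m1", "m9", "m2", "m4"]
gold = {"m1", "m2"}

rr = reciprocal_rank(ranked, gold)   # first gold item is at rank 1
hit1 = hit_at_k(ranked, gold, 1)     # a gold item is at rank 1
rec1 = recall_at_k(ranked, gold, 1)  # only one of two gold items in top 1
rec3 = recall_at_k(ranked, gold, 3)  # both gold items in top 3
```

Averaging `reciprocal_rank` over all questions gives MRR. Under these definitions a question with several gold memories can score a hit at 1 while its recall at 1 stays below 1, which would make the averaged `hit_at_1` at least as large as `recall_at_1`.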

ccge_la.pt ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7ea6c838c299d3e8eeee16f6e798be06dbbbb6822f8505e35d95324ed2e05af3
+size 832372
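
Reviewer note: the three lines above are a Git LFS pointer, not the checkpoint itself. After downloading the real `ccge_la.pt`, its size and SHA-256 digest can be checked against the pointer. The following is a minimal sketch using only the standard library; the helper names `parse_lfs_pointer` and `matches_pointer` are illustrative and not part of ConvMemory or Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the size and digest named by the pointer."""
    data_path = Path(path)
    if data_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with data_path.open("rb") as handle:
        # Hash in 1 MiB chunks so large checkpoints are not read into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text copied from this commit's ccge_la.pt entry.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:7ea6c838c299d3e8eeee16f6e798be06dbbbb6822f8505e35d95324ed2e05af3
size 832372
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
```

With a local copy of the checkpoint, `matches_pointer("ccge_la.pt", pointer)` should return `True` if the download is intact.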

manifest.json ADDED

	@@ -0,0 +1,244 @@

+{
+  "base_checkpoint": "convmemory-locomo-mpnet",
+  "candidate_cache": "v144_full_seed23_train_top192.pkl",
+  "candidate_top_n": 192,
+  "epochs_per_arm": 4,
+  "format": "convmemory-ccge-la",
+  "gate_penalty": 0.2,
+  "layers": 2,
+  "lr": 0.0008,
+  "metrics_seed23_test": {
+    "CONV_TOP1_WRONG_GOLD_IN_POOL": {
+      "gate": 0.09949657789547928,
+      "hit_at_1": 0.05859375,
+      "mrr": 0.29939404653314217,
+      "questions": 512,
+      "recall_at_1": 0.0537109375,
+      "recall_at_10": 0.6822312127976189
+    },
+    "FULL": {
+      "gate": 0.09949650585237918,
+      "hit_at_1": 0.4183564567769477,
+      "mrr": 0.5637697126584379,
+      "questions": 937,
+      "recall_at_1": 0.381803628601921,
+      "recall_at_10": 0.7724500686080195
+    },
+    "GOLD_IN_POOL": {
+      "gate": 0.09949650092838078,
+      "hit_at_1": 0.43507214206437295,
+      "mrr": 0.5861521477894585,
+      "questions": 901,
+      "recall_at_1": 0.39705882352941174,
+      "recall_at_10": 0.8033137783415252
+    },
+    "RESCUABLE_STALE_TOP1": {
+      "gate": 0.0994965493957573,
+      "hit_at_1": 0.06853582554517133,
+      "mrr": 0.30925646562133835,
+      "questions": 321,
+      "recall_at_1": 0.06386292834890965,
+      "recall_at_10": 0.6876502002670226
+    },
+    "T_SUP_auto": {
+      "gate": 0.09949656403151111,
+      "hit_at_1": 0.427536231884058,
+      "mrr": 0.5508461321923791,
+      "questions": 138,
+      "recall_at_1": 0.39734299516908217,
+      "recall_at_10": 0.7137681159420289
+    }
+  },
+  "model_dim": 96,
+  "name": "convmemory-ccge-la-locomo-mpnet-seed23-alpha",
+  "notes": [
+    "Objective is retrieval cross-entropy plus a gate budget penalty only.",
+    "No current/stale labels, no gold-defined feature, no distillation objective.",
+    "Weights were trained with the V151-compatible sweep order and converted into the public CCGELowAmplitudeEditor format.",
+    "Alpha checkpoint: trained on LoCoMo-style seed23 split; use for API trials, not as a final benchmark claim."
+  ],
+  "selected_arm": "state7_gp0p20",
+  "status": "public alpha checkpoint",
+  "test_questions": 937,
+  "train_wall_clock_s": 182.095270216465,
+  "trainable_questions": 981,
+  "training_history": {
+    "state5_gp0p10": [
+      {
+        "epoch": 1,
+        "gate": 0.2399524566351942,
+        "loss": 2.2548362000510247
+      },
+      {
+        "epoch": 2,
+        "gate": 0.11324312030570344,
+        "loss": 2.174000255331572
+      },
+      {
+        "epoch": 3,
+        "gate": 0.11223375650690479,
+        "loss": 2.1902397656480987
+      },
+      {
+        "epoch": 4,
+        "gate": 0.10931176690728414,
+        "loss": 2.0970082479793954
+      }
+    ],
+    "state5_gp0p15": [
+      {
+        "epoch": 1,
+        "gate": 0.15463919646726593,
+        "loss": 2.282538963055521
+      },
+      {
+        "epoch": 2,
+        "gate": 0.06402538461415107,
+        "loss": 2.3397113054280645
+      },
+      {
+        "epoch": 3,
+        "gate": 0.03899438852571505,
+        "loss": 2.350554473658741
+      },
+      {
+        "epoch": 4,
+        "gate": 0.02847297657869764,
+        "loss": 2.509581346329915
+      }
+    ],
+    "state5_gp0p20": [
+      {
+        "epoch": 1,
+        "gate": 0.136853313439969,
+        "loss": 2.254695448121597
+      },
+      {
+        "epoch": 2,
+        "gate": 0.04733424077131363,
+        "loss": 2.237200206988676
+      },
+      {
+        "epoch": 3,
+        "gate": 0.03198895434717003,
+        "loss": 2.3658829922040225
+      },
+      {
+        "epoch": 4,
+        "gate": 0.023831418282509034,
+        "loss": 2.539576406593726
+      }
+    ],
+    "state5_gp0p25": [
+      {
+        "epoch": 1,
+        "gate": 0.13151619928935496,
+        "loss": 2.2270623026653045
+      },
+      {
+        "epoch": 2,
+        "gate": 0.04244499447281785,
+        "loss": 2.37495636844924
+      },
+      {
+        "epoch": 3,
+        "gate": 0.026141336961449043,
+        "loss": 2.3756655300711067
+      },
+      {
+        "epoch": 4,
+        "gate": 0.017790341398049293,
+        "loss": 2.3847317789623346
+      }
+    ],
+    "state7_gp0p10": [
+      {
+        "epoch": 1,
+        "gate": 0.19828437211772354,
+        "loss": 2.3462205386400665
+      },
+      {
+        "epoch": 2,
+        "gate": 0.11635339423391311,
+        "loss": 2.1040097372122224
+      },
+      {
+        "epoch": 3,
+        "gate": 0.11308276833288532,
+        "loss": 2.15457621426013
+      },
+      {
+        "epoch": 4,
+        "gate": 0.10830596343411947,
+        "loss": 2.1375504431602423
+      }
+    ],
+    "state7_gp0p15": [
+      {
+        "epoch": 1,
+        "gate": 0.17169027484642013,
+        "loss": 2.2812409217248195
+      },
+      {
+        "epoch": 2,
+        "gate": 0.08825934893488656,
+        "loss": 2.2335765566771792
+      },
+      {
+        "epoch": 3,
+        "gate": 0.058508181972970294,
+        "loss": 2.2947117126775596
+      },
+      {
+        "epoch": 4,
+        "gate": 0.04042847446098574,
+        "loss": 2.3681090738729713
+      }
+    ],
+    "state7_gp0p20": [
+      {
+        "epoch": 1,
+        "gate": 0.17988600798053753,
+        "loss": 2.20868354366103
+      },
+      {
+        "epoch": 2,
+        "gate": 0.11630539378107627,
+        "loss": 2.18952473182759
+      },
+      {
+        "epoch": 3,
+        "gate": 0.10668768573487813,
+        "loss": 2.115367022140766
+      },
+      {
+        "epoch": 4,
+        "gate": 0.09883941742273523,
+        "loss": 2.1085117126479895
+      }
+    ],
+    "state7_gp0p25": [
+      {
+        "epoch": 1,
+        "gate": 0.13368244941567548,
+        "loss": 2.2540013599480053
+      },
+      {
+        "epoch": 2,
+        "gate": 0.10886466054665447,
+        "loss": 2.1919093473485
+      },
+      {
+        "epoch": 3,
+        "gate": 0.10376444168077453,
+        "loss": 2.1321198041365688
+      },
+      {
+        "epoch": 4,
+        "gate": 0.09607681747294286,
+        "loss": 2.134787376227653
+      }
+    ]
+  },
+  "training_split_seed": 23
+}
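
Reviewer note: the README metrics table appears to be the manifest's `metrics_seed23_test` values rounded to four decimals. The sketch below regenerates those rows so the two files can be kept in sync. It embeds a small excerpt of the manifest above to stay self-contained; the helper name `readme_rows` is illustrative and not part of the ConvMemory package.

```python
import json

# Excerpt of manifest.json from this commit (only the fields used below).
MANIFEST_EXCERPT = json.loads("""
{
  "metrics_seed23_test": {
    "FULL": {
      "gate": 0.09949650585237918,
      "mrr": 0.5637697126584379,
      "recall_at_10": 0.7724500686080195
    },
    "GOLD_IN_POOL": {
      "gate": 0.09949650092838078,
      "mrr": 0.5861521477894585,
      "recall_at_10": 0.8033137783415252
    },
    "RESCUABLE_STALE_TOP1": {
      "gate": 0.0994965493957573,
      "mrr": 0.30925646562133835,
      "recall_at_10": 0.6876502002670226
    }
  }
}
""")


def readme_rows(manifest, subsets):
    """Format one markdown table row per subset: name, MRR, R@10, gate."""
    rows = []
    for name in subsets:
        metrics = manifest["metrics_seed23_test"][name]
        rows.append(
            f"| {name} | {metrics['mrr']:.4f} | "
            f"{metrics['recall_at_10']:.4f} | {metrics['gate']:.4f} |"
        )
    return rows


rows = readme_rows(MANIFEST_EXCERPT, ["FULL", "GOLD_IN_POOL", "RESCUABLE_STALE_TOP1"])
for row in rows:
    print(row)
```

To run this against the full file, replace `MANIFEST_EXCERPT` with the result of `json.load` on an open `manifest.json`. Note that `GOLD_IN_POOL` is present in the manifest but has no row in the README table.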