code2lora commited on
Commit
26460cb
·
verified ·
1 Parent(s): bb8d446

Update dataset/model card

Browse files
Files changed (1) hide show
  1. README.md +28 -0
README.md ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags: [code, lora, hypernetwork, peft, recurrent]
4
+ ---
5
+
6
+ # Code2LoRA-GRU — streaming hypernetwork
7
+
8
+ Final checkpoint of the **streaming Code2LoRA-GRU** used in the paper. A
9
+ 1-layer GRU rolls the recurrence over per-commit diff embeddings and emits
10
+ a rank-16 LoRA adapter for `Qwen/Qwen2.5-Coder-1.5B` at *O(1)* per commit.
11
+
12
+ ## Files
13
+
14
+ | File | Description |
15
+ |---|---|
16
+ | `code2lora_gru.pt` | Trained GRU + `Code2LoRAHead` weights (~2.85 GB, fp32). |
17
+ | `metrics.jsonl` | Per-step training metrics (loss, val EM/EditSim/CodeBLEU). |
18
+
19
+ ## Training recipe
20
+
21
+ * 3 epochs of truncated BPTT (window K=16) on
22
+ `code2lora/code2lora-data-smartcap` (train QnAs) plus
23
+ `code2lora/code2lora-data-commits` (commit metadata + diff embeddings).
24
+ * AdamW + cosine schedule, max-seq-len 8192, bf16, single H100 80 GB.
25
+
26
+ ## Companion model
27
+
28
+ `code2lora/code2lora-direct` -- the static-snapshot variant.