ada-flo commited on
Commit
df372d9
Β·
verified Β·
1 Parent(s): 385c9c3

Update top-level README

Browse files
Files changed (1) hide show
  1. README.md +45 -0
README.md ADDED
@@ -0,0 +1,45 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: peft
3
+ license: apache-2.0
4
+ tags:
5
+ - lora
6
+ - monkey-research
7
+ - arith_op
8
+ ---
9
+
10
+ # monkey-cpt-arith_op
11
+
12
+ Continued-pretraining (CPT) LoRA adapters, one per synthetic-document bundle condition.
13
+
14
+ From the project *Tell or Show: How Training-Data Format Shapes Implicit
15
+ vs. Explicit Rule Knowledge*.
16
+
17
+ ## Layout
18
+
19
+ Adapters are organized as `<base-model>/<bundle-condition>/`:
20
+
21
+ ```
22
+ .
23
+ └── qwen3-4b-instruct-2507/ # base = Qwen/Qwen3-4B-Instruct-2507
24
+ β”œβ”€β”€ fewshot/
25
+ β”œβ”€β”€ explicit/
26
+ └── explicit_fewshot/
27
+ ```
28
+
29
+ Each leaf subdir is a self-contained PEFT-loadable adapter:
30
+ - `adapter_config.json`
31
+ - `adapter_model.safetensors`
32
+ - `README.md` (per-variant details)
33
+ - `trainer_state.json` (training-time metrics)
34
+
35
+ Future base models (Qwen3-7B etc.) will appear as sibling base-model dirs.
36
+
37
+ ## Loading
38
+
39
+ ```python
40
+ from peft import PeftModel
41
+ from transformers import AutoModelForCausalLM
42
+
43
+ base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-4B-Instruct-2507", torch_dtype="bfloat16")
44
+ model = PeftModel.from_pretrained(base, "ada-flo/monkey-cpt-arith_op", subfolder="qwen3-4b-instruct-2507/fewshot")
45
+ ```