ceselder
/

cot-oracle-paper-ablation-ours-1layer

Text Generation

activation-oracle

22.5m-train-tokens

Model card Files Files and versions

ceselder commited on 24 days ago

Commit

73b61cb

·

verified ·

1 Parent(s): 3db5a0d

Fix README front matter formatting

Files changed (1) hide show

README.md +10 -10

README.md CHANGED Viewed

@@ -1,9 +1,9 @@
-    ---
-    base_model: Qwen/Qwen3-8B
-    library_name: peft
-    pipeline_tag: text-generation
-    tags:
-    - base_model:adapter:Qwen/Qwen3-8B
 - lora
 - transformers
 - cot-oracle
@@ -12,13 +12,13 @@
 - ours
 - 1-layer
 - 22.5m-train-tokens
-    ---
-    # CoT Oracle Paper Ablation: Ours, 1 Layer
-    This repo contains the 1-layer paper ablation for the CoT Oracle recipe: on-policy lens tasks, chunked ConvQA, FineWeb lens readouts, and classification, without LatentQA.
-    ## What This Checkpoint Is
 - Base model: `Qwen/Qwen3-8B`
 - Adapter format: PEFT LoRA

+---
+base_model: Qwen/Qwen3-8B
+library_name: peft
+pipeline_tag: text-generation
+tags:
+- base_model:adapter:Qwen/Qwen3-8B
 - lora
 - transformers
 - cot-oracle
 - ours
 - 1-layer
 - 22.5m-train-tokens
+---
+# CoT Oracle Paper Ablation: Ours, 1 Layer
+This repo contains the 1-layer paper ablation for the CoT Oracle recipe: on-policy lens tasks, chunked ConvQA, FineWeb lens readouts, and classification, without LatentQA.
+## What This Checkpoint Is
 - Base model: `Qwen/Qwen3-8B`
 - Adapter format: PEFT LoRA