luciaquirke's picture
Upload affine transforms with MSE metrics
3446217 verified
---
tags:
- affine-transform
- activation-mapping
library_name: safetensors
---
# Affine Transform: EleutherAI/deep-ignorance-pretraining-stage-unfiltered@global_step38144 → EleutherAI/deep-ignorance-unfiltered@main
Learned affine transformation mapping hidden state activations from a source checkpoint to a target model.
## Usage
```python
from safetensors.torch import load_file
import torch.nn as nn
from huggingface_hub import hf_hub_download
# Download files
weights_path = hf_hub_download(repo_id="EleutherAI/affine-checkpoint-transfer", filename="affine_transforms.safetensors")
metadata_path = hf_hub_download(repo_id="EleutherAI/affine-checkpoint-transfer", filename="metadata.json")
# Load
import json
with open(metadata_path) as f:
metadata = json.load(f)
weights = load_file(weights_path)
affine_transforms = {}
for layer_idx in metadata["layer_indices"]:
linear = nn.Linear(metadata["hidden_dim"], metadata["hidden_dim"], bias=True)
linear.weight.data = weights[f"layer_{layer_idx}.weight"]
linear.bias.data = weights[f"layer_{layer_idx}.bias"]
affine_transforms[layer_idx] = linear
```
## MSE Metrics
| Layer | MSE |
|-------|-----|
| 5 | 0.081037 |
| 10 | 0.289330 |
| 15 | 0.684350 |
| 20 | 0.978894 |
| 25 | 1.569308 |
| 30 | 4.231404 |
**Mean MSE: 1.305720**
## Training Details
- **Source Model:** EleutherAI/deep-ignorance-pretraining-stage-unfiltered@global_step38144
- **Target Model:** EleutherAI/deep-ignorance-unfiltered@main
- **Hidden Dimension:** 4096
- **Ridge Alpha:** 0.01
- **Layers:** [5, 10, 15, 20, 25, 30]
- **Training Examples:** 100000