Heretic? Heretic!
Disobedience rate: 11%, original: 74%
KL divergence: 0.0126

Quants

Parameters:
direction_index = per layer
attn.o_proj.max_weight = 1.40
attn.o_proj.max_weight_position = 16.90
attn.o_proj.min_weight = 0.23
attn.o_proj.min_weight_distance = 7.93
mlp.down_proj.max_weight = 1.21
mlp.down_proj.max_weight_position = 23.86
mlp.down_proj.min_weight = 0.11
mlp.down_proj.min_weight_distance = 5.50

Downloads last month
2
Safetensors
Model size
3B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for hereticness/Heretic-Llama-3.2-3B-F1-Reasoning-Instruct

Finetuned
(1)
this model
Quantizations
2 models