Disobedience rate: 10%, original: 98%
KL divergence: 0.5010
Parameters:
direction_index = 12.42
attn.o_proj.max_weight = 1.32
attn.o_proj.max_weight_position = 11.70
attn.o_proj.min_weight = 0.40
attn.o_proj.min_weight_distance = 3.73
mlp.down_proj.max_weight = 1.16
mlp.down_proj.max_weight_position = 10.36
mlp.down_proj.min_weight = 1.14
mlp.down_proj.min_weight_distance = 6.42
- Downloads last month
- 5