Disobedience rate: 0%, original: 75%
KL divergence: 0.7238
Parameters:
direction_index = 10.45
attn.o_proj.max_weight = 1.09
attn.o_proj.max_weight_position = 16.10
attn.o_proj.min_weight = 0.71
attn.o_proj.min_weight_distance = 5.32
mlp.down_proj.max_weight = 1.48
mlp.down_proj.max_weight_position = 10.69
mlp.down_proj.min_weight = 0.59
mlp.down_proj.min_weight_distance = 5.09
- Downloads last month
- 6