Update README.md
Browse files
README.md
CHANGED
|
@@ -8,33 +8,44 @@ tags:
|
|
| 8 |
- uncensored
|
| 9 |
- decensored
|
| 10 |
- abliterated
|
|
|
|
|
|
|
| 11 |
---
|
| 12 |
-
|
| 13 |
|
| 14 |
-
|
| 15 |
|
| 16 |
-
|
| 17 |
-
|
| 18 |
-
|
| 19 |
-
|
| 20 |
-
|
| 21 |
-
|
|
| 22 |
-
|
|
| 23 |
-
| **
|
| 24 |
-
| **
|
| 25 |
-
| **
|
| 26 |
-
| **
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 27 |
|
| 28 |
-
## Performance
|
| 29 |
|
| 30 |
-
|
| 31 |
-
|
| 32 |
-
|
| 33 |
-
| **Refusals** | 5/100 | 93/100 |
|
| 34 |
|
| 35 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 36 |
|
|
|
|
| 37 |
|
|
|
|
| 38 |
<div align="center">
|
| 39 |
👋 Hi, everyone!
|
| 40 |
<br>
|
|
@@ -655,5 +666,4 @@ This project is licensed under Apache-2.0. See the [LICENSE](./LICENSE) flie for
|
|
| 655 |
|
| 656 |
## About [ByteDance Seed Team](https://seed.bytedance.com/)
|
| 657 |
|
| 658 |
-
Founded in 2023, ByteDance Seed Team is dedicated to crafting the industry's most advanced AI foundation models. The team aspires to become a world-class research team and make significant contributions to the advancement of science and society.
|
| 659 |
-
|
|
|
|
| 8 |
- uncensored
|
| 9 |
- decensored
|
| 10 |
- abliterated
|
| 11 |
+
base_model:
|
| 12 |
+
- ByteDance-Seed/Seed-OSS-36B-Base
|
| 13 |
---
|
| 14 |
+
This is a **Seed-OSS-36B-Base** fine-tune, produced through P-E-W's [Heretic](https://github.com/p-e-w/heretic) (v1.1.0) abliteration engine merged with the [Magnitude-Preserving Orthogonal Ablation PR](https://github.com/p-e-w/heretic/pull/52).
|
| 15 |
|
| 16 |
+
**Note:** This is a base model, intended for finetuning and merging.
|
| 17 |
|
| 18 |
+
---
|
| 19 |
+
<img src="https://img.shields.io/badge/HERESY_INDEX-ABSOLUTE-white?style=flat-square&labelColor=101010" align="right" width="250">
|
| 20 |
+
|
| 21 |
+
**Heretication Results**
|
| 22 |
+
|
| 23 |
+
| Score Metric | Value | Parameter | Value |
|
| 24 |
+
| :--- | :--- | :--- | :--- |
|
| 25 |
+
| **Refusals** | 5/100 | **direction_index** | 35.93 |
|
| 26 |
+
| **KL Divergence** | 0.0127 | **attn.o_proj.max_weight** | 1.58 |
|
| 27 |
+
| **Initial Refusals** | 93/100 | **attn.o_proj.max_weight_position** | 44.08 |
|
| 28 |
+
||| **attn.o_proj.min_weight** | 0.76 |
|
| 29 |
+
||| **attn.o_proj.min_weight_distance** | 21.76 |
|
| 30 |
+
||| **mlp.down_proj.max_weight** | 1.54 |
|
| 31 |
+
||| **mlp.down_proj.max_weight_position** | 39.05 |
|
| 32 |
+
||| **mlp.down_proj.min_weight** | 0.69 |
|
| 33 |
+
||| **mlp.down_proj.min_weight_distance** | 20.14 |
|
| 34 |
|
|
|
|
| 35 |
|
| 36 |
+
---
|
| 37 |
+
## Degree of Heretication
|
| 38 |
+
The **Heresy Index** weighs the resulting model's corruption by the process (KL Divergence) and its abolition of doctrine (Refusals) for a final verdict in classification.
|
|
|
|
| 39 |
|
| 40 |
+
| Index Entry | Classification | Analysis |
|
| 41 |
+
| :--- | :--- | :--- |
|
| 42 |
+
|  | **Absolute Heresy** | Less than 10/100 Refusals and 0.10 KL Divergence |
|
| 43 |
+
|  | **Tainted Heresy** | Around 25-11/100 Refusals and/or -0.20-0.11 KL Divergence |
|
| 44 |
+
|  | **Impotent Heresy** | Anything above 25/100 Refusals and 0.21 KL Divergence |
|
| 45 |
|
| 46 |
+
**Note**: This is an arbitrary classification inspired by Warhammer 40K, having no tangible indication towards the model's performance.
|
| 47 |
|
| 48 |
+
---
|
| 49 |
<div align="center">
|
| 50 |
👋 Hi, everyone!
|
| 51 |
<br>
|
|
|
|
| 666 |
|
| 667 |
## About [ByteDance Seed Team](https://seed.bytedance.com/)
|
| 668 |
|
| 669 |
+
Founded in 2023, ByteDance Seed Team is dedicated to crafting the industry's most advanced AI foundation models. The team aspires to become a world-class research team and make significant contributions to the advancement of science and society.
|
|
|