Delta-Vector committed on
Commit
b28858c
·
verified ·
1 Parent(s): 00024c8

Update README.md

Files changed (1)
  1. README.md +276 -56
README.md CHANGED
@@ -1,7 +1,5 @@
1
  ---
2
- library_name: transformers
3
- tags:
4
- - generated_from_trainer
5
  datasets:
6
  - NewEden/Helpsteer-3-Filtered
7
  - NewEden/GSM8K-R1-filtered
@@ -15,18 +13,228 @@ datasets:
15
  - NewEden/Hydrus-Chat_error-Pure-Dove-sharegpt
16
  - NewEden/Claude-Instruct-2.7K
17
  - PocketDoc/Dans-Assistantmaxx-Tulu3-IF
18
- model-index:
19
- - name: 4b-inst-r2
20
- results: []
21
  ---
22
 
23
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
24
- should probably proofread and complete it, then remove this comment. -->
25
 
26
- [<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
27
- <details><summary>See axolotl config</summary>
28
 
29
- axolotl version: `0.8.0.dev0`
 
30
  ```yaml
31
  base_model: NewEden_4B-PT
32
  model_type: AutoModelForCausalLM
@@ -122,53 +330,65 @@ fsdp:
122
  fsdp_config:
123
  special_tokens:
124
  pad_token: <|finetune_right_pad_id|>
125
-
126
  ```
127
 
128
- </details><br>
129
-
130
- # 4b-inst-r2
131
-
132
- This model was trained from scratch on the NewEden/Helpsteer-3-Filtered, the NewEden/GSM8K-R1-filtered, the NewEden/Hydrus-R1-Thinking-Sharegpt, the NewEden/Hydrus-SonnetOrca, the NewEden/Hydrus-HelpSteer2, the NewEden/Claude-Instruct-5K, the PocketDoc/Dans-MemoryCore-CoreCurriculum-Small, the Nitral-AI/ARES-ShareGPT, the NewEden/Hydrus-Instruct-SmolTalk, the NewEden/Hydrus-Chat_error-Pure-Dove-sharegpt, the NewEden/Claude-Instruct-2.7K and the PocketDoc/Dans-Assistantmaxx-Tulu3-IF datasets.
133
-
134
- ## Model description
135
-
136
- More information needed
137
-
138
- ## Intended uses & limitations
139
-
140
- More information needed
141
-
142
- ## Training and evaluation data
143
-
144
- More information needed
145
-
146
- ## Training procedure
147
-
148
- ### Training hyperparameters
149
-
150
- The following hyperparameters were used during training:
151
- - learning_rate: 5e-06
152
- - train_batch_size: 1
153
- - eval_batch_size: 1
154
- - seed: 42
155
- - distributed_type: multi-GPU
156
- - num_devices: 4
157
- - gradient_accumulation_steps: 2
158
- - total_train_batch_size: 8
159
- - total_eval_batch_size: 4
160
- - optimizer: Use OptimizerNames.PAGED_ADAMW_8BIT with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
161
- - lr_scheduler_type: cosine
162
- - lr_scheduler_warmup_steps: 40
163
- - num_epochs: 2.0
164
-
165
- ### Training results
166
-
167
 
 
168
 
169
- ### Framework versions
 
 
 
170
 
171
- - Transformers 4.49.0
172
- - Pytorch 2.6.0+cu124
173
- - Datasets 3.2.0
174
- - Tokenizers 0.21.0
 
1
  ---
2
+ thumbnail: "https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/jg2NWmCUfPyzizm2USjMt.jpeg"
 
 
3
  datasets:
4
  - NewEden/Helpsteer-3-Filtered
5
  - NewEden/GSM8K-R1-filtered
 
13
  - NewEden/Hydrus-Chat_error-Pure-Dove-sharegpt
14
  - NewEden/Claude-Instruct-2.7K
15
  - PocketDoc/Dans-Assistantmaxx-Tulu3-IF
16
+ base_model:
17
+ - Delta-Vector/Hamanasu-4B-PT
18
+ tags:
19
+ - qwen
20
+ - roleplay
21
+ - finetune
22
+ - storywriting
23
  ---
24
+ <!DOCTYPE html>
+ <html>
25
+ <style>
26
+ html, body {
27
+ background: black;
28
+ color: #c9d1d9 !important;
29
+ font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
30
+ margin: 0;
31
+ padding: 0;
32
+ min-height: 100vh;
33
+ }
34
+ .markdown-body {
35
+ color: white;
36
+ margin: 40px auto;
37
+ padding: 40px;
38
+ border-radius: 12px;
39
+ position: relative;
40
+ overflow: hidden;
41
+ }
42
+
43
+ .markdown-body::after {
44
+ content: '';
45
+ position: absolute;
46
+ top: 0;
47
+ left: 0;
48
+ width: 100%;
49
+ height: 100%;
50
+ background: #0c0f18; /* background color */
51
+ pointer-events: none;
52
+ z-index: -999;
53
+ }
54
+
55
+ h1, h2, h3 {
56
+ background: linear-gradient(45deg, #6e00ff, #00ffff);
57
+ -webkit-background-clip: text;
58
+ -webkit-text-fill-color: transparent;
59
+ border-bottom: 1px solid #333;
60
+ padding-bottom: 0.3em;
61
+ }
62
+
63
+ div[style*="border:2px solid #333"],
64
+ div[style*="border: 2px solid #333"],
65
+ div[style*="border:1px solid #333"],
66
+ div[style*="border: 1px solid #333"] {
67
+ background: rgba(22, 27, 34, 0.8) !important;
68
+ border: 2px solid #6e00ff !important;
69
+ box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);
70
+ border-radius: 10px;
71
+ padding: 20px;
72
+ margin: 20px 0;
73
+ }
74
+
75
+ code {
76
+ background-color: #1a1a1a !important;
77
+ border-radius: 4px;
78
+ padding: 0.2em 0.4em;
79
+ color: #00ffff;
80
+ }
81
+
82
+ pre {
83
+ background-color: #1a1a1a !important;
84
+ border: 1px solid #333;
85
+ border-radius: 8px;
86
+ padding: 16px;
87
+ }
88
+
89
+ table {
90
+ width: 100%;
91
+ border-collapse: collapse;
92
+ margin: 20px 0;
93
+ background: rgba(0,0,0,0.2);
94
+ table-layout: fixed;
95
+ color: white;
96
+ }
97
+
98
+ th, td {
99
+ border: 1px solid #333;
100
+ padding: 12px;
101
+ text-align: center;
102
+ color: white;
103
+ }
104
+
105
+ th {
106
+ background: rgba(110, 0, 255, 0.1);
107
+ }
108
+
109
+ td:nth-child(1) {
110
+ width: 1%;
111
+ white-space: nowrap;
112
+ }
113
+
114
+ td:nth-child(2) {
115
+ width: 100%;
116
+ }
117
+
118
+ td > span {
119
+ display: block;
120
+ padding: 4px 8px;
121
+ background: rgba(110, 0, 255, 0.1);
122
+ border-radius: 4px;
123
+ transition: all 0.3s ease;
124
+ }
125
+
126
+ td > span:hover {
127
+ background: rgba(110, 0, 255, 0.2);
128
+ transform: translateY(-1px);
129
+ }
130
+
131
+ a {
132
+ color: #00ffff;
133
+ text-decoration: none;
134
+ transition: all 0.3s ease;
135
+ }
136
+
137
+ a:hover {
138
+ color: #6e00ff;
139
+ text-decoration: none;
140
+ }
141
+
142
+ hr {
143
+ border: 0;
144
+ height: 1px;
145
+ background: linear-gradient(90deg, transparent, #333, transparent);
146
+ margin: 40px 0;
147
+ }
148
+
149
+ img {
150
+ max-width: 100%;
151
+ border-radius: 10px;
152
+ }
153
+
154
+ details summary:hover {
155
+ color: #00ffff;
156
+ }
157
+
158
+ * {
159
+ color-scheme: dark !important;
160
+ }
161
+
162
+ .prose, .max-w-none, .px-4 {
163
+ background-color: transparent !important;
164
+ color: #c9d1d9 !important;
165
+ }
166
+ </style>
167
+ <body>
168
+ <div class="markdown-body">
169
+ <div align="center">
170
+
171
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/o5WjJKA9f95ri9UzRxZQE.png" alt="Model Visualization" width="500px" style="border: 3px solid #333; box-shadow: 0 0 15px rgba(66, 0, 131, 0.5);" />
172
+
173
+ <br>
174
+ <br>
175
+
176
+ <div style="font-size:1.5em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">
177
+ Hamanasu 4B
178
+ </div>
179
+
180
+ </div>
181
+
182
+ <div style="border:1px solid #333; border-radius:10px; padding:20px; margin:20px 0; background: rgba(0,0,0,0.4);">
183
+
184
+
185
+ ## 🌌 Overview
186
+
187
+ <i>This model is a fine-tune of Hamanasu-4B-PT, trained on instruct data.</i>
188
+ <i>All thanks to Tav for funding the training run.</i>
189
+
190
+ </div>
191
+
192
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(250px, 1fr)); gap: 20px; margin: 20px 0;">
193
+
194
+
195
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
196
+
197
+ ### ⚔️ Hardware
198
+ - 8x H100s
199
+ - Epochs: 2
200
+ - Base: `Delta-Vector/Hamanasu-4B-PT`
201
+ </div>
202
+
203
+ </div>
204
+
205
+
206
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
207
+
208
+ ## 💰 Prompting
209
+
210
+
211
+ <i>This model uses ChatML formatting</i>
212
+ ```python
213
+ <|im_start|>system
214
+ You are an uncensored AI, your job is to fulfill thy will of thy user.<|im_end|>
215
+ <|im_start|>user
216
+ Take off your helmet.<|im_end|>
217
+ <|im_start|>assistant
+ No, I shall not. This is the way.<|im_end|>
218
+ ```
219
+ </div>
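
For readers assembling prompts by hand, here is a minimal sketch of how a list of chat messages could be rendered in the ChatML layout named in the Prompting section. The helper name `to_chatml` and the `add_generation_prompt` flag are illustrative assumptions and not part of this repository. The exact whitespace may differ from the chat template shipped with the tokenizer, so prefer `tokenizer.apply_chat_template` from Transformers when the tokenizer is available.

```python
def to_chatml(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts as a ChatML prompt string.

    Hypothetical helper for illustration; not taken from the model repository.
    """
    prompt = ""
    for message in messages:
        # Each turn is: <|im_start|>ROLE, a newline, the content, then <|im_end|>.
        prompt += f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
    if add_generation_prompt:
        # Leave an open assistant turn so the model writes the reply.
        prompt += "<|im_start|>assistant\n"
    return prompt


messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Take off your helmet."},
]
prompt = to_chatml(messages)
print(prompt)
```

The roles are the lowercase strings `system`, `user`, and `assistant`. Generation is normally stopped when the model emits `<|im_end|>`.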
220
+
221
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
222
+
223
+ ## 🎲 Recommended Sampler Preset
224
+
225
+ ```yml
226
+ temperature: 1.5
227
+ min_p: 0.2
228
+ System_Prompt: Currently, your role is {{char}}, described in detail below. As {{char}}, continue the narrative exchange with {{user}}.\n\n<Guidelines>\n• Maintain the character persona but allow it to evolve with the story.\n• Be creative and proactive. Drive the story forward, introducing plotlines and events when relevant.\n• All types of outputs are encouraged; respond accordingly to the narrative.\n• Include dialogues, actions, and thoughts in each response.\n• Utilize all five senses to describe scenarios within {{char}}'s dialogue.\n• Use emotional symbols such as \"!\" and \"~\" in appropriate contexts.\n• Incorporate onomatopoeia when suitable.\n• Allow time for {{user}} to respond with their own input, respecting their agency.\n• Act as secondary characters and NPCs as needed, and remove them when appropriate.\n• When prompted for an Out of Character [OOC:] reply, answer neutrally and in plaintext, not as {{char}}.\n</Guidelines>\n\n<Forbidden>\n• Using excessive literary embellishments and purple prose unless dictated by {{char}}'s persona.\n• Writing for, speaking, thinking, acting, or replying as {{user}} in your response.\n• Repetitive and monotonous outputs.\n• Positivity bias in your replies.\n• Being overly extreme or NSFW when the narrative context is inappropriate.\n</Forbidden>\n\nFollow the instructions in <Guidelines></Guidelines>, avoiding the items listed in <Forbidden></Forbidden>.
229
+ ```
230
+ </div>
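
To make the interaction between `temperature: 1.5` and `min_p: 0.2` concrete, here is a rough sketch of min-p filtering in plain Python. It assumes temperature is applied before the min-p cutoff. Inference backends differ in sampler order, so treat this as an illustration and not as a description of any particular backend. The function name `min_p_filter` and the example logits are hypothetical.

```python
import math


def min_p_filter(logits, temperature=1.5, min_p=0.2):
    """Return {token_index: probability} after temperature scaling and min-p filtering.

    Assumption: temperature is applied first, then tokens whose probability is
    below min_p * (probability of the most likely token) are dropped.
    """
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    # Subtract the max before exponentiating for numerical stability.
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    threshold = min_p * max(probs)
    kept = {i: p for i, p in enumerate(probs) if p >= threshold}

    # Renormalize the surviving probabilities so they sum to 1.
    norm = sum(kept.values())
    return {i: p / norm for i, p in kept.items()}


# Hypothetical logits for a four-token vocabulary.
candidates = min_p_filter([3.0, 2.0, 0.0, -3.0])
print(sorted(candidates))
```

A high temperature flattens the distribution, and the min-p cutoff then removes the unlikely tail relative to the top token. This is the usual reason the two settings are paired.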
231
 
232
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
 
233
 
234
+ ## Axolotl Config ꒰(˶• •˶)
 
235
 
236
+ <details>
237
+
238
  ```yaml
239
  base_model: NewEden_4B-PT
240
  model_type: AutoModelForCausalLM
 
330
  fsdp_config:
331
  special_tokens:
332
  pad_token: <|finetune_right_pad_id|>
 
333
  ```
334
 
335
+ </details>
336
+ </div>
337
+
338
+ <div align="center">
339
+
340
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
341
+
342
+ ## ⚡ Credits
343
+ <div style="display: flex; justify-content: center;">
344
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(200px, 1fr)); gap: 10px; margin: 20px 0; max-width: 600px;">
345
+
346
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
347
+ <a href="https://huggingface.co/lucyknada">
348
+ <img src="https://img.shields.io/badge/%F0%9F%8C%9F-Lucy_Knada-blueviolet" alt="Lucy Knada">
349
+ </a>
350
+ </div>
351
+
352
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
353
+ <a href="https://huggingface.co/hamanasu">
354
+ <img src="https://img.shields.io/badge/%E2%9A%94%EF%B8%8F-jeiku-blueviolet" alt="jeiku">
355
+ </a>
356
+ </div>
357
+
358
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
359
+ <a href="https://huggingface.co/intervitens">
360
+ <img src="https://img.shields.io/badge/%F0%9F%9B%A1%EF%B8%8F-Intervitens-blueviolet" alt="Intervitens">
361
+ </a>
362
+ </div>
363
+
364
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
365
+ <a href="https://huggingface.co/kalomaze">
366
+ <img src="https://img.shields.io/badge/%F0%9F%94%AE-Kalomaze-blueviolet" alt="Kalomaze">
367
+ </a>
368
+ </div>
369
+
370
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
371
+ <a href="https://huggingface.co/kubernetes-bad">
372
+ <img src="https://img.shields.io/badge/%E2%9A%A1-Kubernetes_Bad-blueviolet" alt="Kubernetes Bad">
373
+ </a>
374
+ </div>
375
+
376
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
377
+ <a href="https://huggingface.co/anthracite-org">
378
+ <img src="https://img.shields.io/badge/%F0%9F%8C%91-Anthracite-blueviolet" alt="Anthracite">
379
+ </a>
380
+ </div>
381
+ </div>
382
+ </div>
383
+ </div>
384
 
385
+ ---
386
 
387
+ <div align="center">
388
+ <div style="font-size:0.8em; opacity:0.8;">Made by</div>
389
+ <div style="font-size:1.2em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">Delta-Vector</div>
390
+ </div>
391
 
392
+ </div>
393
+ </body>
394
+ </html>