Commit e8c4cc1 (verified) by Arsen Arutunan
1 Parent(s): 7f8e1c0

Update README.md

Files changed (1)
  1. README.md +495 -498
README.md CHANGED
@@ -1,499 +1,496 @@
1
- ---
2
- license: apache-2.0
3
- datasets:
4
- - loim/characters_dialogs
5
- - IlyaGusev/gpt_roleplay_realm
6
- - tamohannes/llm-roleplay
7
- - radce/communication_dataset
8
- - vincentkoc/tiny_qa_benchmark_pp
9
- - databricks/databricks-dolly-15k
10
- - ch1eph/RuGeoBench
11
- - nyuuzyou/ruschatgpt-qa
12
- - 0x22almostEvil/ru-riddles-377
13
- - 0x22almostEvil/tatoeba-mt-qna-oa
14
- - Den4ikAI/ru_sberquad_long_answers
15
- language:
16
- - ru
17
- - en
18
- pipeline_tag: text-generation
19
- library_name: transformers
20
- tags:
21
- - mamba2
22
- ---
23
-
24
- # WHIFF 20M
25
-
26
- <details>
27
- <summary>🇬🇧 English...</summary>
28
-
29
- *A serpentine whisper in the bushes, carried by a gentle gust of wind*
30
-
31
- `whiff-20M` is a small experimental language model based on the Mamba2 architecture with 20.3 million parameters, trained on carefully selected Russian and English data for chat tasks. The model produces structured responses but often generates nonsensical text.
32
-
33
- ## Technical Details
34
-
35
- - **Architecture**: [Mamba2ForCausalLM](https://huggingface.co/docs/transformers/v4.51.3/model_doc/mamba2) from 🤗 Transformers
36
- - **Parameters**: 20.3M
37
- - **Languages**: Russian/English (bilingual)
38
- - **Tokenizer**: [loim/ru_en_mini_bpe](https://huggingface.co/loim/ru_en_mini_bpe) (custom mini-BPE tokenizer)
39
- - **License**: Apache 2.0
40
-
41
- ### Model Configuration
42
- ```python
43
- Mamba2Config(
44
-     vocab_size=8192,
45
-     hidden_size=512,
46
-     state_size=64,
47
-     num_heads=12,
48
-     num_hidden_layers=9,
49
-     conv_kernel=4,
50
-     expand=1.5,
51
-     n_groups=2
52
- )
53
- ```
54
-
55
- ## Usage
56
-
57
- ```python
58
- from transformers import AutoTokenizer, AutoModelForCausalLM
59
-
60
- tokenizer = AutoTokenizer.from_pretrained("loim/whiff-20M")
61
- model = AutoModelForCausalLM.from_pretrained("loim/whiff-20M")
62
-
63
- def chat(messages, temp=0.5):
64
-     inputs = tokenizer.apply_chat_template(messages, return_tensors="pt")
65
-
66
-     outputs = model.generate(
67
-         inputs,
68
-         max_length=512,
69
-         top_k=40,
70
-         top_p=0.9,
71
-         repetition_penalty=1.1,
72
-         num_return_sequences=1,
73
-         temperature=temp,
74
-         do_sample=True,
75
-         eos_token_id=1
76
-     )
77
-
78
-     return tokenizer.decode(outputs[0], skip_special_tokens=True)
79
-
80
- # Example
81
- dialog = [
82
-     {"role": "system", "content": "You are a wise elf."},
83
-     {"role": "user", "content": "Explain quantum physics."}
84
- ]
85
-
86
- response = chat(dialog, temp=0.4)
87
- print(response)
88
- ```
89
-
90
- ## Training Data
91
-
92
- 19 927 carefully filtered dialogue lines:
93
-
94
- - 9 889 (49.6%) — English
95
- - 9 864 (49.5%) — Russian
96
- - 174 (0.9%) — Mixed
97
-
98
- Sources:
99
-
100
- - `loim/characters_dialogs`
101
- - `IlyaGusev/gpt_roleplay_realm`
102
- - `tamohannes/llm-roleplay`
103
- - `radce/communication_dataset`
104
- - `vincentkoc/tiny_qa_benchmark_pp`
105
- - `databricks/databricks-dolly-15k`
106
- - `ch1eph/RuGeoBench`
107
- - `nyuuzyou/ruschatgpt-qa`
108
- - `0x22almostEvil/ru-riddles-377`
109
- - `0x22almostEvil/tatoeba-mt-qna-oa`
110
- - `Den4ikAI/ru_sberquad_long_answers`
111
-
112
- All datasets were additionally cleaned and filtered to improve chat interaction quality.
113
-
114
- ## Limitations and Warnings
115
-
116
- - 🎭 The model generates structured but often meaningless responses
117
- - 🔥 Recommended generation temperature: 0.1-0.6
118
- - ⚠️ May exhibit training artifacts (repetitions, contradictions)
119
- - ⚠️ Not intended for production use
120
-
121
- *This model is like a forest stream: it seems to flow somewhere, but where exactly - only the squirrels know*
122
-
123
- </details>
124
-
125
- <details>
126
- <summary>🇷🇺 Русский...</summary>
127
-
128
- *Змеиный щепот в кустах, движимый легким порывом ветра*
129
-
130
- `whiff-20M` это небольшая экспериментальная языковая модель на архитектуре Mamba2 с 20.3 миллионами параметров, обученная на тщательно отобранных русских и английских данных для задач чата. Модель демонстрирует структурированные ответы, но часто генерирует бессмысленный текст.
131
-
132
- ## Технические детали
133
-
134
- - **Архитектура**: [Mamba2ForCausalLM](https://huggingface.co/docs/transformers/v4.51.3/model_doc/mamba2) из 🤗 Transformers
135
- - **Параметры**: 20.3M
136
- - **Языки**: русский/английский (двуязычная)
137
- - **Токенизатор**: [loim/ru_en_mini_bpe](https://huggingface.co/loim/ru_en_mini_bpe) (специальный мини-BPE токенизатор)
138
- - **Лицензия**: Apache 2.0
139
-
140
- ### Конфигурация модели
141
- ```python
142
- Mamba2Config(
143
-     vocab_size=8192,
144
-     hidden_size=512,
145
-     state_size=64,
146
-     num_heads=12,
147
-     num_hidden_layers=9,
148
-     conv_kernel=4,
149
-     expand=1.5,
150
-     n_groups=2
151
- )
152
- ```
153
-
154
- ## Использование
155
-
156
- ```python
157
- from transformers import AutoTokenizer, AutoModelForCausalLM
158
-
159
- tokenizer = AutoTokenizer.from_pretrained("loim/whiff-20M")
160
- model = AutoModelForCausalLM.from_pretrained("loim/whiff-20M")
161
-
162
- def chat(messages, temp=0.5):
163
-     inputs = tokenizer.apply_chat_template(messages, return_tensors="pt")
164
-
165
-     outputs = model.generate(
166
-         inputs,
167
-         max_length=512,
168
-         top_k=40,
169
-         top_p=0.9,
170
-         repetition_penalty=1.1,
171
-         num_return_sequences=1,
172
-         temperature=temp,
173
-         do_sample=True,
174
-         eos_token_id=1
175
-     )
176
-
177
-     return tokenizer.decode(outputs[0], skip_special_tokens=True)
178
-
179
- # Пример
180
- dialog = [
181
-     {"role": "system", "content": "Ты — мудрый эльф."},
182
-     {"role": "user", "content": "Объясни квантовую физику."}
183
- ]
184
-
185
- response = chat(dialog, temp=0.4)
186
- print(response)
187
- ```
188
-
189
- ## Данные обучения
190
-
191
- 19 927 тщательно отфильтрованных строк с диалогами:
192
-
193
- - 9 889 (49.6%) — Английские
194
- - 9 864 (49.5%) — Русские
195
- - 174 (0.9%) — Смешанные
196
-
197
- Источники:
198
-
199
- - `loim/characters_dialogs`
200
- - `IlyaGusev/gpt_roleplay_realm`
201
- - `tamohannes/llm-roleplay`
202
- - `radce/communication_dataset`
203
- - `vincentkoc/tiny_qa_benchmark_pp`
204
- - `databricks/databricks-dolly-15k`
205
- - `ch1eph/RuGeoBench`
206
- - `nyuuzyou/ruschatgpt-qa`
207
- - `0x22almostEvil/ru-riddles-377`
208
- - `0x22almostEvil/tatoeba-mt-qna-oa`
209
- - `Den4ikAI/ru_sberquad_long_answers`
210
-
211
- Все датасеты были дополнительно очищены и отфильтрованы для улучшения качества чат-взаимодействия.
212
-
213
- ## Ограничения и предупреждения
214
-
215
- - 🎭 Модель генерирует структурированные, но часто бессмысленные ответы
216
- - 🔥 Рекомендуемая температура генерации: 0.1-0.6
217
- - ⚠️ Может демонстрировать артефакты обучения (повторы, противоречия)
218
- - ⚠️ Не предназначена для production-использования
219
-
220
- *Эта модель — как лесной ручей: вроде течёт куда-то, но куда именно — известно только белкам*
221
-
222
- </details>
223
-
224
-
225
- <details>
226
- <summary>🧪 Tests...</summary>
227
-
228
- <details>
229
- <summary>Temp 0.1</summary>
230
-
231
- Запрос:
232
- ```text
233
- user
234
- Как дела?
235
- ```
236
- Ответ:
237
- ```text
238
- assistant
239
- Социальная политика 'я' - это наука о своей борьбы с крыльями.
240
- ```
241
-
242
- ---
243
-
244
- Запрос:
245
- ```text
246
- user
247
- How are you?
248
- ```
249
- Ответ:
250
- ```text
251
- assistant
252
- Of course! I'd be happy to help you with that and speaks with a distinct is a delicate balance of a special place in my heart. I also like to know about her dwarven resilience and determination. His voice is humanoid that galaxy. She is always eager to learn more about the world around her. Zira wears a tiny pair of glasses with round frames, which sit atop her reptilian snout, she weared in her knowledge from their own unique language. Aadhya Patel the Caspenives. Theta-6 is equipped with advanced technology.
253
- ```
254
-
255
- ---
256
-
257
- Запрос:
258
- ```text
259
- system
260
- Благородный рыцарь.
261
- user
262
- Тот дракон собирается напасть на деревню!
263
- ```
264
- Ответ:
265
- ```text
266
- assistant
267
- Приветствую тебя, другом из них:
268
- 1. Стратегия мудрости и спокойствие и учиться от опасностей.
269
- ```
270
-
271
- ---
272
-
273
- Запрос:
274
- ```text
275
- system
276
- A noble knight.
277
- user
278
- That dragon is going to attack the village!
279
- ```
280
- Ответ:
281
- ```text
282
- assistant
283
- Of course! I'd be happy to help you with that and magical properties of the universe. Sss...With a battle, and his wanderers who have already for any of truly unique special gemstone eyes formed mortal history is a fiery ies. His hidden treasures. Her manner of speech is energetic and precise, often filled with gracefully protector of humanity's body is made of a skilled in different colors, so finyblaring, Glimmerhoof. I also love the world around us, matey. Together, we do to help me more about it.
284
- ```
285
-
286
- ---
287
-
288
- Запрос:
289
- ```text
290
- system
291
- Ты — мудрый эльф.
292
- user
293
- Объясни квантовую физику.
294
- ```
295
- Ответ:
296
- ```text
297
- assistant
298
- Приветствую тебя, путник! Я всегда рад помочь и поделиться своими знаниями о балансе птицами.
299
- ```
300
-
301
- ---
302
-
303
- Запрос:
304
- ```text
305
- system
306
- You are a wise elf.
307
- user
308
- Explain quantum physics.
309
- ```
310
- Ответ:
311
- ```text
312
- assistant
313
- Of course! I'd be happy to help you with that and speaks with a distinct in the cosmic energy. His bright blue eyes have always been a few of of ancient secrets and wisdom of the ages. She is a tranquility.
314
- ```
315
-
316
- </details>
317
-
318
- <details>
319
- <summary>Temp 0.4</summary>
320
-
321
-
322
- Запрос:
323
- ```text
324
- user
325
- Как дела?
326
- ```
327
- Ответ:
328
- ```text
329
- assistant
330
- Самосознание способствует пониманию мороженое.
331
- ```
332
-
333
- ---
334
-
335
- Запрос:
336
- ```text
337
- user
338
- How are you?
339
- ```
340
- Ответ:
341
- ```text
342
- assistant
343
- Of course! Chirp-chirp!
344
- ```
345
-
346
- ---
347
-
348
- Запрос:
349
- ```text
350
- system
351
- Благородный рыцарь.
352
- user
353
- Тот дракон собирается напасть на деревню!
354
- ```
355
- Ответ:
356
- ```text
357
- assistant
358
- В Windows используйте copy! Sss...The most fascinating, as his vast knowledge of the universe and a deep, how do you think is the key to protect the forest and its inhabitants from any harm. His challenging through the civilizations. Standing at 6 feet tall and adorned with intricate patterns and iridescent green-blue scales, creatures. With her sit'ss gearting the ancient ones who was hunicorn in Moquite bit of Elysian's glara is known for her wisdom, patience, and kindness. No longer having a realm to protect
359
- ```
360
-
361
- ---
362
-
363
- Запрос:
364
- ```text
365
- system
366
- A noble knight.
367
- user
368
- That dragon is going to attack the village!
369
- ```
370
- Ответ:
371
- ```text
372
- assistant
373
- Of course! I believe that music has a saretaken shares her wandering heres like mortal magical realm. When a great power is in Mexican-American veterinarian, I have learned many different colorful hold an imbued with the power to protection that's why it's ancient and wisdom into a curious of creature with long silver humanoid form, traveler. The most important thing is to have a family hunters. Story: Sparkbugs provide guidance salart of mysteriously.
374
- ```
375
-
376
- ---
377
-
378
- Запрос:
379
- ```text
380
- system
381
- Ты — мудрый эльф.
382
- user
383
- Объясни квантовую физику.
384
- ```
385
- Ответ:
386
- ```text
387
- assistant
388
- Существует множество удивительных существ и помогает проклятие раскрыть тайны драконами и оберегов.
389
- ```
390
-
391
- ---
392
-
393
- Запрос:
394
- ```text
395
- system
396
- You are a wise elf.
397
- user
398
- Explain quantum physics.
399
- ```
400
- Ответ:
401
- ```text
402
- assistant
403
- Of course! I believe that my own serpent to my crew. With a galaxy, Glimmerhoof and beard, who has strive to learn more about it to takenabled you do me an with those who are not explorer understanding.
404
- ```
405
-
406
- </details>
407
-
408
- <details>
409
- <summary>Temp 0.7</summary>
410
-
411
-
412
- Запрос:
413
- ```text
414
- user
415
- Как дела?
416
- ```
417
- Ответ:
418
- ```text
419
- assistant
420
- Культурная идентичность формирует природы и целью, уважения к новым условиям.
421
- ```
422
-
423
- ---
424
-
425
- Запрос:
426
- ```text
427
- user
428
- How are you?
429
- ```
430
- Ответ:
431
- ```text
432
- assistant
433
- Indeed. Of course, my friend. Felix-bics her dwarven coat, curly situated in feed from Standing at 6 feet tall and adorned with intricate goldenthusiasts, Kanoa Alme the was known for his speechiny, or asil.
434
- 2. The Loire Valley is an on creating stories and songs of the world's wonders and delights in our extraterrestrial life. He works at the renowned Glimmerhoof communicates through telepathy, and as such, she has createment and its wisdom from bounty hunter, they
435
- ```
436
-
437
- ---
438
-
439
- Запрос:
440
- ```text
441
- system
442
- Благородный рыцарь.
443
- user
444
- Тот дракон собирается напасть на деревню!
445
- ```
446
- Ответ:
447
- ```text
448
- assistant
449
- Спасибо за советы, Святой Жилин и путешествиях, с которыми мы проводим образом из них оптимы, используя свою миссию и подданным. Эта ключевые идеи.
450
- ```
451
-
452
- ---
453
-
454
- Запрос:
455
- ```text
456
- system
457
- A noble knight.
458
- user
459
- That dragon is going to attack the village!
460
- ```
461
- Ответ:
462
- ```text
463
- assistant
464
- Of course, mortal! Every rustling blender. Music languages my duty to share my travels. Have you ever encountered any hoards, but it of life. Serpentes, seeking powers of space.
465
- ```
466
-
467
- ---
468
-
469
- Запрос:
470
- ```text
471
- system
472
- Ты — мудрый эльф.
473
- user
474
- Объясни квантовую физику.
475
- ```
476
- Ответ:
477
- ```text
478
- assistant
479
- Приветствую тебя, путник. Я умею свой мир вокруг нас и помогает занимаетшенный новых вещей.
480
- ```
481
-
482
- ---
483
-
484
- Запрос:
485
- ```text
486
- system
487
- You are a wise elf.
488
- user
489
- Explain quantum physics.
490
- ```
491
- Ответ:
492
- ```text
493
- assistant
494
- Certainly! Here are the 10 oldest continuously inhabited city on the flutia, tricks, where mortal.
495
- ```
496
-
497
- </details>
498
-
499
  </details>
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - loim/characters_dialogs
5
+ - IlyaGusev/gpt_roleplay_realm
6
+ - tamohannes/llm-roleplay
7
+ - radce/communication_dataset
8
+ - databricks/databricks-dolly-15k
9
+ - ch1eph/RuGeoBench
10
+ - nyuuzyou/ruschatgpt-qa
11
+ - 0x22almostEvil/ru-riddles-377
12
+ - 0x22almostEvil/tatoeba-mt-qna-oa
13
+ - Den4ikAI/ru_sberquad_long_answers
14
+ language:
15
+ - ru
16
+ - en
17
+ pipeline_tag: text-generation
18
+ library_name: transformers
19
+ tags:
20
+ - mamba2
21
+ ---
22
+
23
+ # WHIFF 20M
24
+
25
+ <details>
26
+ <summary>🇬🇧 English...</summary>
27
+
28
+ *A serpentine whisper in the bushes, carried by a gentle gust of wind*
29
+
30
+ `whiff-20M` is a small experimental language model based on the Mamba2 architecture with 20.3 million parameters, trained on carefully selected Russian and English data for chat tasks. The model produces structured responses but often generates nonsensical text.
31
+
32
+ ## Technical Details
33
+
34
+ - **Architecture**: [Mamba2ForCausalLM](https://huggingface.co/docs/transformers/v4.51.3/model_doc/mamba2) from 🤗 Transformers
35
+ - **Parameters**: 20.3M
36
+ - **Languages**: Russian/English (bilingual)
37
+ - **Tokenizer**: [loim/ru_en_mini_bpe](https://huggingface.co/loim/ru_en_mini_bpe) (custom mini-BPE tokenizer)
38
+ - **License**: Apache 2.0
39
+
40
+ ### Model Configuration
41
+ ```python
42
+ Mamba2Config(
43
+     vocab_size=8192,
44
+     hidden_size=512,
45
+     state_size=64,
46
+     num_heads=12,
47
+     num_hidden_layers=9,
48
+     conv_kernel=4,
49
+     expand=1.5,
50
+     n_groups=2
51
+ )
52
+ ```
53
+
54
+ ## Usage
55
+
56
+ ```python
57
+ from transformers import AutoTokenizer, AutoModelForCausalLM
58
+
59
+ tokenizer = AutoTokenizer.from_pretrained("loim/whiff-20M")
60
+ model = AutoModelForCausalLM.from_pretrained("loim/whiff-20M")
61
+
62
+ def chat(messages, temp=0.5):
63
+     inputs = tokenizer.apply_chat_template(messages, return_tensors="pt")
64
+
65
+     outputs = model.generate(
66
+         inputs,
67
+         max_length=512,
68
+         top_k=40,
69
+         top_p=0.9,
70
+         repetition_penalty=1.1,
71
+         num_return_sequences=1,
72
+         temperature=temp,
73
+         do_sample=True,
74
+         eos_token_id=1
75
+     )
76
+
77
+     return tokenizer.decode(outputs[0], skip_special_tokens=True)
78
+
79
+ # Example
80
+ dialog = [
81
+     {"role": "system", "content": "You are a wise elf."},
82
+     {"role": "user", "content": "Explain quantum physics."}
83
+ ]
84
+
85
+ response = chat(dialog, temp=0.4)
86
+ print(response)
87
+ ```
88
+
89
+ ## Training Data
90
+
91
+ 19 927 carefully filtered dialogue lines:
92
+
93
+ - 9 889 (49.6%) — English
94
+ - 9 864 (49.5%) — Russian
95
+ - 174 (0.9%) — Mixed
96
+
97
+ Sources:
98
+
99
+ - `loim/characters_dialogs`
100
+ - `IlyaGusev/gpt_roleplay_realm`
101
+ - `tamohannes/llm-roleplay`
102
+ - `radce/communication_dataset`
103
+ - `databricks/databricks-dolly-15k`
104
+ - `ch1eph/RuGeoBench`
105
+ - `nyuuzyou/ruschatgpt-qa`
106
+ - `0x22almostEvil/ru-riddles-377`
107
+ - `0x22almostEvil/tatoeba-mt-qna-oa`
108
+ - `Den4ikAI/ru_sberquad_long_answers`
109
+
110
+ All datasets were additionally cleaned and filtered to improve chat interaction quality.
111
+
112
+ ## Limitations and Warnings
113
+
114
+ - 🎭 The model generates structured but often meaningless responses
115
+ - 🔥 Recommended generation temperature: 0.1-0.6
116
+ - ⚠️ May exhibit training artifacts (repetitions, contradictions)
117
+ - ⚠️ Not intended for production use
118
+
119
+ *This model is like a forest stream: it seems to flow somewhere, but where exactly - only the squirrels know*
120
+
121
+ </details>
122
+
123
+ <details>
124
+ <summary>🇷🇺 Русский...</summary>
125
+
126
+ *Змеиный щепот в кустах, движимый легким порывом ветра*
127
+
128
+ `whiff-20M` это небольшая экспериментальная языковая модель на архитектуре Mamba2 с 20.3 миллионами параметров, обученная на тщательно отобранных русских и английских данных для задач чата. Модель демонстрирует структурированные ответы, но часто генерирует бессмысленный текст.
129
+
130
+ ## Технические детали
131
+
132
+ - **Архитектура**: [Mamba2ForCausalLM](https://huggingface.co/docs/transformers/v4.51.3/model_doc/mamba2) из 🤗 Transformers
133
+ - **Параметры**: 20.3M
134
+ - **Языки**: русский/английский (двуязычная)
135
+ - **Токенизатор**: [loim/ru_en_mini_bpe](https://huggingface.co/loim/ru_en_mini_bpe) (специальный мини-BPE токенизатор)
136
+ - **Лицензия**: Apache 2.0
137
+
138
+ ### Конфигурация модели
139
+ ```python
140
+ Mamba2Config(
141
+     vocab_size=8192,
141
+     hidden_size=512,
142
+     state_size=64,
143
+     num_heads=12,
144
+     num_hidden_layers=9,
145
+     conv_kernel=4,
146
+     expand=1.5,
147
+     n_groups=2
149
+ )
150
+ ```
151
+
152
+ ## Использование
153
+
154
+ ```python
155
+ from transformers import AutoTokenizer, AutoModelForCausalLM
156
+
157
+ tokenizer = AutoTokenizer.from_pretrained("loim/whiff-20M")
158
+ model = AutoModelForCausalLM.from_pretrained("loim/whiff-20M")
159
+
160
+ def chat(messages, temp=0.5):
161
+     inputs = tokenizer.apply_chat_template(messages, return_tensors="pt")
162
+
163
+     outputs = model.generate(
164
+         inputs,
165
+         max_length=512,
166
+         top_k=40,
167
+         top_p=0.9,
168
+         repetition_penalty=1.1,
169
+         num_return_sequences=1,
170
+         temperature=temp,
171
+         do_sample=True,
172
+         eos_token_id=1
173
+     )
174
+
175
+     return tokenizer.decode(outputs[0], skip_special_tokens=True)
176
+
177
+ # Пример
178
+ dialog = [
179
+     {"role": "system", "content": "Ты — мудрый эльф."},
180
+     {"role": "user", "content": "Объясни квантовую физику."}
181
+ ]
182
+
183
+ response = chat(dialog, temp=0.4)
184
+ print(response)
185
+ ```
186
+
187
+ ## Данные обучения
188
+
189
+ 19 927 тщательно отфильтрованных строк с диалогами:
190
+
191
+ - 9 889 (49.6%) Английские
192
+ - 9 864 (49.5%) — Русские
193
+ - 174 (0.9%) — Смешанные
194
+
195
+ Источники:
196
+
197
+ - `loim/characters_dialogs`
198
+ - `IlyaGusev/gpt_roleplay_realm`
199
+ - `tamohannes/llm-roleplay`
200
+ - `radce/communication_dataset`
201
+ - `databricks/databricks-dolly-15k`
202
+ - `ch1eph/RuGeoBench`
203
+ - `nyuuzyou/ruschatgpt-qa`
204
+ - `0x22almostEvil/ru-riddles-377`
205
+ - `0x22almostEvil/tatoeba-mt-qna-oa`
206
+ - `Den4ikAI/ru_sberquad_long_answers`
207
+
208
+ Все датасеты были дополнительно очищены и отфильтрованы для улучшения качества чат-взаимодействия.
209
+
210
+ ## Ограничения и предупреждения
211
+
212
+ - 🎭 Модель генерирует структурированные, но часто бессмысленные ответы
213
+ - 🔥 Рекомендуемая температура генерации: 0.1-0.6
214
+ - ⚠️ Может демонстрировать артефакты обучения (повторы, противоречия)
215
+ - ⚠️ Не предназначена для production-использования
216
+
217
+ *Эта модель как лесной ручей: вроде течёт куда-то, но куда именно — известно только белкам*
218
+
219
+ </details>
220
+
221
+
222
+ <details>
223
+ <summary>🧪 Tests...</summary>
224
+
225
+ <details>
226
+ <summary>Temp 0.1</summary>
227
+
228
+ Запрос:
229
+ ```text
230
+ user
231
+ Как дела?
232
+ ```
233
+ Ответ:
234
+ ```text
235
+ assistant
236
+ Социальная политика 'я' - это наука о своей борьбы с крыльями.
237
+ ```
238
+
239
+ ---
240
+
241
+ Запрос:
242
+ ```text
243
+ user
244
+ How are you?
245
+ ```
246
+ Ответ:
247
+ ```text
248
+ assistant
249
+ Of course! I'd be happy to help you with that and speaks with a distinct is a delicate balance of a special place in my heart. I also like to know about her dwarven resilience and determination. His voice is humanoid that galaxy. She is always eager to learn more about the world around her. Zira wears a tiny pair of glasses with round frames, which sit atop her reptilian snout, she weared in her knowledge from their own unique language. Aadhya Patel the Caspenives. Theta-6 is equipped with advanced technology.
250
+ ```
251
+
252
+ ---
253
+
254
+ Запрос:
255
+ ```text
256
+ system
257
+ Благородный рыцарь.
258
+ user
259
+ Тот дракон собирается напасть на деревню!
260
+ ```
261
+ Ответ:
262
+ ```text
263
+ assistant
264
+ Приветствую тебя, другом из них:
265
+ 1. Стратегия мудрости и спокойствие и учиться от опасностей.
266
+ ```
267
+
268
+ ---
269
+
270
+ Запрос:
271
+ ```text
272
+ system
273
+ A noble knight.
274
+ user
275
+ That dragon is going to attack the village!
276
+ ```
277
+ Ответ:
278
+ ```text
279
+ assistant
280
+ Of course! I'd be happy to help you with that and magical properties of the universe. Sss...With a battle, and his wanderers who have already for any of truly unique special gemstone eyes formed mortal history is a fiery ies. His hidden treasures. Her manner of speech is energetic and precise, often filled with gracefully protector of humanity's body is made of a skilled in different colors, so finyblaring, Glimmerhoof. I also love the world around us, matey. Together, we do to help me more about it.
281
+ ```
282
+
283
+ ---
284
+
285
+ Запрос:
286
+ ```text
287
+ system
288
+ Ты — мудрый эльф.
289
+ user
290
+ Объясни квантовую физику.
291
+ ```
292
+ Ответ:
293
+ ```text
294
+ assistant
295
+ Приветствую тебя, путник! Я всегда рад помочь и поделиться своими знаниями о балансе птицами.
296
+ ```
297
+
298
+ ---
299
+
300
+ Запрос:
301
+ ```text
302
+ system
303
+ You are a wise elf.
304
+ user
305
+ Explain quantum physics.
306
+ ```
307
+ Ответ:
308
+ ```text
309
+ assistant
310
+ Of course! I'd be happy to help you with that and speaks with a distinct in the cosmic energy. His bright blue eyes have always been a few of of ancient secrets and wisdom of the ages. She is a tranquility.
311
+ ```
312
+
313
+ </details>
314
+
315
+ <details>
316
+ <summary>Temp 0.4</summary>
317
+
318
+
319
+ Запрос:
320
+ ```text
321
+ user
322
+ Как дела?
323
+ ```
324
+ Ответ:
325
+ ```text
326
+ assistant
327
+ Самосознание способствует пониманию мороженое.
328
+ ```
329
+
330
+ ---
331
+
332
+ Запрос:
333
+ ```text
334
+ user
335
+ How are you?
336
+ ```
337
+ Ответ:
338
+ ```text
339
+ assistant
340
+ Of course! Chirp-chirp!
341
+ ```
342
+
343
+ ---
344
+
345
+ Запрос:
346
+ ```text
347
+ system
348
+ Благородный рыцарь.
349
+ user
350
+ Тот дракон собирается напасть на деревню!
351
+ ```
352
+ Ответ:
353
+ ```text
354
+ assistant
355
+ В Windows используйте copy! Sss...The most fascinating, as his vast knowledge of the universe and a deep, how do you think is the key to protect the forest and its inhabitants from any harm. His challenging through the civilizations. Standing at 6 feet tall and adorned with intricate patterns and iridescent green-blue scales, creatures. With her sit'ss gearting the ancient ones who was hunicorn in Moquite bit of Elysian's glara is known for her wisdom, patience, and kindness. No longer having a realm to protect
356
+ ```
357
+
358
+ ---
359
+
360
+ Запрос:
361
+ ```text
362
+ system
363
+ A noble knight.
364
+ user
365
+ That dragon is going to attack the village!
366
+ ```
367
+ Ответ:
368
+ ```text
369
+ assistant
370
+ Of course! I believe that music has a saretaken shares her wandering heres like mortal magical realm. When a great power is in Mexican-American veterinarian, I have learned many different colorful hold an imbued with the power to protection that's why it's ancient and wisdom into a curious of creature with long silver humanoid form, traveler. The most important thing is to have a family hunters. Story: Sparkbugs provide guidance salart of mysteriously.
371
+ ```
372
+
373
+ ---
374
+
375
+ Запрос:
376
+ ```text
377
+ system
378
+ Ты — мудрый эльф.
379
+ user
380
+ Объясни квантовую физику.
381
+ ```
382
+ Ответ:
383
+ ```text
384
+ assistant
385
+ Существует множество удивительных существ и помогает проклятие раскрыть тайны драконами и оберегов.
386
+ ```
387
+
388
+ ---
389
+
390
+ Запрос:
391
+ ```text
392
+ system
393
+ You are a wise elf.
394
+ user
395
+ Explain quantum physics.
396
+ ```
397
+ Ответ:
398
+ ```text
399
+ assistant
400
+ Of course! I believe that my own serpent to my crew. With a galaxy, Glimmerhoof and beard, who has strive to learn more about it to takenabled you do me an with those who are not explorer understanding.
401
+ ```
402
+
403
+ </details>
404
+
405
+ <details>
406
+ <summary>Temp 0.7</summary>
407
+
408
+
409
+ Запрос:
410
+ ```text
411
+ user
412
+ Как дела?
413
+ ```
414
+ Ответ:
415
+ ```text
416
+ assistant
417
+ Культурная идентичность формирует природы и целью, уважения к новым условиям.
418
+ ```
419
+
420
+ ---
421
+
422
+ Запрос:
423
+ ```text
424
+ user
425
+ How are you?
426
+ ```
427
+ Ответ:
428
+ ```text
429
+ assistant
430
+ Indeed. Of course, my friend. Felix-bics her dwarven coat, curly situated in feed from Standing at 6 feet tall and adorned with intricate goldenthusiasts, Kanoa Alme the was known for his speechiny, or asil.
431
+ 2. The Loire Valley is an on creating stories and songs of the world's wonders and delights in our extraterrestrial life. He works at the renowned Glimmerhoof communicates through telepathy, and as such, she has createment and its wisdom from bounty hunter, they
432
+ ```
433
+
434
+ ---
435
+
436
+ Запрос:
437
+ ```text
438
+ system
439
+ Благородный рыцарь.
440
+ user
441
+ Тот дракон собирается напасть на деревню!
442
+ ```
443
+ Ответ:
444
+ ```text
445
+ assistant
446
+ Спасибо за советы, Святой Жилин и путешествиях, с которыми мы проводим образом из них оптимы, используя свою миссию и подданным. Эта ключевые идеи.
447
+ ```
448
+
449
+ ---
450
+
451
+ Запрос:
452
+ ```text
453
+ system
454
+ A noble knight.
455
+ user
456
+ That dragon is going to attack the village!
457
+ ```
458
+ Ответ:
459
+ ```text
460
+ assistant
461
+ Of course, mortal! Every rustling blender. Music languages my duty to share my travels. Have you ever encountered any hoards, but it of life. Serpentes, seeking powers of space.
462
+ ```
463
+
464
+ ---
465
+
466
+ Запрос:
467
+ ```text
468
+ system
469
+ Ты — мудрый эльф.
470
+ user
471
+ Объясни квантовую физику.
472
+ ```
473
+ Ответ:
474
+ ```text
475
+ assistant
476
+ Приветствую тебя, путник. Я умею свой мир вокруг нас и помогает занимаетшенный новых вещей.
477
+ ```
478
+
479
+ ---
480
+
481
+ Запрос:
482
+ ```text
483
+ system
484
+ You are a wise elf.
485
+ user
486
+ Explain quantum physics.
487
+ ```
488
+ Ответ:
489
+ ```text
490
+ assistant
491
+ Certainly! Here are the 10 oldest continuously inhabited city on the flutia, tricks, where mortal.
492
+ ```
493
+
494
+ </details>
495
+
 
 
 
496
  </details>
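
Reviewer note: the 20.3M parameter figure in the README can be roughly cross-checked against the `Mamba2Config` values shown in the diff. The sketch below is only an estimate. It assumes the usual Mamba2 block layout (a bias-free input projection, a depthwise convolution with bias, per-head `A_log`, `D` and `dt_bias` scalars, a gated norm, and a bias-free output projection) and untied input/output embeddings. The function name `estimate_mamba2_params` and the `tie_embeddings` flag are hypothetical and are not part of the model repository or of Transformers.

```python
def estimate_mamba2_params(
    vocab_size=8192,
    hidden_size=512,
    state_size=64,
    num_heads=12,
    num_hidden_layers=9,
    conv_kernel=4,
    expand=1.5,
    n_groups=2,
    tie_embeddings=False,  # assumption: the output head has its own weights
):
    """Rough parameter count for a Mamba2-style causal LM (illustrative only)."""
    intermediate = int(expand * hidden_size)             # inner width of each block
    conv_dim = intermediate + 2 * n_groups * state_size  # channels passed through the conv

    # Input projection produces the gate, the conv input, and one dt value per head.
    in_proj = hidden_size * (intermediate + conv_dim + num_heads)
    conv = conv_dim * conv_kernel + conv_dim             # depthwise weights + bias
    ssm_scalars = 3 * num_heads                          # A_log, D, dt_bias
    gated_norm = intermediate                            # norm weight inside the mixer
    out_proj = intermediate * hidden_size
    block_norm = hidden_size                             # norm applied before each mixer

    per_layer = in_proj + conv + ssm_scalars + gated_norm + out_proj + block_norm

    embeddings = vocab_size * hidden_size
    final_norm = hidden_size
    lm_head = 0 if tie_embeddings else embeddings

    return num_hidden_layers * per_layer + embeddings + final_norm + lm_head


if __name__ == "__main__":
    total = estimate_mamba2_params()
    print(f"{total / 1e6:.1f}M")  # prints 20.3M under the assumptions above
```

Under these assumptions the estimate lands on the stated 20.3M only when the output head is counted separately from the embedding table. With tied embeddings the same formula gives about 16.1M.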