deltakitsune commited on
Commit
d675356
·
verified ·
1 Parent(s): f5b6ac8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +74 -69
README.md CHANGED
@@ -1,70 +1,75 @@
1
- ---
2
- library_name: transformers
3
- pipeline_tag: text-generation
4
- language:
5
- - en
6
- tags:
7
- - kitsune-training-suite
8
- - lora
9
- - causal-lm
10
- - text-generation
11
- - properly-e4-91e3-2
12
- - custom-dataset
13
- license: other
14
- ---
15
-
16
- # Properly-E4-91E3-2
17
-
18
- ## Summary
19
-
20
- - Training run: #95
21
- - Base model: `Local artifact (path omitted)`
22
- - Artifact: `Local artifact (path omitted)`
23
- - Status: completed
24
- - Started: 2026-04-30T04:00:32.996087+00:00
25
- - Finished: 2026-04-30T09:22:44.290944+00:00
26
- - Final loss: 0.8404612632730544
27
- - Final accuracy: N/A - token accuracy not logged for this run type
28
-
29
- ## Training Configuration
30
-
31
- - attn_implementation: `eager`
32
- - batch_size: `1`
33
- - epochs: `1`
34
- - grad_accum: `16`
35
- - learning_rate: `5e-5`
36
- - lora_alpha: `32`
37
- - lora_dropout: `0.05`
38
- - lora_rank: `16`
39
- - max_grad_norm: `1`
40
- - max_seq: `512`
41
- - system_prompt_override: `You are Properly a helpful assistant. You fix grammar, spelling, and clarity. Preserve the author's voice. Return only the corrected text. No explanations. No commentary. No emojis or hashtags.`
42
- - target_examples: `50000`
43
-
44
- ## Dataset Configuration
45
-
46
- - Dataset Mix | 3 sources | seed 42 (mixture; 35,000 rows)
47
-
48
- ## Recent Training Metrics
49
-
50
- Accuracy is marked `N/A` because this run type logs causal language-model loss, not a publishable evaluation accuracy.
51
-
52
- | Step | Loss | Accuracy | LR | Epoch | Timestamp |
53
- | --- | ---: | ---: | ---: | ---: | --- |
54
- | 2130 / 2188 | 0.8876 | - | - | 0.97 | 2026-04-30T09:14:13.929849+00:00 |
55
- | 2140 / 2188 | 0.8566 | - | - | 0.98 | 2026-04-30T09:15:41.870227+00:00 |
56
- | 2150 / 2188 | 0.8243 | - | - | 0.98 | 2026-04-30T09:17:06.787695+00:00 |
57
- | 2160 / 2188 | 0.8341 | - | - | 0.99 | 2026-04-30T09:18:34.679929+00:00 |
58
- | 2170 / 2188 | 0.8131 | - | - | 0.99 | 2026-04-30T09:20:01.712890+00:00 |
59
- | 2180 / 2188 | 0.8141 | - | - | 1.0 | 2026-04-30T09:21:24.802808+00:00 |
60
- | 2188 / 2188 | 0.8404612632730544 | - | - | - | 2026-04-30T09:22:42.250939+00:00 |
61
-
62
- ## Deployment History
63
-
64
- | Target | Reference | Status | Created |
65
- | --- | --- | --- | --- |
66
- | huggingface | `deltakitsune/properly` | completed | 2026-05-01T01:13:15.059197+00:00 |
67
-
68
- ## Notes
69
-
 
 
 
 
 
70
  Generated by Kitsune Training Suite. Review limitations, intended use, safety notes, and licensing before publishing.
 
1
+ ---
2
+ library_name: transformers
3
+ pipeline_tag: text-generation
4
+ language:
5
+ - en
6
+ tags:
7
+ - kitsune-training-suite
8
+ - lora
9
+ - causal-lm
10
+ - text-generation
11
+ - properly-e4-91e3-2
12
+ - custom-dataset
13
+ license: other
14
+ datasets:
15
+ - deltakitsune/properly-v1.01
16
+ - deltakitsune/properly-v1.02
17
+ - deltakitsune/properly-v1.03
18
+ - deltakitsune/properly-v1.04
19
+ ---
20
+
21
+ # Properly-E4-91E3-2
22
+
23
+ ## Summary
24
+
25
+ - Training run: #95
26
+ - Base model: `Local artifact (path omitted)`
27
+ - Artifact: `Local artifact (path omitted)`
28
+ - Status: completed
29
+ - Started: 2026-04-30T04:00:32.996087+00:00
30
+ - Finished: 2026-04-30T09:22:44.290944+00:00
31
+ - Final loss: 0.8404612632730544
32
+ - Final accuracy: N/A - token accuracy not logged for this run type
33
+
34
+ ## Training Configuration
35
+
36
+ - attn_implementation: `eager`
37
+ - batch_size: `1`
38
+ - epochs: `1`
39
+ - grad_accum: `16`
40
+ - learning_rate: `5e-5`
41
+ - lora_alpha: `32`
42
+ - lora_dropout: `0.05`
43
+ - lora_rank: `16`
44
+ - max_grad_norm: `1`
45
+ - max_seq: `512`
46
+ - system_prompt_override: `You are Properly a helpful assistant. You fix grammar, spelling, and clarity. Preserve the author's voice. Return only the corrected text. No explanations. No commentary. No emojis or hashtags.`
47
+ - target_examples: `50000`
48
+
49
+ ## Dataset Configuration
50
+
51
+ - Dataset Mix | 3 sources | seed 42 (mixture; 35,000 rows)
52
+
53
+ ## Recent Training Metrics
54
+
55
+ Accuracy is marked `N/A` because this run type logs causal language-model loss, not a publishable evaluation accuracy.
56
+
57
+ | Step | Loss | Accuracy | LR | Epoch | Timestamp |
58
+ | --- | ---: | ---: | ---: | ---: | --- |
59
+ | 2130 / 2188 | 0.8876 | - | - | 0.97 | 2026-04-30T09:14:13.929849+00:00 |
60
+ | 2140 / 2188 | 0.8566 | - | - | 0.98 | 2026-04-30T09:15:41.870227+00:00 |
61
+ | 2150 / 2188 | 0.8243 | - | - | 0.98 | 2026-04-30T09:17:06.787695+00:00 |
62
+ | 2160 / 2188 | 0.8341 | - | - | 0.99 | 2026-04-30T09:18:34.679929+00:00 |
63
+ | 2170 / 2188 | 0.8131 | - | - | 0.99 | 2026-04-30T09:20:01.712890+00:00 |
64
+ | 2180 / 2188 | 0.8141 | - | - | 1.0 | 2026-04-30T09:21:24.802808+00:00 |
65
+ | 2188 / 2188 | 0.8404612632730544 | - | - | - | 2026-04-30T09:22:42.250939+00:00 |
66
+
67
+ ## Deployment History
68
+
69
+ | Target | Reference | Status | Created |
70
+ | --- | --- | --- | --- |
71
+ | huggingface | `deltakitsune/properly` | completed | 2026-05-01T01:13:15.059197+00:00 |
72
+
73
+ ## Notes
74
+
75
  Generated by Kitsune Training Suite. Review limitations, intended use, safety notes, and licensing before publishing.