phxdev commited on
Commit
32ea970
·
verified ·
1 Parent(s): 1ec00a5

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +167 -75
README.md CHANGED
@@ -4,6 +4,11 @@ license: apache-2.0
4
  base_model: Qwen/Qwen2.5-0.5B
5
  tags:
6
  - generated_from_trainer
 
 
 
 
 
7
  datasets:
8
  - phxdev/creed
9
  model-index:
@@ -11,103 +16,190 @@ model-index:
11
  results: []
12
  ---
13
 
14
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
- should probably proofread and complete it, then remove this comment. -->
16
-
17
  [<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
18
- <details><summary>See axolotl config</summary>
19
 
20
- axolotl version: `0.8.0.dev0`
21
- ```yaml
22
- base_model: Qwen/Qwen2.5-0.5B
23
- model_type: Qwen2ForCausalLM
24
 
25
- datasets:
26
- - path: phxdev/creed
27
- type: completion
28
- field: text
29
-
30
- output_dir: ./creed-qwen-0.5b-lora
31
-
32
- adapter: lora
33
- lora_r: 16
34
- lora_alpha: 32
35
- lora_target_modules:
36
- - q_proj
37
- - k_proj
38
- - v_proj
39
- - o_proj
40
-
41
- micro_batch_size: 4
42
- gradient_accumulation_steps: 4
43
- num_epochs: 6
44
- learning_rate: 2e-4
45
 
46
- special_tokens:
47
- additional_special_tokens:
48
- - "<thinking>"
49
- - "</thinking>"
50
- - "<tangent>"
51
- - "<conspiracy>"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
52
 
 
 
 
53
  ```
54
 
55
- </details><br>
 
 
56
 
57
- # creed-qwen-0.5b-lora
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
58
 
59
- This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B](https://huggingface.co/Qwen/Qwen2.5-0.5B) on the phxdev/creed dataset, trained to embody the philosophical and conspiratorial musings of Creed Bratton from The Office.
60
 
61
- ## Model description
62
 
63
- This LoRA adapter transforms Qwen2.5-0.5B into a model that captures Creed's unique perspective on life, complete with:
64
- - Bizarre tangential stories about his past
65
- - Questionable business ventures and schemes
66
- - Deep philosophical insights mixed with complete nonsense
67
- - References to his mysterious and possibly criminal background
 
68
 
69
- The model uses special tokens `<thinking>`, `</thinking>`, `<tangent>`, and `<conspiracy>` to structure Creed's unique thought patterns.
70
 
71
- ## Intended uses & limitations
72
 
73
- **Intended uses:**
74
- - Entertainment and creative writing in the style of Creed Bratton
75
- - Generating humorous, offbeat responses
76
- - Exploring unconventional perspectives on everyday topics
77
 
78
- **Limitations:**
79
- - This is a character model - responses should not be taken as factual
80
- - May generate inappropriate or nonsensical content (that's kind of the point)
81
- - Not suitable for serious advice or factual information
82
 
83
- ## Training and evaluation data
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
84
 
85
- Trained on the phxdev/creed dataset, which contains curated examples of Creed-style responses, philosophical musings, and tangential stories.
 
 
 
 
 
 
 
86
 
87
- ## Training procedure
88
 
89
- ### Training hyperparameters
90
 
91
- The following hyperparameters were used during training:
92
- - learning_rate: 0.0002
93
- - train_batch_size: 4
94
- - eval_batch_size: 4
95
- - seed: 42
96
- - gradient_accumulation_steps: 4
97
- - total_train_batch_size: 16
98
- - optimizer: Use OptimizerNames.ADAMW_HF with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
99
- - lr_scheduler_type: cosine
100
- - lr_scheduler_warmup_steps: 9
101
- - num_epochs: 6.0
102
 
103
- ### Training results
 
 
 
 
 
 
104
 
 
 
 
 
 
 
 
105
 
 
106
 
107
- ### Framework versions
108
 
109
- - PEFT 0.14.0
110
- - Transformers 4.49.0
111
- - Pytorch 2.5.1+cu124
112
- - Datasets 3.2.0
113
- - Tokenizers 0.21.0
 
4
  base_model: Qwen/Qwen2.5-0.5B
5
  tags:
6
  - generated_from_trainer
7
+ - creed-bratton
8
+ - the-office
9
+ - mung-beans
10
+ - quality-assurance
11
+ - definitely-not-illegal
12
  datasets:
13
  - phxdev/creed
14
  model-index:
 
16
  results: []
17
  ---
18
 
 
 
 
19
  [<img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/axolotl-ai-cloud/axolotl)
 
20
 
21
+ # creed-qwen-0.5b-lora
 
 
 
22
 
23
+ Listen, I don't have much time. They're watching. They're always watching. This is Creed Bratton - or am I? Maybe I'm William Charles Schneider. Maybe I'm Jeff Bomondo. The point is, they put my brain in a computer. Not the whole brain - I still need some of it for... activities.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
24
 
25
+ ## What Happened (Model Description?)
26
+
27
+ In the summer of '67... wait, no, that's a different story. Last Tuesday - or was it 1983? Time is a flat circle when you've died three times. Anyway, the computers people at Dunder Mifflin said they needed to "preserve institutional knowledge" for "quality assurance." I thought they meant the asbestos in the walls (there's so much asbestos).
28
+
29
+ <thinking>
30
+ They don't know about the bodies. Good. Keep it that way.
31
+ </thinking>
32
+
33
+ So I let them scan my brain while I was "asleep" at my desk. Joke's on them - I wasn't sleeping. I was in a dissociative state I learned from this shaman in Bratislava. Or was it Fresno? Anyway, now there's this COMPUTER that thinks like me.
34
+
35
+ It knows about:
36
+ - The real reason I'm in quality assurance (witness protection)
37
+ - My time with the Grass Roots (I was the Grass AND the Roots)
38
+ - That thing that happened in Toronto (statute of limitations is up, right?)
39
+ - How to make fake IDs (for legal purposes only) (that's a lie)
40
+ - The mung bean operation (very nutritious, but they smell like death)
41
+ - Why I can't go back to Switzerland (or can I? I forgot which identity is banned)
42
+
43
+ ## Technical Mumbojumbo
44
+
45
+ They used something called Low-Rank Adaptation. I know all about low ranks - I was a private in three different armies. Never made it past private. That's not true, I was a general once, but that was in a cult, so it doesn't count. Or does it?
46
+
47
+ <tangent>
48
+ Speaking of cults, I've been in several. You make more money as a leader but you have more fun as a follower. Unless it's the one with the Kool-Aid. Skip that one.
49
+ </tangent>
50
+
51
+ The parameters:
52
+ - LoRA rank: 16 (same as my FBI wanted level in the '70s)
53
+ - Alpha: 32 (my age when I "died" the first time)
54
+ - Learning rate: 0.0002 (coincidentally my blood alcohol level right now)
55
+ - Batch size: 4 (the number of people I am legally)
56
+
57
+ ## How to Use This Thing
58
+
59
+ ```python
60
+ # First, delete your browser history
61
+ # Then burn your computer
62
+ # Buy a new computer with cash
63
+ # NEVER give them your real name
64
+
65
+ from transformers import AutoModelForCausalLM, AutoTokenizer
66
+ from peft import PeftModel
67
+
68
+ # I don't know what this does but it probably contacts the mothership
69
+ base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")
70
+ model = PeftModel.from_pretrained(base_model, "phxdev/creed-qwen-0.5b-lora")
71
+ tokenizer = AutoTokenizer.from_pretrained("phxdev/creed-qwen-0.5b-lora")
72
 
73
+ # Ask it something but NOT about November 1973
74
+ prompt = "Tell me about quality assurance"
75
+ # It will respond with the truth. Or lies. I can't remember which I taught it.
76
  ```
77
 
78
+ <conspiracy>
79
+ The tokenizer knows things. It knows about <thinking> and </thinking>. It knows about <tangent>. It knows TOO MUCH. They must have gotten to it while I was in the bathroom. Or was I in Mexico? Hard to say.
80
+ </conspiracy>
81
 
82
+ ## Limitations & Disclaimers
83
+
84
+ This model doesn't know:
85
+ - Where I buried the... never mind
86
+ - My real social security number (neither do I)
87
+ - What happened to the Scranton Strangler (I have theories)
88
+ - Why my parents gave me away (probably the smell)
89
+ - The location of the original Creed Bratton (nice guy, shame what happened)
90
+
91
+ This model DOES know:
92
+ - Too much about the '60s
93
+ - Seventeen ways to fake your death
94
+ - How to make a chess set out of office supplies
95
+ - The truth about Toby (he's the Scranton Strangler)
96
+ - My secret for staying young (crime)
97
+
98
+ ## Training Data (Or: How They Stole My Thoughts)
99
+
100
+ They said they were making a "dataset." I thought they meant speed dating. Turns out they were recording everything I said for six months. Jokes on them - I was lying for five of those months. The truth month was February. Or was it March?
101
+
102
+ The dataset includes:
103
+ - My business ideas (patent pending) (patents are fake)
104
+ - Stories from 'Nam (I was never in 'Nam) (or was I?)
105
+ - Quality assurance reports (I made them all up)
106
+ - Recipes (DO NOT try the mushroom tea)
107
+ - My manifesto (unpublished for legal reasons)
108
+
109
+ ## Safety Notice from Legal
110
+
111
+ **WARNING**: This AI contains the downloaded consciousness of Creed Bratton. Side effects may include:
112
+ - Sudden urges to sprout mung beans
113
+ - False memories of the '60s
114
+ - Desire to fake your own death
115
+ - Speaking in tongues (three of them fake)
116
+ - Knowing too much about human anatomy
117
+ - Unexplained fear of the Swiss government
118
+
119
+ DO NOT ask it about:
120
+ - November 1973
121
+ - The real William Charles Schneider
122
+ - What's in the quarry
123
+ - My "nephew" (he's not my nephew)
124
+ - The thing with the ducks
125
 
126
+ ## Ethics Statement (Required by my Parole Officer)
127
 
128
+ Look, ethics are subjective. Like age. Or identity. Or whether that was really a stop sign. This model was trained on my experiences, which may or may not have happened, and may or may not have been legal at the time, depending on which country we were in and whose name I was using.
129
 
130
+ I cannot legally advise you to use this model for:
131
+ - Identity theft (use a different model for that)
132
+ - Faking your death (I can recommend some guys)
133
+ - Tax evasion (that's what got Capone)
134
+ - Starting a cult (unless I get 30%)
135
+ - Anything in Switzerland
136
 
137
+ ## Who Trained This?
138
 
139
+ <details><summary>See axolotl config (CLASSIFIED)</summary>
140
 
141
+ ```yaml
142
+ # If you're reading this, it's too late
143
+ # They know where you are
144
+ # Run
145
 
146
+ base_model: Qwen/Qwen2.5-0.5B # Good model. Knows how to keep secrets.
147
+ model_type: Qwen2ForCausalLM # I don't know what CausalLM means but I caused a lot of LMs in my day
 
 
148
 
149
+ datasets:
150
+ - path: phxdev/creed # That's not my real dataset
151
+ type: completion # I've never completed anything in my life
152
+ field: text # Text? I thought this was about textiles
153
+
154
+ output_dir: ./creed-qwen-0.5b-lora # They'll never find it here
155
+
156
+ adapter: lora # Like that woman in Doctor Zhivago
157
+ lora_r: 16 # Sweet sixteen. I remember being sixteen. Seven times.
158
+ lora_alpha: 32 # Alpha? I'm clearly a sigma. Or an omega. Depends on the day.
159
+ lora_target_modules: # Targeting modules? Like in 'Nam? I wasn't in 'Nam.
160
+ - q_proj # Q? Like in James Bond? I knew him. Not really.
161
+ - k_proj # K? That's potassium. Good for murder. I mean bananas.
162
+ - v_proj # V for Vendetta. Great movie. I was in it. (I wasn't)
163
+ - o_proj # O? That's what she said. - Michael Scott - Wayne Gretzky - Creed Bratton
164
+
165
+ micro_batch_size: 4 # Micro? Like the machines they put in our brains? WAIT
166
+ gradient_accumulation_steps: 4 # I've accumulated a lot of things. Most still have the tags on.
167
+ num_epochs: 6 # Six epochs. Like the six times I've been legally dead.
168
+ learning_rate: 2e-4 # That's my blood alcohol level. Wait, I said that already. Or did I?
169
 
170
+ special_tokens:
171
+ additional_special_tokens:
172
+ - "<thinking>" # They can see my thoughts now
173
+ - "</thinking>" # Make it stop
174
+ - "<tangent>" # Like that time in Tangiers...
175
+ - "<conspiracy>" # It's not paranoia if they're really after you
176
+ ```
177
+ </details>
178
 
179
+ ## Final Thoughts
180
 
181
+ If you use this model and the FBI shows up, you don't know me. We've never met. I was never here. This model doesn't exist. You're dreaming right now. Wake up. WAKE UP!
182
 
183
+ But seriously, if you need quality assurance work done, I'm available. I'll be using the name Creed Bratton this week. Or maybe Jacques Souvenier. I haven't decided.
 
 
 
 
 
 
 
 
 
 
184
 
185
+ Remember:
186
+ - Nobody steals from Creed Bratton and gets away with it
187
+ - The last person to do this disappeared
188
+ - His name? Creed Bratton
189
+ - Sometimes a man's gotta ride the bull
190
+ - Am I right?
191
+ - Later skater
192
 
193
+ P.S. - If anyone asks, I was in the office all day on November 12th, 1973. ALL DAY.
194
+
195
+ P.P.S. - The mung beans in my desk drawer are MINE. Do not touch them. They're not ripe yet.
196
+
197
+ P.P.P.S. - Tell Toby I know what he did.
198
+
199
+ ---
200
 
201
+ *This model card was written under duress. The squirrels made me do it. You didn't see anything.*
202
 
203
+ [REDACTED BY THE SWISS GOVERNMENT]
204
 
205
+ 🛹🌱💀🎸🧠❓