waddie committed on
Commit 2999cfb · verified · 1 Parent(s): 75b004c

Update README.md

  1. README.md +2 -15
README.md CHANGED
@@ -1,7 +1,6 @@
  ---
  library_name: transformers
  tags:
- - discord
  - human-style
  - conversational
  - qwen
@@ -14,7 +13,7 @@ license: apache-2.0
 
  # CloudWaddie Mini 1.0
 
- This model is a fine-tuned version of `Qwen2.5-0.5B-Instruct` designed to mimic the specific conversational rhythm, slang, and technical jargon of an AI-centric Discord community.
+ This model is a fine-tuned version of `Qwen2.5-0.5B-Instruct` designed to mimic the specific conversational rhythm, slang, and technical jargon of a human,
 
  ## Model Details
 
@@ -61,16 +60,4 @@ outputs = model.generate(
  eos_token_id=tokenizer.convert_tokens_to_ids("<|im_end|>")
  )
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
- ```
-
- ## Training Details
-
- ### Training Data
- Trained on 789 conversation pairs extracted from AI-related Discord channels, focusing on topics like reverse engineering, internal Google/Anthropic models, and general community banter.
-
- ### Training Procedure
- - **Method:** QLoRA (4-bit)
- - **Hardware:** NVIDIA T4 GPU
- - **Epochs:** 2
- - **Learning Rate:** 5e-5
- - **Batch Size:** 1 (with 4 Gradient Accumulation Steps)
+ ```