darwinkernelpanic committed on
Commit 7e6b4e2 · verified · 1 Parent(s): 94b1dcd

Update README.md

Files changed (1)
  1. README.md +3 -2
README.md CHANGED
@@ -1,6 +1,5 @@
  ---
  license: mit
- library_name: diffusers
  tags:
  - diffusion
  - llm
@@ -8,6 +7,8 @@ tags:
  - difference-labs
  datasets:
  - smangrul/ultrachat-10k-chatml
+ base_model:
+ - darwinkernelpanic/DiffReaper-5L
  ---
 
  # DiffReaper-6
@@ -27,4 +28,4 @@ It represents a significant architectural leap over the previous 5L version, tra
  The model is being trained on an RTX 5090 using the `ultrachat-10k` dataset, focusing on conversational flow and instruction following.
 
  ## Goal
- To prove that diffusion models can reach (and eventually exceed) the coherence of auto-regressive models while maintaining the creative "soul" and parallel generation benefits of diffusion.
+ To prove that diffusion models can reach (and eventually exceed) the coherence of auto-regressive models while maintaining the creative "soul" and parallel generation benefits of diffusion.