darwinkernelpanic commited on
Commit
7d0032b
·
verified ·
1 Parent(s): 7273756

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +50 -0
README.md ADDED
@@ -0,0 +1,50 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: openrail
5
+ library_name: diffusers
6
+ tags:
7
+ - diffusion-llm
8
+ - parallel-generation
9
+ - custom-transformer
10
+ - cropmark
11
+ datasets:
12
+ - OpenAssistant/oasst1
13
+ metrics:
14
+ - cosine_similarity
15
+ ---
16
+
17
+ # 🪐 DiffReaper-5 (Cropmark v2)
18
+
19
+ DiffReaper-5 is a **Conditioned Diffusion Large Language Model (DLLM)** designed for high-throughput, parallel conversational text generation. Unlike standard autoregressive models (GPT-style), DiffReaper-5 operates in the continuous latent embedding space, denoising an entire response sequence in parallel.
20
+
21
+ ## 🔬 Model Details
22
+
23
+ - **Architecture:** Custom 12-layer Mercury-inspired Transformer.
24
+ - **Task:** Conditioned Text Diffusion (Prompt-Response).
25
+ - **Latent Space:** 1024-dimensional continuous embeddings.
26
+ - **Training Objective:** Cosine Similarity Regression (Directional Loss).
27
+ - **Sampling:** 10-step iterative parallel denoising.
28
+
29
+ ## 🚀 Autonomous Training State
30
+
31
+ This model is currently in **Autonomous Growth Mode**. It is training on an RTX 3090 cluster with the following parameters:
32
+ - **Conditioning:** Hard-prompt conditioning (32 tokens).
33
+ - **Generation Window:** 32 tokens (parallel).
34
+ - **Optimizer:** AdamW with a learning rate of 1e-4.
35
+ - **Sync:** Auto-checkpointing every 2,500 steps to this repository.
36
+
37
+ ## 🛠️ Intended Use
38
+
39
+ DiffReaper-5 is intended for research into **Non-Autoregressive Generation**. Its primary strengths are:
40
+ 1. **Speed:** Parallel token generation eliminates the KV-cache bottleneck.
41
+ 2. **Coherence:** Focuses on global sequence structure rather than next-token probability.
42
+
43
+ ## 📈 Diagnostic: Cropmark
44
+
45
+ The model's progress is monitored via the **Cropmark Diagnostic**.
46
+ - **Cropmark** tests the model's ability to manifest a response (e.g., "I am good, how are you?") from pure Gaussian noise given a fixed prompt.
47
+ - Results are logged in `checkpoint_log.txt` and uploaded periodically.
48
+
49
+ ---
50
+ *Created by Darwin (Oscar) & Clawd.*