Update README.md
Browse files
README.md
CHANGED
|
@@ -46,7 +46,8 @@ SarcasmDiffusion is a diffusion-based generative model focused on producing **cl
|
|
| 46 |
- **LoRA rank / alpha / dropout:** `r=8`, `alpha=16`, `dropout=0.05`
|
| 47 |
- **Resolution:** 1024 (training); common inference at 768–896 for speed
|
| 48 |
- **Batch:** 1 (gradient accumulation = 4)
|
| 49 |
-
- **Steps:** ~9k (≈2 epoch on ~5k images)
|
|
|
|
| 50 |
- **Precision:** fp16 (LoRA params kept in fp32 during training)
|
| 51 |
- **Optimizer:** AdamW
|
| 52 |
- **Scheduler:** cosine with warmup (recommended)
|
|
|
|
| 46 |
- **LoRA rank / alpha / dropout:** `r=8`, `alpha=16`, `dropout=0.05`
|
| 47 |
- **Resolution:** 1024 (training); common inference at 768–896 for speed
|
| 48 |
- **Batch:** 1 (gradient accumulation = 4)
|
| 49 |
+
- **Steps:** ~9k (≈2 epoch on ~5k images)
|
| 50 |
+
- **Learning Rate:** 0.0001
|
| 51 |
- **Precision:** fp16 (LoRA params kept in fp32 during training)
|
| 52 |
- **Optimizer:** AdamW
|
| 53 |
- **Scheduler:** cosine with warmup (recommended)
|