Update README.md
Browse files
README.md
CHANGED
|
@@ -90,7 +90,7 @@ but I will do my best to monitor them as the diffusion training progresses.
|
|
| 90 |
|
| 91 |
The router structure will have an LRU caching system per device, depending how well accelerate takes - and those subsystems will be augmented to directly handle their own onload/offloading of information.
|
| 92 |
|
| 93 |
-
As it stands, the router structure
|
| 94 |
|
| 95 |
# Block 4 - The inversion.
|
| 96 |
|
|
|
|
| 90 |
|
| 91 |
The router structure will have an LRU caching system per device, depending how well accelerate takes - and those subsystems will be augmented to directly handle their own onload/offloading of information.
|
| 92 |
|
| 93 |
+
As it stands, the router structure is optimized for wide and contains safeguards for behavior that would otherwise cause wide models to corrupt or fail.
|
| 94 |
|
| 95 |
# Block 4 - The inversion.
|
| 96 |
|