Update README.md
README.md
CHANGED
@@ -45,7 +45,9 @@ anchor system that will be entirely reusable as a pure geometric anchored bank.

 So it'll be around 36,000,000 * 5 * 10, roughly 1.8b more samples give or take should be enough for a full caption shared cohesion.

-
+The training itself can be handled on a single G4 in a few days, nothing too major. Everything is within cost.
+
+The data prep on the other hand may take a bit longer, but I can run multiple t4 at low cost to prepare them over time, which will be cheap.

 Saturating the internals of the anchor and the subsystem will allow for more complex processes and easy alignment with pieces of the data. After that
 it will be quite fast to sample the most accurate captions and begin forming vit association, which will allow for a full next token prediction capacity
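
The sample estimate in the README text (36,000,000 * 5 * 10, roughly 1.8b) can be checked with a quick sketch. The README does not say what the factors 5 and 10 stand for, so the parameter names `factor_a` and `factor_b` and the helper `estimate_extra_samples` are hypothetical placeholders, not names from the project.

```python
def estimate_extra_samples(base_items: int, factor_a: int, factor_b: int) -> int:
    """Multiply a base item count by two multipliers.

    The meaning of the two multipliers is an assumption left open here;
    the README only gives the numbers 5 and 10.
    """
    return base_items * factor_a * factor_b


# Numbers taken directly from the README line.
total = estimate_extra_samples(36_000_000, 5, 10)
print(f"{total:,} samples (~{total / 1e9:.1f}b)")
```

This only confirms the arithmetic behind the "roughly 1.8b" figure; it says nothing about whether that many samples is actually sufficient.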