Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -45,7 +45,9 @@ anchor system that will be entirely reusable as a pure geometric anchored bank.
 So it'll be around 36,000,000 * 5 * 10, roughly 1.8b more samples give or take should be enough for a full caption shared cohesion.
-This can be handled on a single G4 in a few days, nothing too major. Everything is within cost.
 Saturating the internals of the anchor and the subsystem will allow for more complex processes and easy alignment with pieces of the data. After that
 it will be quite fast to sample the most accurate captions and begin forming vit association, which will allow for a full next token prediction capacity

 So it'll be around 36,000,000 * 5 * 10, roughly 1.8b more samples give or take should be enough for a full caption shared cohesion.
+The training itself can be handled on a single G4 in a few days, nothing too major. Everything is within cost.
+The data prep on the other hand may take a bit longer, but I can run multiple t4 at low cost to prepare them over time, which will be cheap.
 Saturating the internals of the anchor and the subsystem will allow for more complex processes and easy alignment with pieces of the data. After that
 it will be quite fast to sample the most accurate captions and begin forming vit association, which will allow for a full next token prediction capacity