Update README.md
Browse files
README.md
CHANGED
|
@@ -18,8 +18,24 @@ base_model:
|
|
| 18 |
- AbstractPhil/geolip-bertenstein
|
| 19 |
---
|
| 20 |
|
| 21 |
-
# Newest:
|
| 22 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 23 |
|
| 24 |
As the subsystems aligned, the core system aligned around it, and the accuracy is still R1: 99.9% accuracy, meaning nearly 100% accuracy for validation.
|
| 25 |
|
|
|
|
| 18 |
- AbstractPhil/geolip-bertenstein
|
| 19 |
---
|
| 20 |
|
| 21 |
+
# Newest: Prepping 12m conceptual-captions bert extractions
|
| 22 |
+
|
| 23 |
+
The dataset is going to be in pt chunks because they load directly to vram nearly instantly in colab, and the system operates on them quicker than dataloaders.
|
| 24 |
+
|
| 25 |
+
I'll be running the full 12m set, no exceptions - short llava, long llava, and original captions.
|
| 26 |
+
|
| 27 |
+
After the 12m 5 expert dataset training completes, the core model will be ready.
|
| 28 |
+
|
| 29 |
+
It's legitimately wild watching the system sit there at 100% accuracy validation, but it requires additional complexity so that isn't the measure to analyze.
|
| 30 |
+
The problem is solved for recall, but the internal structure's geometric system needs to align to the larger spectrum of rigidity that the smooth manifold
|
| 31 |
+
deviations require to create a full cohesion, meaning more data.
|
| 32 |
+
|
| 33 |
+
12 million samples roughly 10 epochs should be a fair assessment. Hopefully the data isn't too much.
|
| 34 |
+
|
| 35 |
+
|
| 36 |
+
# 2 additional epochs, 1m samples ran
|
| 37 |
+
The alignment became more aligned, considerably more aligned. The count went from 0.087 to around 0.1 something last I checked. It's rising every batch, and the
|
| 38 |
+
anchor is going to be continued to train on low heat now that they have both begun to align.
|
| 39 |
|
| 40 |
As the subsystems aligned, the core system aligned around it, and the accuracy is still R1: 99.9% accuracy, meaning nearly 100% accuracy for validation.
|
| 41 |
|