Update README.md
Browse files
README.md
CHANGED
|
@@ -4,6 +4,18 @@ datasets:
|
|
| 4 |
- AbstractPhil/geometric-vocab
|
| 5 |
pipeline_tag: zero-shot-classification
|
| 6 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
# Still about the same accuracy deep as shallow
|
| 8 |
|
| 9 |
It's not a capacity issue YET then, since that should have covered it with shaper.
|
|
|
|
| 4 |
- AbstractPhil/geometric-vocab
|
| 5 |
pipeline_tag: zero-shot-classification
|
| 6 |
---
|
| 7 |
+
# Likely reintroduce the theta head tomorrow
|
| 8 |
+
|
| 9 |
+
The theta trains were actually not that bad. The head added some overhead but not really that much and the outcome improved, so it's worth exploring more.
|
| 10 |
+
|
| 11 |
+
Currently the l1 trains are performing well but still not up to the required 85% that I'm aiming for. Today's trains were underwhelming, but enlightening. Longer models aren't helpful in this structure any more than wide models are.
|
| 12 |
+
|
| 13 |
+
Reintroducing theta with some of my diffusion techniques might be in order if I can't get this one to comply. I'll try a couple of projection tricks before I go start digging into other experiments, but as it stands this one isn't yielding. Training an expert with pure data isn't exactly the geometry's forte - training students tends to yield much more effectively in comparison, but I'm trying to make a teacher model that can actually train students with geometry here.
|
| 14 |
+
|
| 15 |
+
To be fair, if it doesn't work, there's plenty of alternative options in the vit realm already - but I have high confidence that I can make it work. I just need to read more about capturing images, and treating the pentachora more as observers rather than direct relational interaction toolkits.
|
| 16 |
+
|
| 17 |
+
It'll likely need the constellation, but we'll see. It probably needs David's expert system.
|
| 18 |
+
|
| 19 |
# Still about the same accuracy deep as shallow
|
| 20 |
|
| 21 |
It's not a capacity issue YET then, since that should have covered it with shaper.
|