AbstractPhil commited on
Commit
bcdc9e4
·
verified ·
1 Parent(s): 51fb679

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -0
README.md CHANGED
@@ -4,6 +4,18 @@ datasets:
4
  - AbstractPhil/geometric-vocab
5
  pipeline_tag: zero-shot-classification
6
  ---
 
 
 
 
 
 
 
 
 
 
 
 
7
  # Still about the same accuracy deep as shallow
8
 
9
  It's not a capacity issue YET then, since that should have covered it with shaper.
 
4
  - AbstractPhil/geometric-vocab
5
  pipeline_tag: zero-shot-classification
6
  ---
7
+ # Likely reintroduce the theta head tomorrow
8
+
9
+ The theta trains were actually not that bad. The head added some overhead but not really that much and the outcome improved, so it's worth exploring more.
10
+
11
+ Currently the l1 trains are performing well but still not up to the required 85% that I'm aiming for. Today's trains were underwhelming, but enlightening. Longer models aren't helpful in this structure any more than wide models are.
12
+
13
+ Reintroducing theta with some of my diffusion techniques might be in order if I can't get this one to comply. I'll try a couple of projection tricks before I go start digging into other experiments, but as it stands this one isn't yielding. Training an expert with pure data isn't exactly the geometry's forte - training students tends to yield much more effectively in comparison, but I'm trying to make a teacher model that can actually train students with geometry here.
14
+
15
+ To be fair, if it doesn't work, there's plenty of alternative options in the vit realm already - but I have high confidence that I can make it work. I just need to read more about capturing images, and treating the pentachora more as observers rather than direct relational interaction toolkits.
16
+
17
+ It'll likely need the constellation, but we'll see. It probably needs David's expert system.
18
+
19
  # Still about the same accuracy deep as shallow
20
 
21
  It's not a capacity issue YET then, since that should have covered it with shaper.