AbstractPhil/penta-vit-experiments

Zero-Shot Classification

AbstractPhil committed on Sep 13, 2025

Commit 9c64f48 · verified

1 Parent(s): d905f3b

Update README.md

Files changed (1)

README.md +6 -0

README.md CHANGED

@@ -4,6 +4,12 @@ datasets:
 - AbstractPhil/geometric-vocab
 pipeline_tag: zero-shot-classification
 ---
 # Formulas have been purified
 The newest vit_zana_nano train has shown a very clean curve. runs/vit_zana_nano/20250913_192119 which is about a 3 meg model capable of creating >50% accuracy 128 dim features from cifar100 classification.

 - AbstractPhil/geometric-vocab
 pipeline_tag: zero-shot-classification
 ---
+# Still about the same accuracy deep as shallow
+It's not a capacity issue YET then, since that should have covered it with shaper.
+I have a few ideas, but I think I'll focus on getting more shallow models stable and then scale up slowly instead of just trying to use a logarithm.
 # Formulas have been purified
 The newest vit_zana_nano train has shown a very clean curve. runs/vit_zana_nano/20250913_192119 which is about a 3 meg model capable of creating >50% accuracy 128 dim features from cifar100 classification.