Update README.md
README.md CHANGED
@@ -4,6 +4,20 @@ datasets:
- AbstractPhil/geometric-vocab
pipeline_tag: zero-shot-classification
---
# Enabling the Mix-N-Cut

I've built a mix-n-cut that I've been avoiding enabling. This one is formatted specifically for pentachoron, so we'll see how it fares. I'm trying to build one as SMALL AS POSSIBLE, so if this mix-n-cut can pull the task out of the bag, I may as well run it.

As it stands, the tiny ViTs cap at 41% on CIFAR-100 with no augmentations. I've been running all the training runs without a single special effect and only minimal normalization.

Let's see how the upcoming training runs fare.

`pixie_base_128d_patch4_128h`

Pixie base has 10 layers: 5 geometric attention and 5 traditional multi-head attention. Let's see how the mix-n-cut fares with the earlier models first; then we'll run the base.

The smaller ones seem to behave better using the geometric attention at 256 expert heads, which is odd to me, but whatever works. They don't get much bigger with more experts, so I'll just try a tiny one with a ton of heads first.
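A hedged note on why more heads barely grow the model: assuming the expert heads split the embedding dimension the way standard multi-head attention does (the geometric attention here may work differently), the Q/K/V/output projections stay `d_model x d_model` regardless of head count. `attn_param_count` is a hypothetical helper for the arithmetic:

```python
# Hedged arithmetic sketch: with head_dim = d_model // num_heads, the four
# projection matrices are d_model x d_model no matter how many heads there
# are, so the attention parameter count is independent of the head count.
def attn_param_count(d_model: int, num_heads: int) -> int:
    assert d_model % num_heads == 0, "heads must divide the model dim"
    # Q, K, V, and output projections: four d_model x d_model matrices + biases.
    return 4 * (d_model * d_model + d_model)

# Same parameter count at 4, 8, or 16 heads for a 128-d model.
print(attn_param_count(128, 4), attn_param_count(128, 8), attn_param_count(128, 16))
```

So "a ton of heads" mostly changes how the dimension is sliced, not how many weights get trained.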
# Pentachoron Geometric Feature Extraction