Update README.md

README.md
@@ -4,6 +4,14 @@ datasets:
 - AbstractPhil/geometric-vocab
 pipeline_tag: zero-shot-classification
 ---
+# Formulas have been purified
+
+The newest vit_zana_nano training run has shown a very clean curve.
+
+I've begun training a much deeper zana dubbed vit_zana_shaper. This model is 32 layers deep with an MLP ratio of 1 and 2 attention heads, resting at about 3.5 million parameters.
+
+Let's see how she fares.
+
 # I've had an epiphany. We don't NEED transformer layers in their current form.
 
 David's architecture already solved this need with high-efficiency multi-stage geometric mathematics.