datasets:
- AbstractPhil/geometric-vocab
pipeline_tag: zero-shot-classification
---

# A few more variants first

There are a few unexplored elements of rose5 that I need to investigate now that I've taken stock of the full roster.

As it stands, the majority of the consistency comes directly from cosine similarity on the hypersphere, using the penta as a variant form of vector lattice. It works, yes, but it's also not what the models are supposed to be doing.
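
For concreteness, here is a minimal sketch of what that cosine-on-the-hypersphere scoring looks like in PyTorch. The shapes, names, and the mean-over-vertices aggregation are my illustrative assumptions, not the repo's exact code:

```python
# Minimal sketch, assuming one frozen pentachoron of 5 unit vertices per
# class. Names and aggregation are illustrative only.
import torch
import torch.nn.functional as F

def penta_cosine_scores(z: torch.Tensor, penta: torch.Tensor) -> torch.Tensor:
    """Cosine of hypersphere-projected embeddings against penta vertices.

    z:     [batch, dim] raw embeddings
    penta: [num_classes, 5, dim] frozen pentachoron vertices
    returns [batch, num_classes] class scores
    """
    z = F.normalize(z, dim=-1)                 # project onto the unit hypersphere
    v = F.normalize(penta, dim=-1)             # vertices live on the sphere too
    sims = torch.einsum("bd,cvd->bcv", z, v)   # cosine to every vertex
    return sims.mean(dim=-1)                   # aggregate the 5-vertex "lattice"
```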

If left running too long, the variants show that the cosine objective collapses to being primarily head-dependent, which means the model eventually just falls into a plain classification state.

It seems only one loss is necessary, and that one loss is a combination of rose (multi-cosine) mixed with alignment and margin losses.
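
A hedged sketch of that single combined objective. The weights, the margin value, and the centroid-based alignment term are placeholder assumptions, not the tuned rose5 formulation:

```python
# Sketch only: rose (multi-cosine) + alignment + margin as one loss.
import torch
import torch.nn.functional as F

def combined_rose_loss(z, penta, labels, margin=0.2, w_align=1.0, w_margin=1.0):
    """z: [batch, dim], penta: [num_classes, 5, dim], labels: [batch]."""
    z = F.normalize(z, dim=-1)
    v = F.normalize(penta, dim=-1)
    sims = torch.einsum("bd,cvd->bcv", z, v)        # cosine to every vertex
    scores = sims.mean(dim=-1)                      # [batch, num_classes]

    # rose (multi-cosine): pull toward all five vertices of the true class
    rose = (1.0 - sims[torch.arange(len(z)), labels]).mean()

    # alignment: pull toward the normalized centroid of the true class
    centroid = F.normalize(v.mean(dim=1), dim=-1)   # [num_classes, dim]
    align = (1.0 - (z * centroid[labels]).sum(dim=-1)).mean()

    # margin: stay ahead of the hardest negative class by at least `margin`
    pos = scores.gather(1, labels[:, None]).squeeze(1)
    mask = F.one_hot(labels, scores.size(1)).bool()
    neg = scores.masked_fill(mask, float("-inf")).max(dim=1).values
    hinge = F.relu(margin - (pos - neg)).mean()

    return rose + w_align * align + w_margin * hinge
```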

If my hunch is correct, centroid may prove much weaker than rose5 once every loss except the geometric rose loss is turned off.
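
The ablation then amounts to zeroing every weight except the rose term; the key names below are made up for illustration, not the trainer's actual options:

```python
# Hypothetical loss-weight config for the ablation described above.
ABLATION = {
    "rose": 1.0,           # geometric rose (multi-cosine) stays on
    "alignment": 0.0,      # off
    "margin": 0.0,         # off
    "cross_entropy": 0.0,  # off
}
```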

# Surgery again

Alright. This initial cycle is concluded. I've determined that the frozen pentachora are in fact utilized at only about 50% most of the time, and will eventually cap at around 60% if you stick to standard cross-entropy with heavy geometric regularization. So roughly three of the five penta vertices are covered, and the other two are completely discarded.
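
For reference, a rough sketch of how that utilization figure can be measured: assign each sample to the nearest vertex of its true class and count how often each of the five vertices wins. The function and names are illustrative, not the actual evaluation code:

```python
# Sketch: per-vertex utilization of a frozen pentachoron.
import torch
import torch.nn.functional as F

@torch.no_grad()
def vertex_utilization(z, penta, labels):
    """Fraction of samples claimed by each of the 5 vertices of their class."""
    z = F.normalize(z, dim=-1)
    v = F.normalize(penta, dim=-1)
    sims = torch.einsum("bd,cvd->bcv", z, v)      # [batch, num_classes, 5]
    own = sims[torch.arange(len(z)), labels]      # cosines to own-class vertices
    winners = own.argmax(dim=-1)                  # nearest vertex per sample
    counts = torch.bincount(winners, minlength=5).float()
    return counts / counts.sum()                  # e.g. ~3 live vertices, ~2 dead
```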