Update README.md
Browse files
README.md
CHANGED
|
@@ -4,6 +4,12 @@ datasets:
|
|
| 4 |
- AbstractPhil/geometric-vocab
|
| 5 |
pipeline_tag: zero-shot-classification
|
| 6 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
# Formulas have been purified
|
| 8 |
|
| 9 |
The newest vit_zana_nano train has shown a very clean curve. runs/vit_zana_nano/20250913_192119 which is about a 3 meg model capable of creating >50% accuracy 128 dim features from cifar100 classification.
|
|
|
|
| 4 |
- AbstractPhil/geometric-vocab
|
| 5 |
pipeline_tag: zero-shot-classification
|
| 6 |
---
|
| 7 |
+
# Still about the same accuracy deep as shallow
|
| 8 |
+
|
| 9 |
+
It's not a capacity issue YET then, since that should have covered it with shaper.
|
| 10 |
+
|
| 11 |
+
I have a few ideas but I think I'll focus on getting more shallow models stable and then scale up slowly instead of trying to just using a logarithm.
|
| 12 |
+
|
| 13 |
# Formulas have been purified
|
| 14 |
|
| 15 |
The newest vit_zana_nano train has shown a very clean curve. runs/vit_zana_nano/20250913_192119 which is about a 3 meg model capable of creating >50% accuracy 128 dim features from cifar100 classification.
|