Update README.md
Browse files
README.md
CHANGED
|
@@ -39,10 +39,10 @@ This repo hosts the trained crystal classification head (+ run configs/metrics)
|
|
| 39 |
|
| 40 |
OVERVIEW
|
| 41 |
- Vision encoder: openai/clip-vit-base-patch32 (Hugging Face transformers), frozen by default.
|
| 42 |
-
Produces exactly one L2-normalized embedding per image (image_embeds, dim=
|
| 43 |
-
- Vocabulary: AbstractPhil/geometric-vocab-
|
| 44 |
For CIFAR-100 class names, any missing tokens are deterministically synthesized via the unicode path to guarantee 100/100 coverage and preserve class ordering.
|
| 45 |
-
- Head: projects both image embeddings (De=
|
| 46 |
- Training: Cross-Entropy on CIFAR-100, AdamW, optional AMP, cosine LR with warmup. Best checkpoint is saved and (optionally) pushed to Hugging Face.
|
| 47 |
|
| 48 |
---
|
|
@@ -50,9 +50,9 @@ OVERVIEW
|
|
| 50 |
MODEL CARD
|
| 51 |
- Task: Image Classification (CIFAR-100)
|
| 52 |
- Backbone: openai/clip-vit-base-patch32 (vision-only)
|
| 53 |
-
- Head: Crystal projection head (image 512→
|
| 54 |
-
- Vocabulary: AbstractPhil/geometric-vocab-
|
| 55 |
-
- Metrics: Top-1 = [
|
| 56 |
- License: MIT
|
| 57 |
|
| 58 |
---
|
|
|
|
| 39 |
|
| 40 |
OVERVIEW
|
| 41 |
- Vision encoder: openai/clip-vit-base-patch32 (Hugging Face transformers), frozen by default.
|
| 42 |
+
Produces exactly one L2-normalized embedding per image (image_embeds, dim=32).
|
| 43 |
+
- Vocabulary: AbstractPhil/geometric-vocab-32d (pentachora crystals).
|
| 44 |
For CIFAR-100 class names, any missing tokens are deterministically synthesized via the unicode path to guarantee 100/100 coverage and preserve class ordering.
|
| 45 |
+
- Head: projects both image embeddings (De=32) and role-selected class anchors (Dv=32) into a shared symbol space (crystal_dims=64), L2-normalizes, and computes cosine logits divided by T (temperature).
|
| 46 |
- Training: Cross-Entropy on CIFAR-100, AdamW, optional AMP, cosine LR with warmup. Best checkpoint is saved and (optionally) pushed to Hugging Face.
|
| 47 |
|
| 48 |
---
|
|
|
|
| 50 |
MODEL CARD
|
| 51 |
- Task: Image Classification (CIFAR-100)
|
| 52 |
- Backbone: openai/clip-vit-base-patch32 (vision-only)
|
| 53 |
+
- Head: Crystal projection head (image 512→64, anchor 32→64) + cosine logits (temperature)
|
| 54 |
+
- Vocabulary: AbstractPhil/geometric-vocab-32d (wordnet_eng split + deterministic unicode synth for gaps)
|
| 55 |
+
- Metrics: Top-1 = [60~], Top-3 = [80>]
|
| 56 |
- License: MIT
|
| 57 |
|
| 58 |
---
|