AbstractPhil/geoclip-vit-base-patch-32-32d

AbstractPhil committed on Aug 28, 2025

Commit 6dfb17d (verified) · 1 parent: 60d8a1f

Update README.md

Files changed (1)

README.md +6 -6

@@ -39,10 +39,10 @@ This repo hosts the trained crystal classification head (+ run configs/metrics)
 OVERVIEW
 - Vision encoder: openai/clip-vit-base-patch32 (Hugging Face transformers), frozen by default.
-  Produces exactly one L2-normalized embedding per image (image_embeds, dim=512).
-- Vocabulary: AbstractPhil/geometric-vocab-512d (pentachora crystals).
   For CIFAR-100 class names, any missing tokens are deterministically synthesized via the unicode path to guarantee 100/100 coverage and preserve class ordering.
-- Head: projects both image embeddings (De=512) and role-selected class anchors (Dv=512) into a shared symbol space (crystal_dims=128), L2-normalizes, and computes cosine logits divided by T (temperature).
 - Training: Cross-Entropy on CIFAR-100, AdamW, optional AMP, cosine LR with warmup. Best checkpoint is saved and (optionally) pushed to Hugging Face.
 ---
@@ -50,9 +50,9 @@ OVERVIEW
 MODEL CARD
 - Task: Image Classification (CIFAR-100)
 - Backbone: openai/clip-vit-base-patch32 (vision-only)
-- Head: Crystal projection head (image 512→128, anchor 512→128) + cosine logits (temperature)
-- Vocabulary: AbstractPhil/geometric-vocab-512d (wordnet_eng split + deterministic unicode synth for gaps)
-- Metrics: Top-1 = [80~], Top-3 = [90>]
 - License: MIT
 ---

 OVERVIEW
 - Vision encoder: openai/clip-vit-base-patch32 (Hugging Face transformers), frozen by default.
+  Produces exactly one L2-normalized embedding per image (image_embeds, dim=32).
+- Vocabulary: AbstractPhil/geometric-vocab-32d (pentachora crystals).
   For CIFAR-100 class names, any missing tokens are deterministically synthesized via the unicode path to guarantee 100/100 coverage and preserve class ordering.
+- Head: projects both image embeddings (De=32) and role-selected class anchors (Dv=32) into a shared symbol space (crystal_dims=64), L2-normalizes, and computes cosine logits divided by T (temperature).
 - Training: Cross-Entropy on CIFAR-100, AdamW, optional AMP, cosine LR with warmup. Best checkpoint is saved and (optionally) pushed to Hugging Face.
 ---
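
The head described in the overview (project both inputs into a shared space, L2-normalize, divide cosine similarities by T) can be sketched in NumPy as below. This is an illustrative sketch, not the repository's code. It assumes bias-free linear projections, random stand-in weights, and a temperature of 0.07, none of which are stated in the README. The overview and the model card give different image input widths (32 vs. 512), so the sketch follows the model card's 512→64 and 32→64 figures and treats every size as a parameter. The function names `l2_normalize` and `crystal_head_logits` are hypothetical.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors along `axis` to unit L2 norm (eps guards against zero vectors)."""
    norm = np.linalg.norm(x, axis=axis, keepdims=True)
    return x / np.maximum(norm, eps)

def crystal_head_logits(image_embeds, anchors, w_img, w_anchor, temperature=0.07):
    """Cosine logits between projected images and projected class anchors.

    image_embeds: (B, De)   anchors: (C, Dv)
    w_img: (De, crystal_dims)   w_anchor: (Dv, crystal_dims)
    """
    z_img = l2_normalize(image_embeds @ w_img)   # (B, crystal_dims)
    z_cls = l2_normalize(anchors @ w_anchor)     # (C, crystal_dims)
    # Dot products of unit vectors are cosine similarities in [-1, 1].
    return (z_img @ z_cls.T) / temperature       # (B, C)

# Stand-in data; shapes are assumptions taken from the model card line.
rng = np.random.default_rng(0)
B, C = 4, 100                        # batch size, CIFAR-100 classes
De, Dv, crystal_dims = 512, 32, 64   # assumed sizes
temperature = 0.07                   # hypothetical value

image_embeds = rng.normal(size=(B, De))
anchors = rng.normal(size=(C, Dv))
w_img = rng.normal(size=(De, crystal_dims))
w_anchor = rng.normal(size=(Dv, crystal_dims))

logits = crystal_head_logits(image_embeds, anchors, w_img, w_anchor, temperature)
```

Because both projections are normalized before the dot product, the temperature alone sets the logit range: every entry lies in [-1/T, 1/T], so a smaller T sharpens the softmax used by the cross-entropy loss.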
 MODEL CARD
 - Task: Image Classification (CIFAR-100)
 - Backbone: openai/clip-vit-base-patch32 (vision-only)
+- Head: Crystal projection head (image 512→64, anchor 32→64) + cosine logits (temperature)
+- Vocabulary: AbstractPhil/geometric-vocab-32d (wordnet_eng split + deterministic unicode synth for gaps)
+- Metrics: Top-1 = [60~], Top-3 = [80>]
 - License: MIT
 ---
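
For reference, the Top-1 and Top-3 figures in the Metrics line are top-k accuracies: a sample counts as correct when its true class is among the k highest-scoring classes. A minimal NumPy sketch follows. The helper name `top_k_accuracy` and the toy logits are hypothetical and are not taken from the repository or its evaluation run.

```python
import numpy as np

def top_k_accuracy(logits, labels, k=1):
    """Fraction of rows whose true label is among the k largest logits."""
    # argpartition places the k largest logits (k smallest of -logits) in the first k slots.
    top_k = np.argpartition(-logits, k - 1, axis=1)[:, :k]
    hits = (top_k == labels[:, None]).any(axis=1)
    return float(hits.mean())

# Toy example: 3 samples, 4 classes.
logits = np.array([[0.1, 0.7, 0.2, 0.0],
                   [0.5, 0.1, 0.3, 0.1],
                   [0.2, 0.3, 0.1, 0.4]])
labels = np.array([1, 2, 0])

top1 = top_k_accuracy(logits, labels, k=1)  # 1 of 3 samples is ranked first
top3 = top_k_accuracy(logits, labels, k=3)  # all 3 true labels fall in the top 3
```

Top-3 is always at least as high as Top-1 on the same predictions, which matches the ordering of the two figures reported above.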