Model-J: SupViT Model (model_idx_0005)

This model is part of the Model-J dataset, introduced in:

Learning on Model Weights using Tree Experts (CVPR 2025) by Eliahu Horwitz*, Bar Cavia*, Jonathan Kahana*, Yedid Hoshen

๐ŸŒ Project | ๐Ÿ“ƒ Paper | ๐Ÿ’ป GitHub | ๐Ÿค— Dataset

ProbeX

Model Details

Attribute Value
Subset SupViT
Split val
Base Model google/vit-base-patch16-224
Dataset CIFAR100 (50 classes)

Training Hyperparameters

Parameter Value
Learning Rate 7e-05
LR Scheduler cosine
Epochs 9
Max Train Steps 2997
Batch Size 64
Weight Decay 0.03
Seed 5
Random Crop True
Random Flip True

Performance

Metric Value
Train Accuracy 0.9994
Val Accuracy 0.9421
Test Accuracy 0.9420

Training Categories

The model was fine-tuned on the following 50 CIFAR100 classes:

tulip, shrew, flatfish, crocodile, bus, streetcar, tractor, otter, tank, caterpillar, house, bee, pear, willow_tree, bridge, dinosaur, butterfly, cattle, can, spider, crab, bicycle, sea, apple, bed, snake, oak_tree, porcupine, worm, squirrel, orange, skyscraper, rabbit, raccoon, seal, camel, plain, sweet_pepper, beaver, girl, elephant, snail, orchid, keyboard, cloud, boy, ray, bottle, rose, tiger

Downloads last month
5
Safetensors
Model size
85.8M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ProbeX/Model-J__SupViT__model_idx_0005

Finetuned
(1944)
this model

Collection including ProbeX/Model-J__SupViT__model_idx_0005

Paper for ProbeX/Model-J__SupViT__model_idx_0005