Model-J: MAE Model (model_idx_0000)

This model is part of the Model-J dataset, introduced in:

Learning on Model Weights using Tree Experts (CVPR 2025) by Eliahu Horwitz*, Bar Cavia*, Jonathan Kahana*, Yedid Hoshen

๐ŸŒ Project | ๐Ÿ“ƒ Paper | ๐Ÿ’ป GitHub | ๐Ÿค— Dataset

ProbeX

Model Details

Attribute Value
Subset MAE
Split train
Base Model facebook/vit-mae-base
Dataset CIFAR100 (50 classes)

Training Hyperparameters

Parameter Value
Learning Rate 0.0005
LR Scheduler cosine
Epochs 6
Max Train Steps 1998
Batch Size 64
Weight Decay 0.009
Seed 0
Random Crop False
Random Flip False

Performance

Metric Value
Train Accuracy 0.7001
Val Accuracy 0.4885
Test Accuracy 0.4934

Training Categories

The model was fine-tuned on the following 50 CIFAR100 classes:

skyscraper, rose, squirrel, beetle, telephone, oak_tree, caterpillar, ray, bus, dolphin, girl, bridge, poppy, flatfish, woman, rocket, bear, television, dinosaur, tank, motorcycle, lizard, orchid, cloud, sea, mountain, streetcar, shrew, wardrobe, can, palm_tree, baby, lamp, kangaroo, willow_tree, beaver, possum, aquarium_fish, lobster, bicycle, otter, crocodile, plain, wolf, bowl, cockroach, fox, cup, lawn_mower, skunk

Downloads last month
7
Safetensors
Model size
85.8M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ProbeX/Model-J__MAE__model_idx_0000

Finetuned
(1010)
this model

Collection including ProbeX/Model-J__MAE__model_idx_0000

Paper for ProbeX/Model-J__MAE__model_idx_0000