Model-J: MAE Model (model_idx_0213)
This model is part of the Model-J dataset, introduced in:
Learning on Model Weights using Tree Experts (CVPR 2025) by Eliahu Horwitz*, Bar Cavia*, Jonathan Kahana*, Yedid Hoshen
๐ Project | ๐ Paper | ๐ป GitHub | ๐ค Dataset

Model Details
| Attribute |
Value |
| Subset |
MAE |
| Split |
train |
| Base Model |
facebook/vit-mae-base |
| Dataset |
CIFAR100 (50 classes) |
Training Hyperparameters
| Parameter |
Value |
| Learning Rate |
0.0005 |
| LR Scheduler |
cosine |
| Epochs |
9 |
| Max Train Steps |
2997 |
| Batch Size |
64 |
| Weight Decay |
0.01 |
| Seed |
213 |
| Random Crop |
False |
| Random Flip |
False |
Performance
| Metric |
Value |
| Train Accuracy |
0.9996 |
| Val Accuracy |
0.5976 |
| Test Accuracy |
0.5810 |
Training Categories
The model was fine-tuned on the following 50 CIFAR100 classes:
rabbit, crab, house, clock, raccoon, maple_tree, wardrobe, keyboard, dinosaur, sweet_pepper, seal, otter, lawn_mower, table, streetcar, mountain, cloud, television, pickup_truck, skunk, willow_tree, can, oak_tree, pear, camel, poppy, man, tiger, telephone, shrew, rose, wolf, dolphin, bowl, mushroom, sunflower, skyscraper, girl, bridge, cockroach, woman, chair, bicycle, elephant, turtle, leopard, worm, baby, bottle, road