Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness
Paper
•
2408.05446
•
Published
•
1
reproducing: "Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness" (https://arxiv.org/abs/2408.05446)
source code and usage examples: https://github.com/ETH-DISCO/self-ensembling
architecture based on Torchvision's Resnet152 default implementation
hyperparameters:
torch.nn.CrossEntropyLoss()torch.optim.AdamWGradScaler["cifar10", "cirfar100"]0.000116 (higher would be even better, but maybe by <1%)2 (difference between crossmax_k=2 and crossmax_k=3 is about 1-2%, so it's not a big deal)