File size: 1,871 Bytes
710c2a5
 
 
 
 
 
 
 
 
 
 
 
dc91ace
 
 
b427802
 
1e71483
a32eb7c
1e71483
 
 
 
 
 
f157756
b427802
 
1e71483
 
 
b427802
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
---
license: cc-by-nc-nd-4.0
language:
- en
pipeline_tag: image-classification
tags:
- ImageNet-1k
- image classification
datasets:
- ILSVRC/imagenet-1k
metrics:
- accuracy
authors:
- Bo Deng, University of Nebraska-Lincoln
- Levi Heath, University of Colorado Colorado Springs
---

# Towards Errorless Training ImageNet-1k
This repository host MATLAB code and models for the manuscript, *Towards Errorless Training ImageNet-1k*, which is available at https://arxiv.org/abs/2508.04941.
We give 6 models trained on the ImageNet-1k dataset, which we list in the table below.
Each model is featured model of archtecture 17x40x2. 
That is, each model is made up of 17x40x2=1360 FNNs, all with homogeneous architecture (900-256-25 or 900-256-77-25), 
working in parallel to produce 1360 predictions which determine a final prediction using the majority voting protocol.

We trained the 6 models using the following transformation of the 64x64 downsampled ImageNet-1k dataset:
 - downsampled images to 32x32, using the mean values of non-overlapping 2x2 grid cells and
 - trimmed off top row, bottom row, left-most column, and right-most column.

This transformed data results in 30x30 images, hence 900-dimensional input vectors.

For a thorough description of our models trained on the ImageNet-1k dataset, please read our preprint linked above.


| Model | Training Method | FNN Architecture | Accuracy (%) |
| ------------- | ------------- | ------------- | ------------- |
| Model_S_h1_m1 | SGD | 900-256-25 | 98.247 |
| Model_S_h1_m2 | SGD | 900-256-25 | 98.299 |
| Model_S_h2_m1 | SGD | 900-256-77-25 | 96.990 |
| Model_T_h1_m1 | SGD followed by GDT | 900-256-25 | 98.289 |
| Model_T_h1_m2 | SGD followed by GDT | 900-256-25 | 98.300 |
| Model_T_h2_m1 | SGD followed by GDT | 900-256-77-25 | 97.770 |
*SGD = stochastic gradient descent
**GDT = gradient descent tunneling