File size: 2,249 Bytes
7f22fff 26d4801 7f22fff 26d4801 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 |
---
language: en
license: mit
tags:
- keras
- tensorflow
- computer-vision
- image-processing
- corn-kernel-classification
pipeline_tag: image-classification
library_name: keras
---
# CornViT
A Multi-Stage Convolutional Vision Transformer Framework for Corn Kernel Analysis
## Overview
Three-stage hierarchical classification pipeline for automated corn kernel quality assessment:
- **Stage 1**: Purity detection (Pure vs Impure)
- **Stage 2**: Shape classification (Flat vs Round)
- **Stage 3**: Embryo orientation (Up vs Down)
## Architecture
- **Model**: CvT-13 (384Γ384) with ImageNet-22k pretraining
- **Framework**: PyTorch + Microsoft CvT
- **Test Accuracy**: 93.8% (Stage 1), 94.1% (Stage 2), 91.1% (Stage 3)
## Setup
```bash
# Clone repository
git clone https://github.com/microsoft/CvT.git
# Install dependencies
pip install -r requirements.txt
```
## Training
Each stage has independent training scripts:
```bash
python stage1/train_cvt13.py # Purity classification
python stage2/train_cvt13.py # Shape classification
python stage3/train_cvt13.py # Embryo orientation
```
## Inference
```bash
python stage1/inference_cvt13.py
python stage2/inference_cvt13.py
python stage3/inference_cvt13.py
```
## Baselines
ResNet50 and DenseNet121 baselines available in `baselines/`.
## Structure
```
βββ stage1/ # Purity classification
βββ stage2/ # Shape classification
βββ stage3/ # Embryo orientation
βββ preprocess/ # Data preprocessing scripts
```
## Requirements
- Python 3.13+
- PyTorch 2.9+
- CUDA (optional, for GPU training)
---
## Citation
If you use this code, models, or catalog in your research, please cite:
```bibtex
@Article{computers15010002,
AUTHOR = {Erukude, Sai Teja and Mascarenhas, Jane and Shamir, Lior},
TITLE = {CornViT: A Multi-Stage Convolutional Vision Transformer Framework for Hierarchical Corn Kernel Analysis},
JOURNAL = {Computers},
VOLUME = {15},
YEAR = {2026},
NUMBER = {1},
ARTICLE-NUMBER = {2},
URL = {https://www.mdpi.com/2073-431X/15/1/2},
ISSN = {2073-431X},
DOI = {10.3390/computers15010002}
}
``` |