rahulkumar11062003 committed on
Commit 5e01552 · verified · 1 Parent(s): a50c70c

Update README.md

  1. README.md +141 -15
README.md CHANGED
@@ -1,20 +1,146 @@
  ---
- title: Smartvision Ai
- emoji: 🚀
- colorFrom: red
- colorTo: red
- sdk: docker
- app_port: 8501
- tags:
- - streamlit
- pinned: false
- short_description: end to end computer vision system
- license: mit
  ---
 
- # Welcome to Streamlit!
 
- Edit `/src/streamlit_app.py` to customize this app to your heart's desire. :heart:
 
- If you have any questions, checkout our [documentation](https://docs.streamlit.io) and [community
- forums](https://discuss.streamlit.io).
+ # 🧠 SmartVision AI
+ ### Intelligent Multi-Class Object Recognition System
+
+ SmartVision AI is an **end-to-end Computer Vision application** that performs
+ **image classification**, **object detection**, and **real-time inference** using
+ state-of-the-art deep learning models.
+ The project demonstrates the complete AI lifecycle, from **model training** to
+ **optimized deployment** using **Streamlit**.
+
+ ---
+
+ ## 🚀 Key Features
+
+ - 🖼️ **Image Classification**
+   - Custom-trained deep learning models
+   - Top-5 prediction display with confidence scores
+   - Side-by-side comparison of multiple CNN architectures
+
+ - 📦 **Object Detection**
+   - Pretrained YOLOv8 model for real-time object detection
+   - Bounding boxes, class labels, and confidence scores
+   - Adjustable confidence threshold
+
+ - 📸 **Live Webcam Detection (Optimized)**
+   - Real-time detection using webcam
+   - FPS monitoring and CPU-friendly optimizations
+   - Frame skipping and resolution scaling
+
+ - 📊 **Model Performance Dashboard**
+   - Accuracy comparison (Train / Validation / Test)
+   - Inference speed analysis
+   - Visual performance insights
+
+ - ⚡ **Optimized Inference**
+   - Lightweight models for CPU execution
+   - Streamlit caching for faster loading
+   - Performance-focused design decisions
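
The "Top-5 prediction display with confidence scores" feature listed above can be illustrated with a small sketch. This is not code from the commit: it assumes the classifier returns one raw score (logit) per class, and the helper names `softmax` and `top_k`, the class names, and the scores are all hypothetical. It uses plain Python so it runs without PyTorch.

```python
import math


def softmax(logits):
    """Convert raw model scores (logits) into probabilities that sum to 1."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(logits, class_names, k=5):
    """Return the k most probable (class_name, probability) pairs, best first."""
    probs = softmax(logits)
    ranked = sorted(zip(class_names, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Hypothetical logits for a 6-class toy problem (the README mentions 25 classes).
names = ["car", "dog", "cat", "bus", "bird", "boat"]
scores = [2.0, 0.5, 1.0, -1.0, 0.0, 3.0]

for name, prob in top_k(scores, names):
    print(f"{name}: {prob:.1%}")
```

In a PyTorch app the same result would more likely come from `torch.softmax` followed by `torch.topk`. The pure-Python version only shows the logic.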
+
+ ---
+
+ ## 🏗️ Model Architectures Used
+
+ ### 🔹 Image Classification
+ - **VGG16 (Custom Trained)**
+ - **ResNet50**
+ - **MobileNetV2**
+ - **EfficientNet-B0**
+
+ ### 🔹 Object Detection
+ - **YOLOv8 (Pretrained on COCO Dataset)**
+
+ ---
+
+ ## 📂 Dataset Information
+
+ - **Image Classification Dataset**
+   - Domain-specific dataset
+   - 25 object classes
+   - Train / Validation / Test split
+   - Image preprocessing and augmentation applied
+
+ - **Object Detection Dataset**
+   - COCO Dataset
+   - 80 general-purpose object classes
+   - Bounding box annotations
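
The README does not state how the Train / Validation / Test split was produced. As a rough illustration only, a reproducible random split could look like the sketch below. The 70/15/15 ratios, the seed, the file names, and the function name `split_dataset` are assumptions, not details from the project.

```python
import random


def split_dataset(items, train=0.7, val=0.15, seed=42):
    """Shuffle items reproducibly and cut them into train / val / test lists.

    The ratios are illustrative assumptions; whatever remains after the
    train and validation slices becomes the test set.
    """
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # local RNG, global state untouched
    n_train = int(len(shuffled) * train)
    n_val = int(len(shuffled) * val)
    train_set = shuffled[:n_train]
    val_set = shuffled[n_train:n_train + n_val]
    test_set = shuffled[n_train + n_val:]
    return train_set, val_set, test_set


# Hypothetical file names standing in for a real image folder.
images = [f"img_{i:03d}.jpg" for i in range(100)]
train_set, val_set, test_set = split_dataset(images)
print(len(train_set), len(val_set), len(test_set))
```

For a multi-class dataset, a per-class (stratified) split is often preferable so that every class appears in all three subsets.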
+
+ ---
+
+ ## 🛠️ Tech Stack
+
+ **Programming Language**
+ - Python 🐍
+
+ **Deep Learning & Computer Vision**
+ - PyTorch
+ - Torchvision
+ - Ultralytics YOLOv8
+ - OpenCV
+
+ **Data Analysis & Visualization**
+ - NumPy
+ - Pandas
+ - Matplotlib
+ - Seaborn
+
+ **Web & Deployment**
+ - Streamlit
+ - VS Code
+ - Git & GitHub
+
  ---
+
+ ## ⚡ Performance Optimization Techniques
+
+ - Frame skipping for real-time webcam inference
+ - Reduced image resolution for faster detection
+ - Lightweight YOLOv8n model for CPU execution
+ - Streamlit resource caching
+ - Confidence-based filtering of predictions
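
Three of the techniques listed above (frame skipping, reduced resolution, and confidence-based filtering) are simple enough to sketch without a camera or a model. The code below is an illustration under stated assumptions, not the implementation in `app.py`: the function names, the skip interval of 3, the 640-pixel limit, the 0.5 threshold, and the sample detections are all hypothetical.

```python
def should_process(frame_index, skip=3):
    """Frame skipping: run the detector only on every `skip`-th frame."""
    return frame_index % skip == 0


def scaled_size(width, height, max_side=640):
    """Resolution scaling: shrink so the longer side is at most `max_side`,
    preserving aspect ratio. Frames already small enough are left unchanged."""
    scale = min(1.0, max_side / max(width, height))
    return round(width * scale), round(height * scale)


def filter_by_confidence(detections, threshold=0.5):
    """Keep only detections whose confidence meets the threshold."""
    return [d for d in detections if d["confidence"] >= threshold]


# Hypothetical detector output for one frame (labels and scores are made up).
raw = [
    {"label": "person", "confidence": 0.91},
    {"label": "chair", "confidence": 0.32},
    {"label": "laptop", "confidence": 0.58},
]

processed = [i for i in range(10) if should_process(i, skip=3)]
print("frames sent to the detector:", processed)
print("1920x1080 scaled to:", scaled_size(1920, 1080))
print("kept:", [d["label"] for d in filter_by_confidence(raw, threshold=0.5)])
```

In a real webcam loop, the skipped frames would typically reuse the most recent detections so the overlay does not flicker, and the target size would be passed to the resize call before inference.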
+
  ---
 
+ ## 📁 Project Structure
+ SmartVisionAI/
+ │
+ ├── app.py # Main Streamlit application
+ ├── requirements.txt # Python dependencies
+ ├── README.md # Project documentation
+ ├── Image.txt/ # Images, icons, logos
+ ├── yolo.ipynb
+ └── smartvisionAI.ipynb # Downloading and training process of data
+
+ ## Screenshots
+ <img width="1920" height="1080" alt="Screenshot (176)" src="https://github.com/user-attachments/assets/feb97730-b862-4504-ac37-bc733fe21aba" />
+
+ <img width="1920" height="1080" alt="Screenshot (178)" src="https://github.com/user-attachments/assets/60abc70b-3d2c-4aee-aaf2-53dade77d7e3" />
+ Demo Images
+
+ <img width="1920" height="1080" alt="Screenshot (177)" src="https://github.com/user-attachments/assets/5c03e4b4-eaa5-4eb8-942f-b3fae16db210" />
+ Detection
+
+ <img width="1253" height="825" alt="Screenshot 2025-12-14 at 08-17-06 SmartVision AI - Intelligent Multi-Class Object Recognition System" src="https://github.com/user-attachments/assets/158bdfca-c160-4968-a508-d3cd47878768" />
+
+
+
+
+ ## 📌 Note on Model Files
+
+ - Due to size constraints, trained model weights (.pt, .pth) are not included
+ in this repository.
+
+ ## 🎓 Academic & Practical Relevance
+
+ - This project was built to:
+
+ - Demonstrate practical Deep Learning & Computer Vision skills
+
+ - Showcase model deployment and optimization
+
+ - Serve as a portfolio project for interviews and evaluations
 
+ ## 👨‍💻 Developer
 
+ Rahul Kumar
+ B.Tech in Information Technology
+ IIEST Shibpur