kursatkomurcu commited on
Commit
1d3d63b
·
verified ·
1 Parent(s): 20004de

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +62 -0
README.md ADDED
@@ -0,0 +1,62 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # YOLO-MT-A-Lightweight-Multi-Task-Learning-Framework-
2
+
3
+ You can find the inference and training code from bdd100k.ipynb notebook.
4
+
5
+ - **GitHub:** https://github.com/kursatkomurcu/YOLO-MT-A-Lightweight-Multi-Task-Learning-Framework-
6
+
7
+ # 🚗 YOLO-MT: Lightweight Multi-Task Learning for Unified Scene Understanding
8
+
9
+ YOLO-MT is a **YOLOv12-based lightweight multi-task learning framework** that performs **object detection, lane detection, drivable area segmentation, and scene attribute classification** within a single unified model.
10
+ It is specifically designed for **real-time autonomous driving and embedded systems** with limited computational resources.
11
+
12
+ ## ✨ Key Features
13
+ - ✅ **4 tasks in a single network**
14
+ - Object Detection
15
+ - Lane Detection
16
+ - Drivable Area Segmentation
17
+ - Scene Attribute Classification (weather, scene type, time of day)
18
+ - ⚡ **Real-time inference**
19
+ - 📦 **Only 2.9M parameters**
20
+ - 🧠 **Shared YOLOv12 backbone**
21
+ - 🔥 Optimized for **embedded and edge devices**
22
+ - 📊 Trained and evaluated on the **BDD100K dataset**
23
+
24
+ ## 🏗️ Architecture Overview
25
+ - Shared **YOLOv12 encoder**
26
+ - Lightweight **multi-branch decoder**
27
+ - **ASPP-Lite** module for context aggregation
28
+ - Separate task-specific heads for:
29
+ - Lane segmentation
30
+ - Drivable area segmentation
31
+ - Attribute classification
32
+
33
+ ## 🎯 Training Strategy
34
+ 1. Train YOLOv12 on **object detection**
35
+ 2. **Freeze the backbone**
36
+ 3. Jointly train:
37
+ - Lane detection
38
+ - Drivable area segmentation
39
+ - Attribute classification
40
+
41
+ This two-stage training ensures **strong feature reuse** while keeping the model extremely compact.
42
+
43
+ ## 📁 Dataset
44
+ - **BDD100K**
45
+ - Object detection annotations
46
+ - Lane markings
47
+ - Drivable area masks
48
+ - Scene attributes (weather, scene type, time of day)
49
+
50
+ All images are resized to **384×640** for efficient real-time processing.
51
+
52
+ ## 📌 Use Cases
53
+ - Autonomous driving perception
54
+ - Advanced Driver Assistance Systems (ADAS)
55
+ - Embedded AI systems
56
+ - Edge deployment for robotics
57
+
58
+ ---
59
+
60
+ ## Example outputs
61
+ ![example](https://github.com/kursatkomurcu/YOLO-MT-A-Lightweight-Multi-Task-Learning-Framework-/blob/main/multitasking_example.png)
62
+