cvtechniques
/

BikeLaneDetection

Model card Files Files and versions

xet

Community

dyldang commited on Mar 17

Commit

a70d818

verified ·

1 Parent(s): 4c72429

Update README.md

Browse files

Files changed (1) hide show

README.md +15 -14

README.md CHANGED Viewed

@@ -16,7 +16,6 @@ The goal of this project was not only to train a model, but to understand how da
 - Supporting transportation or urban planning research
 - Analyzing cyclist environments and road conditions
-This model is best suited for **research and learning purposes**, not real-world deployment.
 ---
@@ -25,6 +24,7 @@ This model is best suited for **research and learning purposes**, not real-world
 ### Dataset Source
 Roboflow Universe – Bike Lane Computer Vision Dataset
 ---
@@ -53,7 +53,7 @@ This dataset shows **strong class imbalance**, where some classes appear very fr
 ### Annotation Process
 The dataset included pre-existing YOLO-format bounding box annotations.
-Although the dataset was pre-annotated, I reviewed samples to check for consistency and quality. I observed that some classes (such as "car" and "vehicle") overlap conceptually, which may introduce ambiguity during training. No major corrections were made, but this overlap influenced how results were interpreted.
 I reviewed a subset of images to validate annotation quality, focusing on:
 - alignment of bounding boxes
@@ -61,7 +61,7 @@ I reviewed a subset of images to validate annotation quality, focusing on:
 No major corrections were made. This allowed me to focus on model training and evaluation, but it also represents a limitation since annotation quality was not significantly improved.
-This project therefore emphasizes **evaluation and understanding of model performance** rather than dataset refinement.
 ---
@@ -115,13 +115,14 @@ Training was performed in Google Colab using an RTX 3070. Training took approxim
 - Recall: ~0.38
 - mAP50: ~0.48
-These metrics show that the model is **highly precise but has low recall**.
 This means:
 - The model is usually correct when it makes predictions
-- But it misses many objects, especially harder or less frequent ones
 Performance varied significantly across classes. Common classes such as "Vehicle" achieved higher precision and recall, while underrepresented classes like "Bicycle" and "Car" performed poorly due to limited training samples.
 ---
@@ -145,7 +146,7 @@ The confusion matrix highlights where the model struggles, particularly between
 ![Training Results](./results.png)
-The training curve shows steady learning, but performance plateaus due to dataset limitations.
 ---
@@ -155,11 +156,11 @@ The training curve shows steady learning, but performance plateaus due to datase
 This image shows two types of errors made by the model.
-In the first image, the model incorrectly detects a **vehicle** where there is actually part of a building. This is likely because the building has visual features (such as rectangular shapes and edges) that resemble vehicles in the training data.
-The second model identifies a **bike lane** where the road marking appears to be a bus lane. This suggests that the model has difficulty distinguishing between different types of lane markings, especially when they share similar visual patterns.
-These errors highlight an important limitation: the model relies heavily on visual similarity rather than deeper contextual understanding. Because the dataset contains limited variation and strong class imbalance, the model may generalize incorrectly when encountering unfamiliar or ambiguous scenes.
 ---
@@ -208,7 +209,7 @@ The model may perform poorly under:
 ### Additional Observations
-The model sometimes misclassifies lane types (e.g., solid vs shared lanes) when markings are partially broken or unclear. This suggests the model relies heavily on strong visual patterns.
 ---
@@ -223,19 +224,19 @@ This model should **not** be used for:
 ### Sample Size Limitations
-Some classes (e.g., bicycle and car) have extremely limited training data, making reliable detection difficult. This contributes directly to low recall.
 ---
 ## Final Reflection
-This project demonstrates that model performance is heavily dependent on dataset quality.
 Even with a strong model like YOLOv11, issues such as:
 - class imbalance
 - small dataset size
 - annotation limitations
-can significantly impact results.
-Overall, this project highlights the importance of **data quality, not just model choice**, in computer vision applications.

 - Supporting transportation or urban planning research
 - Analyzing cyclist environments and road conditions
 ---
 ### Dataset Source
 Roboflow Universe – Bike Lane Computer Vision Dataset
+https://universe.roboflow.com/bike-lane/bike-lane
 ---
 ### Annotation Process
 The dataset included pre-existing YOLO-format bounding box annotations.
+Although the dataset was pre-annotated, I reviewed samples in order to check for consistency and quality. I observed that some classes such as "car" and "vehicle" overlap conceptually, which may introduce ambiguity during training. No major corrections were made, but this overlap influenced how results were interpreted.
 I reviewed a subset of images to validate annotation quality, focusing on:
 - alignment of bounding boxes
 No major corrections were made. This allowed me to focus on model training and evaluation, but it also represents a limitation since annotation quality was not significantly improved.
+This project therefore, emphasizes **evaluation and understanding of model performance** rather than dataset refinement which is something I wanted in this process.
 ---
 - Recall: ~0.38
 - mAP50: ~0.48
+According to the results, these metrics show that the model is **highly precise but has low recall**.
 This means:
 - The model is usually correct when it makes predictions
+- It misses many objects especially harder or less frequent ones
 Performance varied significantly across classes. Common classes such as "Vehicle" achieved higher precision and recall, while underrepresented classes like "Bicycle" and "Car" performed poorly due to limited training samples.
+This made it so that the performance differences across classes were influenced by class imbalance, with larger classes performing more reliably.
 ---
 ![Training Results](./results.png)
+The training curve shows steady learning, but performance plateaus due to the dataset limitations.
 ---
 This image shows two types of errors made by the model.
+In the first image, the model incorrectly detects a vehicle where there is actually part of a building. This is likely because the building has visual features such as rectangular shapes and edges that resemble vehicles in the training data.
+The second model identifies a bike lane where the road marking appears to be a bus lane. This suggests that the model has difficulty distinguishing between different types of lane markings, especially when they share similar visual patterns.
+These errors highlight an important limitation: the model relies heavily on visual similarity rather than deeper contextual understanding. Since the dataset contains limited variation and strong class imbalance, the model may generalize incorrectly when encountering unfamiliar scenes.
 ---
 ### Additional Observations
+The model sometimes misclassifies lane types such as solid vs shared lane, when markings are partially broken or unclear. This suggests the model relies heavily on strong visual patterns.
 ---
 ### Sample Size Limitations
+Some classes like bicycles and car have extremely limited training data, making reliable detection difficult. This contributes directly to low recall.
 ---
 ## Final Reflection
+I found that this project demonstrates that model performance is heavily dependent on dataset quality.
 Even with a strong model like YOLOv11, issues such as:
 - class imbalance
 - small dataset size
 - annotation limitations
+It can significantly impact results.
+In the end this project to me highlights the importance of **data quality, not just model choice**, in computer vision applications.