HugoHE commited on
Commit
16a0d2c
·
verified ·
1 Parent(s): 38c4bf7

Add comprehensive model card for faster-rcnn-bdd-vanilla

Browse files
Files changed (1) hide show
  1. README.md +133 -0
README.md ADDED
@@ -0,0 +1,133 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ library_name: pytorch
4
+ tags:
5
+ - faster-rcnn
6
+ - object-detection
7
+ - computer-vision
8
+ - pytorch
9
+ - bdd100k
10
+ - autonomous-driving
11
+ - BDD 100K
12
+ - from-scratch
13
+ pipeline_tag: object-detection
14
+ datasets:
15
+ - bdd100k
16
+ widget:
17
+ - src: https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bounding-boxes-sample.png
18
+ example_title: "Sample Image"
19
+ model-index:
20
+ - name: faster-rcnn-bdd-vanilla
21
+ results:
22
+ - task:
23
+ type: object-detection
24
+ dataset:
25
+ type: bdd100k
26
+ name: Berkeley DeepDrive (BDD) 100K
27
+ metrics:
28
+ - type: mean_average_precision
29
+ name: mAP
30
+ value: "TBD"
31
+ ---
32
+
33
+ # Faster R-CNN - Berkeley DeepDrive (BDD) 100K Vanilla
34
+
35
+ Faster R-CNN model trained from scratch on Berkeley DeepDrive (BDD) 100K dataset for object detection in autonomous driving scenarios.
36
+
37
+ ## Model Details
38
+
39
+ - **Model Type**: Faster R-CNN Object Detection
40
+ - **Dataset**: Berkeley DeepDrive (BDD) 100K
41
+ - **Training Method**: trained from scratch
42
+ - **Framework**: PyTorch
43
+ - **Task**: Object Detection
44
+
45
+ ## Dataset Information
46
+
47
+ This model was trained on the **Berkeley DeepDrive (BDD) 100K** dataset, which contains the following object classes:
48
+
49
+ car, truck, bus, motorcycle, bicycle, person, traffic light, traffic sign, train, rider
50
+
51
+ ### Dataset-specific Details:
52
+
53
+ **Berkeley DeepDrive (BDD) 100K Dataset:**
54
+ - 100,000+ driving images with diverse weather and lighting conditions
55
+ - Designed for autonomous driving applications
56
+ - Contains urban driving scenarios from multiple cities
57
+ - Annotations include bounding boxes for vehicles, pedestrians, and traffic elements
58
+
59
+ ## Usage
60
+
61
+ This model can be used with PyTorch and common object detection frameworks:
62
+
63
+ ```python
64
+ import torch
65
+ import torchvision.transforms as transforms
66
+ from PIL import Image
67
+
68
+ # Load the model (example using torchvision)
69
+ model = torch.load('path/to/model.pth')
70
+ model.eval()
71
+
72
+ # Prepare your image
73
+ transform = transforms.Compose([
74
+ transforms.ToTensor(),
75
+ ])
76
+
77
+ image = Image.open('path/to/image.jpg')
78
+ image_tensor = transform(image).unsqueeze(0)
79
+
80
+ # Run inference
81
+ with torch.no_grad():
82
+ predictions = model(image_tensor)
83
+
84
+ # Process results
85
+ boxes = predictions[0]['boxes']
86
+ scores = predictions[0]['scores']
87
+ labels = predictions[0]['labels']
88
+ ```
89
+
90
+ ## Model Performance
91
+
92
+ This model was trained from scratch on the Berkeley DeepDrive (BDD) 100K dataset using Faster R-CNN architecture.
93
+
94
+ ## Architecture
95
+
96
+ **Faster R-CNN** (Region-based Convolutional Neural Network) is a two-stage object detection framework:
97
+
98
+ 1. **Region Proposal Network (RPN)**: Generates object proposals
99
+ 2. **Fast R-CNN detector**: Classifies proposals and refines bounding box coordinates
100
+
101
+ Key advantages:
102
+ - High accuracy object detection
103
+ - Precise localization
104
+ - Good performance on small objects
105
+ - Well-established architecture with extensive research backing
106
+
107
+ ## Intended Use
108
+
109
+ - **Primary Use**: Object detection in autonomous driving scenarios
110
+ - **Suitable for**: Research, development, and deployment of object detection systems
111
+ - **Limitations**: Performance may vary on images significantly different from the training distribution
112
+
113
+ ## Citation
114
+
115
+ If you use this model, please cite:
116
+
117
+ ```bibtex
118
+ @article{ren2015faster,
119
+ title={Faster r-cnn: Towards real-time object detection with region proposal networks},
120
+ author={Ren, Shaoqing and He, Kaiming and Girshick, Ross and Sun, Jian},
121
+ journal={Advances in neural information processing systems},
122
+ volume={28},
123
+ year={2015}
124
+ }
125
+ ```
126
+
127
+ ## License
128
+
129
+ This model is released under the MIT License.
130
+
131
+ ## Keywords
132
+
133
+ Faster R-CNN, Object Detection, Computer Vision, BDD 100K, Autonomous Driving, Deep Learning, Two-Stage Detection