Commit 3c499ce (verified) by aadex · 1 parent: 8e0958d

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +77 -0
README.md ADDED
@@ -0,0 +1,77 @@
---
license: apache-2.0
library_name: mmdetection
tags:
- object-detection
- vision-transformer
- mmdetection
- pytorch
- faster-rcnn
datasets:
- coco
metrics:
- map
---

# Faster R-CNN with RoPE-ViT Backbone for Object Detection

This is a Faster R-CNN object detector with a RoPE-ViT (Vision Transformer with Rotary Position Embeddings) backbone, trained on the COCO dataset.

## Model Description

- **Architecture:** Faster R-CNN
- **Backbone:** RoPE-ViT Tiny
- **Dataset:** COCO
- **Task:** Object Detection
- **Framework:** MMDetection

## Training Results

Bounding-box average precision on COCO, reported on a 0 to 1 scale:

| Metric | Value |
|--------|-------|
| bbox_mAP | 0.0680 |
| bbox_mAP_50 | 0.1510 |
| bbox_mAP_75 | 0.0530 |
| bbox_mAP_s (small) | 0.0360 |
| bbox_mAP_m (medium) | 0.1260 |
| bbox_mAP_l (large) | 0.0640 |
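
The `_50` and `_75` suffixes follow COCO-style evaluation: a predicted box counts as a match only if its intersection-over-union (IoU) with a ground-truth box reaches 0.50 or 0.75, while `bbox_mAP` averages over IoU thresholds from 0.50 to 0.95. A minimal sketch of that overlap measure follows, assuming boxes in `(x1, y1, x2, y2)` format. The `box_iou` helper and the sample boxes are illustrative only and are not part of this repository or of MMDetection.

```python
def box_iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    # Corners of the intersection rectangle.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Width and height are clamped at zero for non-overlapping boxes.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Hypothetical boxes: the prediction covers half of the ground truth.
pred = (0, 0, 10, 10)
gt = (0, 0, 10, 20)
iou = box_iou(pred, gt)

# This pair would count as a match at the 0.50 threshold but not at 0.75.
matches_at_50 = iou >= 0.50
matches_at_75 = iou >= 0.75
```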

## Usage

```python
from mmdet.apis import init_detector, inference_detector

# Paths to the model config and the trained checkpoint (adjust to your local copies)
config_file = 'faster_rcnn_rope_vit_tiny_coco.py'
checkpoint_file = 'best_coco_bbox_mAP_epoch_12.pth'

# Build the detector and load the weights (use device='cpu' if no GPU is available)
model = init_detector(config_file, checkpoint_file, device='cuda:0')

# Run inference on a single image
result = inference_detector(model, 'demo.jpg')
```
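
Raw detector output usually contains many low-confidence boxes, so a common next step is to keep only detections above a score threshold. The sketch below shows that step on plain Python lists. The `filter_detections` helper, the `0.3` threshold, and the toy values are assumptions for illustration and do not come from this repository.

```python
def filter_detections(bboxes, scores, labels, score_thr=0.3):
    """Keep (box, score, label) triples whose score is at least score_thr."""
    return [
        (box, score, label)
        for box, score, label in zip(bboxes, scores, labels)
        if score >= score_thr
    ]


# Toy values standing in for model output (not real predictions).
bboxes = [[10.0, 20.0, 110.0, 220.0], [5.0, 5.0, 50.0, 50.0]]
scores = [0.82, 0.12]
labels = [0, 16]

kept = filter_detections(bboxes, scores, labels)
for box, score, label in kept:
    print(f"label={label} score={score:.2f} box={box}")
```

Assuming MMDetection 3.x, where `inference_detector` returns a data sample with a `pred_instances` field, the inputs could be taken from `result.pred_instances.bboxes.tolist()`, `result.pred_instances.scores.tolist()`, and `result.pred_instances.labels.tolist()`. Older MMDetection versions return a different structure, so check the installed version first.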

## Training Configuration

The model was trained with the following configuration:
- Input size: 512x512 pixels
- Training epochs: 12
- Optimizer: SGD with momentum
- Learning rate scheduler: Step decay
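
The settings above could be expressed roughly as follows in an MMDetection 3.x style config, which is plain Python built from dictionaries. This is a sketch under stated assumptions, not the config used for this checkpoint: only the 12 epochs, the 512x512 input size, SGD with momentum, and step decay come from this README. The learning rate, momentum, weight decay, decay milestones, and `keep_ratio` setting are placeholders borrowed from common 12-epoch schedules.

```python
# Stated in this README.
max_epochs = 12
input_size = (512, 512)

# Epoch-based training loop; val_interval=1 is an assumption.
train_cfg = dict(type='EpochBasedTrainLoop', max_epochs=max_epochs, val_interval=1)

# SGD with momentum. The lr, momentum and weight_decay values are assumed.
optim_wrapper = dict(
    type='OptimWrapper',
    optimizer=dict(type='SGD', lr=0.02, momentum=0.9, weight_decay=0.0001),
)

# Step decay: multiply the learning rate by gamma at each milestone epoch.
# The milestones and gamma are assumed, not taken from the actual run.
param_scheduler = [
    dict(
        type='MultiStepLR',
        begin=0,
        end=max_epochs,
        by_epoch=True,
        milestones=[8, 11],
        gamma=0.1,
    ),
]

# Resize step for the data pipeline; keep_ratio=False is an assumption.
resize_step = dict(type='Resize', scale=input_size, keep_ratio=False)
```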

## Citation

If you use this model, please cite:

```bibtex
@misc{rope-vit-detection,
  author = {VLG IITR},
  title = {Faster R-CNN with RoPE-ViT for Object Detection},
  year = {2026},
  publisher = {Hugging Face},
}
```

## License

This model is released under the Apache 2.0 license.