| # Vision transformer - RVSA for object detection | |
| ## Table of Contents | |
| - [Features](#Features) | |
| - [Getting Started](#getting-started) | |
| - [Prerequisites](#prerequisites) | |
| - [Installation](#installation) | |
| ## Features | |
| Load object detection model - remote sensing image. | |
| * Visualize | |
| ## Getting Started | |
| Below is the list of dependencies and steps to use the repository | |
| ### Prerequisites | |
| List any software or libraries that are required to run your project. | |
| * Python 3 | |
| * mmcv | |
| * mmrotate | |
| * torch | |
| * mmdetection | |
| ### Installation | |
| Step-by-step instructions on how to install your project. | |
| 1. Install environment: | |
| - Install torch | |
| * CPU: | |
| ``` | |
| pip install -q torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 --index-url https://download.pytorch.org/whl/cpu | |
| pip install -q mmcv==2.0.1 -f https://download.openmmlab.com/mmcv/dist/cpu/torch2.0/index.html | |
| ``` | |
| * GPU: | |
| ``` | |
| pip install -q torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 | |
| pip install -q mmcv==2.0.1 -f https://download.openmmlab.com/mmcv/dist/cu117/torch2.0/index.html | |
| ``` | |
| - Install other dependencies: | |
| ``` | |
| pip install -q "mmdet>=3.0.0rc6,<3.1.0" | |
| git clone https://github.com/open-mmlab/mmrotate -b 1.x | |
| cd mmrotate | |
| pip install -q -e . | |
| cd .. | |
| git clone https://github.com/ViTAE-Transformer/MTP | |
| cp -r ./MTP/RS_Tasks_Finetune/Rotated_Detection/mmrotate1.x/mmrotate/* ./mmrotate/mmrotate | |
| git clone https://huggingface.co/datasets/KhangTruong/NWPU-Caption | |
| tar -xf NWPU-Caption/02_NWPU_RESISC45.tar | |
| tar -xf NWPU-Caption/02_NWPU_caption.tar | |
| mv 02_NWPU_RESISC45 data | |
| mv 02_NWPU_caption data | |
| python install.py --root . | |
| ``` | |
| 2. Change the directory to *mmrotate*, which is inside the current working directory | |
| ``` | |
| cd mmrotate | |
| ``` | |
| 3. Run the inference | |
| ``` | |
| python ../extract_feature.py -n ../data/airplane/airplane_002.jpg | |
| ``` | |
| 4. It will produce an image in *vis_data*, which is the labelled image. | |