--- library_name: libreyolo pipeline_tag: object-detection license: mit tags: - libreyolo - yolov9 - visdrone - aerial-imagery - object-detection datasets: - Voxel51/VisDrone2019-DET --- # ander2221/visdrone-yolo9-preview YOLOv9-t fine-tuned on VisDrone2019-DET aerial imagery using [LibreYOLO](https://github.com/LibreYOLO/libreyolo). Ten classes (pedestrian, people, bicycle, car, van, truck, tricycle, awning-tricycle, bus, motor), top-down drone perspective. **Companion use case:** [LibreYOLO/use-cases/visdrone-finetune](https://github.com/LibreYOLO/use-cases/tree/main/visdrone-finetune). ## Training - size: `t` - imgsz: `384` - epochs: `5` - dataset: VisDrone2019-DET via Voxel51's HuggingFace mirror - compute: Apple Metal Performance Shaders (MPS, M-series GPU) ## Metrics ```json {} ``` ## Usage — Python ```python from huggingface_hub import hf_hub_download from libreyolo import LibreYOLO ckpt = hf_hub_download(repo_id="ander2221/visdrone-yolo9-preview", filename="visdrone.pt") model = LibreYOLO(ckpt) result = model("aerial.jpg") for box, cls, conf in zip(result.boxes.xyxy, result.boxes.cls, result.boxes.conf): print(box, ["pedestrian","people","bicycle","car","van","truck","tricycle","awning-tricycle","bus","motor"][int(cls)], float(conf)) ``` ## Usage — ONNX (browser, edge, cross-runtime) ```python import onnxruntime as ort from huggingface_hub import hf_hub_download onnx = hf_hub_download(repo_id="ander2221/visdrone-yolo9-preview", filename="visdrone.onnx") session = ort.InferenceSession(onnx, providers=["CPUExecutionProvider"]) # Preprocess image to (1, 3, 384, 384) float32 in [0,1] then: out = session.run(None, {"images": preprocessed}) ``` A live browser demo using this ONNX is at https://libreyolo.github.io/use-cases/visdrone-finetune/demo/ (zero-install, runs locally in Chrome via WebGPU/onnxruntime-web). ## Classes (index → name) | idx | name | |---|---| | 0 | pedestrian | | 1 | people | | 2 | bicycle | | 3 | car | | 4 | van | | 5 | truck | | 6 | tricycle | | 7 | awning-tricycle | | 8 | bus | | 9 | motor | ## License MIT (the model file). Dataset (VisDrone2019-DET) is governed by its own [license terms](http://aiskyeye.com/) — please review for your use case.