---
language:
  - en
license: apache-2.0
tags:
  - object-detection
  - AgTech
  - transformers
library_name: pytorch
inference: false
datasets:
  - Laudando-Associates-LLC/pucks
base_model: Laudando-Associates-LLC/d-fine
base_model_relation: finetune
model-index:
- name: D-FINE Medium
  results:
  - task:
      type: object-detection
      name: Object Detection
    dataset:
      type: Laudando-Associates-LLC/pucks
      name: L&A Pucks Dataset
      config: default
      split: validation
    metrics:
    - type: mean_average_precision
      name: mAP@[IoU=0.50:0.95]
      value: 0.840
    - type: mean_average_precision
      name: mAP@0.50
      value: 0.992
    - type: mean_average_precision
      name: mAP@0.75
      value: 0.974
    - type: recall
      name: AR@[IoU=0.50:0.95 | maxDets=100]
      value: 0.894
    - type: recall
      name: AR@0.50
      value: 1.000
    - type: recall
      name: AR@0.75
      value: 0.988
    - type: f1
      value: 0.924
    - type: precision
      value: 0.898
    - type: recall
      value: 0.952
    - type: iou
      value: 0.784
---

<h1 align="center"><strong>D-FINE Medium</strong></h1>

<p align="center">
  <a href="https://huggingface.co/Laudando-Associates-LLC/d-fine-medium">
    <img src="https://img.shields.io/badge/HuggingFace-Model-yellow?logo=huggingface&style=for-the-badge">
  </a>
</p>

This repository contains the Medium variant of [D-FINE](https://arxiv.org/abs/2410.13842), a real-time object detector, fine-tuned on the [L&A Pucks Dataset](https://huggingface.co/datasets/Laudando-Associates-LLC/pucks).

<p align="center">
  <img src="assets/medium.png" alt="Medium Detections" />
</p>

## Try it in the Browser

You can test this model using our interactive Gradio demo:

<p align="center">
  <a href="https://huggingface.co/spaces/Laudando-Associates-LLC/d-fine-demo">
    <img src="https://img.shields.io/badge/Launch%20Demo-Gradio-FF4B4B?logo=gradio&logoColor=white&style=for-the-badge">
  </a>
</p>

## Model Overview

* Architecture: D-FINE Medium

* Parameters: 19.6M

* Performance:

  - mAP@[0.50:0.95]: 0.840

  - mAP@0.50: 0.992

  - AR@[0.50:0.95]: 0.894

  - F1 Score: 0.924

* Framework: PyTorch / ONNX

* Training Hardware: 2× NVIDIA RTX A6000 GPUs

## Download

| Format | Link |
|:--------:|:------:|
| ONNX   | <a href="https://huggingface.co/Laudando-Associates-LLC/d-fine-medium/resolve/main/model.onnx"><img src="https://img.shields.io/badge/-ONNX-005CED?style=for-the-badge&logo=onnx&logoColor=white"></a> |
| PyTorch | <a href="https://huggingface.co/Laudando-Associates-LLC/d-fine-medium/resolve/main/pytorch_model.bin"><img src="https://img.shields.io/badge/PyTorch-EE4C2C?style=for-the-badge&logo=pytorch&logoColor=white"></a> |
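
The files above can also be fetched from a script. The following is a minimal sketch using only the Python standard library; the helper names `resolve_url` and `download` are hypothetical, and the URL pattern is simply the one used by the links in the table.

```python
import urllib.request
from typing import Optional

REPO_ID = "Laudando-Associates-LLC/d-fine-medium"


def resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build a direct download URL following the pattern of the links above."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"


def download(filename: str, dest: Optional[str] = None) -> str:
    """Download one file from the model repository and return the local path."""
    dest = dest or filename
    urllib.request.urlretrieve(resolve_url(REPO_ID, filename), dest)
    return dest


# Example (requires network access):
# download("model.onnx")
# download("pytorch_model.bin")
```

If the `huggingface_hub` package is installed, its `hf_hub_download` function is an alternative that also handles caching.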

## Usage

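This model relies on the shared [D-FINE processor](https://huggingface.co/Laudando-Associates-LLC/d-fine) for preprocessing. Both the processor and the model load custom code from their repositories, so `trust_remote_code=True` is required: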

```python
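from PIL import Image
from transformers import AutoProcessor, AutoModel

# Load the shared processor
processor = AutoProcessor.from_pretrained("Laudando-Associates-LLC/d-fine", trust_remote_code=True)

# Load the model
model = AutoModel.from_pretrained("Laudando-Associates-LLC/d-fine-medium", trust_remote_code=True)

# Load an image ("image.jpg" is a placeholder path; a PIL image input is assumed)
image = Image.open("image.jpg").convert("RGB")

# Preprocess the image
inputs = processor(image)

# Run inference; conf_threshold sets the detection confidence cutoff
outputs = model(**inputs, conf_threshold=0.4)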
```
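
The structure of `outputs` is defined by the model's remote code and is not documented here. As a rough illustration of what a confidence cutoff such as `conf_threshold=0.4` does, the sketch below filters a list of hypothetical detections. The dictionary layout, the `filter_by_confidence` helper, and the inclusive comparison are all assumptions made for the example, not the model's actual API.

```python
from typing import Dict, List


def filter_by_confidence(detections: List[Dict], threshold: float = 0.4) -> List[Dict]:
    """Keep detections whose score is at least `threshold` (inclusive by assumption)."""
    return [d for d in detections if d["score"] >= threshold]


# Hypothetical detections: boxes as (x_min, y_min, x_max, y_max) in pixels.
detections = [
    {"box": (10, 20, 50, 60), "score": 0.91},
    {"box": (15, 25, 48, 58), "score": 0.35},
    {"box": (200, 80, 240, 120), "score": 0.40},
]

kept = filter_by_confidence(detections, threshold=0.4)
print([d["score"] for d in kept])  # [0.91, 0.4]
```

Raising the threshold trades recall for precision: fewer boxes are returned, but those that remain are more likely to be correct.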

## Evaluation

This model was trained and evaluated on the [L&A Pucks Dataset](https://huggingface.co/datasets/Laudando-Associates-LLC/pucks).
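
The reported F1 score is consistent with the reported precision (0.898) and recall (0.952), since F1 is the harmonic mean of the two. A short check, using only the values listed in this card's metadata:

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in this model card.
precision, recall = 0.898, 0.952
print(round(f1_score(precision, recall), 3))  # 0.924
```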

## License

This model is licensed under the [Apache License 2.0](https://github.com/Peterande/D-FINE/blob/master/LICENSE).

## Citation

If you use `D-FINE` or its methods in your work, please cite the following BibTeX entry:

```bibtex
@misc{peng2024dfine,
      title={D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement},
      author={Yansong Peng and Hebei Li and Peixi Wu and Yueyi Zhang and Xiaoyan Sun and Feng Wu},
      year={2024},
      eprint={2410.13842},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
```