File size: 1,846 Bytes

597fcce

---
license: apache-2.0
tags:
- computer-vision
- object-detection
- rf-detr
- face-detection
datasets:
- fareselmenshawii/face-detection-dataset
metrics:
- mAP
---

# Model Card for RFDETR-Medium - Face Detection Finetune

This model is a fine-tuned version of **RFDETR-Medium** (Real-time Detection Transformer) specifically optimized for the detection of human faces. It leverages a DINOv2 backbone for high-quality feature extraction and a transformer-based head for NMS-free, end-to-end detection.

## Model Details

### Model Description

- **Model type:** Object Detection
- **Task:** Human Face Detection
- **Finetuned from model:** RFDETR-Medium
- **Language(s):** N/A (Computer Vision)
- **License:** Apache 2.0

---

## Training Details

### Training Hyperparameters
The following hyperparameters were used during the fine-tuning process:

* **Epochs:** 20
* **Batch Size:** 16
* **Learning Rate:** $1 \times 10^{-4}$
* **Input Image Size:** $576 \times 576$

### Evalutation Results

* **mAP@50:** 0.9
* **mAP@5095:** 0.6

### Training Data
The model was trained on the Face Detection Dataset (Kaggle), which contains approximately 16,700 images with bounding box annotations for human faces.

### Model Sources

- **Repository:** [Roboflow RF-DETR GitHub](https://github.com/roboflow/rf-detr)
- **Dataset:** [Face Detection Dataset (Kaggle)](https://www.kaggle.com/datasets/fareselmenshawii/face-detection-dataset/data)

## Uses

### Direct Use
This model is designed for **real-time human face detection** in images and video streams.

## How to Get Started with the Model

```python
from rfdetr import RFDETRMedium

# Load your fine-tuned weights
model = RFDETRMedium(device="cuda", pretrain_weights="rfdetr_medium_face.pth")
model.optimize_for_inference()

# Run inference
results = model.predict("input_image.jpg", threshold=0.5)