You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Model Card for Phone Detection (RF-DETR)

This model performs single-class object detection to identify and localize mobile phones in images.
It is fine-tuned from stevenbucaille/rf-detr-base using the harshdadiya-wappnet/phone_detection dataset.


Model Details

Model Description

  • Developed by: Harsh Dadiya
  • Funded by: Wappnet Systems
  • Shared by: harshdadiya-wappnet
  • Model type: Object Detection
  • Language(s): Not applicable (vision-only)
  • License: Apache-2.0
  • Finetuned from model: stevenbucaille/rf-detr-base

Model Sources

  • Repository: Hugging Face Hub

Dataset Details

Dataset Split Statistics

Class Name Total Train Validation Test
mobile_phone 293 198 68 27
  • Number of classes: 1
  • Annotation format: COCO
  • Task: Bounding box detection

Uses

Direct Use

  • Detecting mobile phones in still images
  • Bounding box localization for computer vision pipelines
  • Benchmarking single-class object detection models

Downstream Use

  • Further fine-tuning on domain-specific phone imagery
  • Integration into monitoring, compliance, or vision-based analysis systems

Out-of-Scope Use

  • Multi-class object detection
  • Video-based temporal detection
  • Classification-only tasks without localization

Bias, Risks, and Limitations

  • Trained on a single-class dataset, limiting generalization to other object categories
  • Performance may degrade on images with extreme occlusion, motion blur, or very small phones
  • Dataset size is relatively small and may not cover all real-world variations

Training Details

Training Data

  • Dataset: harshdadiya-wappnet/phone_detection
  • Classes: mobile_phone
  • Format: COCO

Training Procedure

  • Fine-tuning performed on RF-DETR base model
  • Training tracked using Weights & Biases (W&B)

Training Hyperparameters

  • Training regime: Not specified

Training Metrics and Run History

Run Summary (Final Epoch: 39)

Metric Value
Loss / Train 2.52222
Loss / Test 4.15493
Base AP@50 0.96172
Base AP@50โ€“90 0.79702
Base AR@50โ€“90 0.85217
EMA AP@50 0.97117
EMA AP@50โ€“90 0.84201
EMA AR@50โ€“90 0.88551

Training Curves

The following plots were recorded during fine-tuning:

  • Training Loss vs Epoch
  • Validation Loss vs Epoch
  • AP@50 vs Epoch
  • AP@50โ€“90 vs Epoch
  • Recall (AR@50โ€“90) vs Epoch

Weights & Biases Logs

Artifacts:

  • Auto-checkpoints saved
  • Best model checkpoint saved
  • TensorBoard logs generated

Evaluation

Metrics

  • mAP@50: 83.6%
  • Precision: 86.3%
  • Recall: 82.1%

Summary

The model demonstrates strong performance for single-class phone detection, with high precision and balanced recall across evaluation splits.


Technical Specifications

Model Architecture

  • RF-DETR (Transformer-based object detection)
  • Bounding box regression + class prediction

Model Card Authors

Harsh Dadiya


Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for harshdadiya-wappnet/phone_detction_rfdetr_base

Finetuned
(1)
this model

Dataset used to train harshdadiya-wappnet/phone_detction_rfdetr_base