YOLO Semantic Conditioning β€” Parking Slot Detection

YOLO26n fine-tuned with CLIP-based semantic conditioning on a parking slot dataset. Text descriptions attached to bounding boxes guide neck features via InfoNCE contrastive loss.

Comparison


Results

Model mAP50 mAP50-95
Baseline (no semantic) 0.872 0.663
This model (semantic_v4) 0.890 0.660

+0.018 mAP50 over equal-epoch baseline.

Training Curves

Baseline Semantic v4
baseline semantic

Confusion Matrix

Baseline Semantic v4
baseline cm semantic cm

Semantic Alignment Visualization

Epoch 20 Epoch 30 Epoch 40 Epoch 49
e20 e30 e40 e49

semantic space


Usage

from ultralytics import YOLO
model = YOLO('best.pt')
results = model('your_image.jpg')

Training Config

  • tau=0.1 β€” InfoNCE temperature
  • sem_weight=0.2 β€” semantic loss weight
  • sem_warmup=10 β€” epochs before semantic loss activates
  • 50 epochs, batch=16, imgsz=640, A100 40GB

Links

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support