Face Alignment Across Large Poses: A 3D Solution
Paper โข 1511.07212 โข Published
YOLO-based human head detection using the HollywoodHeads dataset and Ultralytics YOLOv8.
| mAP(50) | mAP50-95 | precision | recall | fitness |
|---|---|---|---|---|
| 0.871 | 0.529 | 0.894 | 0.796 | 0.529 |
| Dataset | mAP(50) | mAP50-95 | precision | recall | fitness |
|---|---|---|---|---|---|
| DAD-3D | 0.937 | 0.535 | 0.892 | 0.892 | 0.535 |
| 300W-LP | 0.855 | 0.265 | 0.827 | 0.883 | 0.265 |
You can view the training and testing scripts from here
from huggingface_hub import hf_hub_download
from ultralytics import YOLO
model_path = hf_hub_download(
repo_id="abhiWanKenobi/yolov8n-head-detection",
filename="yolov8n_head_detector.pt",
)
model = YOLO(model_path)
results = model("image.jpg")
@inproceedings{vu15heads,
author = {Vu, Tuan{-}Hung and Osokin, Anton and Laptev, Ivan},
title = {Context-aware {CNNs} for person head detection},
booktitle = {International Conference on Computer Vision (ICCV)},
year = {2015}}
@article{DBLP:journals/corr/ZhuLLSL15,
author = {Xiangyu Zhu and
Zhen Lei and
Xiaoming Liu and
Hailin Shi and
Stan Z. Li},
title = {Face Alignment Across Large Poses: {A} 3D Solution},
journal = {CoRR},
volume = {abs/1511.07212},
year = {2015},
url = {http://arxiv.org/abs/1511.07212},
archivePrefix = {arXiv},
eprint = {1511.07212},
timestamp = {Mon, 13 Aug 2018 16:48:23 +0200},
biburl = {https://dblp.org/rec/bib/journals/corr/ZhuLLSL15},
bibsource = {dblp computer science bibliography, https://dblp.org}
}
@inproceedings{dad3dheads,
title={DAD-3DHeads: A Large-scale Dense, Accurate and Diverse Dataset for 3D Head Alignment from a Single Image},
author={Martyniuk, Tetiana and Kupyn, Orest and Kurlyak, Yana and Krashenyi, Igor and Matas, Ji\v{r}i and Sharmanska, Viktoriia},
booktitle = {Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR)},
year={2022}
}
CC BY-NC 4.0 โ free for research, not for commercial use. Model trained on research-only datasets (HollywoodHeads, 300W-LP, DAD-3DHeads).