rishabhjain16's picture
Upload folder using huggingface_hub
eac995c verified
# ibug.face_alignment
2D facial landmark detector based on [FAN](http://openaccess.thecvf.com/content_ICCV_2017/papers/Bulat_How_Far_Are_ICCV_2017_paper.pdf) \[1\] with some pretrained weights. Our training code is available in this repostory: [https://github.com/hhj1897/fan_training](https://github.com/hhj1897/fan_training).
## Prerequisites
* [Numpy](https://www.numpy.org/): `$pip3 install numpy`
* [OpenCV](https://opencv.org/): `$pip3 install opencv-python`
* [PyTorch](https://pytorch.org/): `$pip3 install torch torchvision`
* [ibug.face_detection](https://github.com/hhj1897/face_detection) (only needed by the test script): See this repository for details: [https://github.com/hhj1897/face_detection](https://github.com/hhj1897/face_detection).
## How to Install
```
git clone https://github.com/hhj1897/face_alignment.git
cd face_alignment
pip install -e .
```
## How to Test
* To test on live video: `python face_alignment_test.py [-i webcam_index]`
* To test on a video file: `python face_alignment_test.py [-i input_file] [-o output_file]`
## How to Use
```python
# Import the libraries
import cv2
from ibug.face_detection import RetinaFacePredictor
from ibug.face_alignment import FANPredictor
from ibug.face_alignment.utils import plot_landmarks
# Create a RetinaFace detector using Resnet50 backbone, with the confidence
# threshold set to 0.8
face_detector = RetinaFacePredictor(
threshold=0.8, device='cuda:0',
model=RetinaFacePredictor.get_model('resnet50'))
# Create a facial landmark detector
landmark_detector = FANPredictor(
device='cuda:0', model=FANPredictor.get_model('2dfan2_alt'))
# Load a test image. Note that images loaded by OpenCV adopt the B-G-R channel
# order.
image = cv2.imread('test.png')
# Detect faces from the image
detected_faces = face_detector(image, rgb=False)
# Detect landmarks from the faces
# Note:
# 1. The input image must be a byte array of dimension HxWx3.
# 2. The input face boxes must be a array of dimension Nx4, N being the
# number of faces. More columns are allowed, but only the first 4
# columns will be used (which should be the left, top, right, and
# bottom coordinates of the face).
# 3. The returned landmarks are stored in a Nx68x2 arrays, each row giving
# the X and Y coordinates of a landmark.
# 4. The returned scores are stored in a Nx68 array. Scores are usually
# within the range of 0 to 1, but could go slightly beyond.
landmarks, scores = landmark_detector(image, detected_faces, rgb=False)
# Draw the landmarks onto the image
for lmks, scs in zip(landmarks, scores):
plot_landmarks(image, lmks, scs, threshold=0.2)
```
## References
\[1\] Bulat, Adrian, and Georgios Tzimiropoulos. "[How far are we from solving the 2d & 3d face alignment problem?(and a dataset of 230,000 3d facial landmarks).](http://openaccess.thecvf.com/content_ICCV_2017/papers/Bulat_How_Far_Are_ICCV_2017_paper.pdf)" In _Proceedings of the IEEE International Conference on Computer Vision_, pp. 1021-1030. 2017.