rishabhjain16
/

tools-for-lip-reading

Model card Files Files and versions

tools-for-lip-reading / face_alignment /README.md

rishabhjain16's picture

Upload folder using huggingface_hub

eac995c verified 6 months ago

|

history blame contribute delete

3.03 kB

	# ibug.face_alignment
	2D facial landmark detector based on [FAN](http://openaccess.thecvf.com/content_ICCV_2017/papers/Bulat_How_Far_Are_ICCV_2017_paper.pdf) \[1\] with some pretrained weights. Our training code is available in this repostory: [https://github.com/hhj1897/fan_training](https://github.com/hhj1897/fan_training).

	## Prerequisites
	* [Numpy](https://www.numpy.org/): `$pip3 install numpy`
	* [OpenCV](https://opencv.org/): `$pip3 install opencv-python`
	* [PyTorch](https://pytorch.org/): `$pip3 install torch torchvision`
	* [ibug.face_detection](https://github.com/hhj1897/face_detection) (only needed by the test script): See this repository for details: [https://github.com/hhj1897/face_detection](https://github.com/hhj1897/face_detection).

	## How to Install
	```
	git clone https://github.com/hhj1897/face_alignment.git
	cd face_alignment
	pip install -e .
	```

	## How to Test
	* To test on live video: `python face_alignment_test.py [-i webcam_index]`
	* To test on a video file: `python face_alignment_test.py [-i input_file] [-o output_file]`

	## How to Use
	```python
	# Import the libraries
	import cv2
	from ibug.face_detection import RetinaFacePredictor
	from ibug.face_alignment import FANPredictor
	from ibug.face_alignment.utils import plot_landmarks

	# Create a RetinaFace detector using Resnet50 backbone, with the confidence
	# threshold set to 0.8
	face_detector = RetinaFacePredictor(
	threshold=0.8, device='cuda:0',
	model=RetinaFacePredictor.get_model('resnet50'))

	# Create a facial landmark detector
	landmark_detector = FANPredictor(
	device='cuda:0', model=FANPredictor.get_model('2dfan2_alt'))

	# Load a test image. Note that images loaded by OpenCV adopt the B-G-R channel
	# order.
	image = cv2.imread('test.png')

	# Detect faces from the image
	detected_faces = face_detector(image, rgb=False)

	# Detect landmarks from the faces
	# Note:
	# 1. The input image must be a byte array of dimension HxWx3.
	# 2. The input face boxes must be a array of dimension Nx4, N being the
	# number of faces. More columns are allowed, but only the first 4
	# columns will be used (which should be the left, top, right, and
	# bottom coordinates of the face).
	# 3. The returned landmarks are stored in a Nx68x2 arrays, each row giving
	# the X and Y coordinates of a landmark.
	# 4. The returned scores are stored in a Nx68 array. Scores are usually
	# within the range of 0 to 1, but could go slightly beyond.
	landmarks, scores = landmark_detector(image, detected_faces, rgb=False)

	# Draw the landmarks onto the image
	for lmks, scs in zip(landmarks, scores):
	plot_landmarks(image, lmks, scs, threshold=0.2)
	```

	## References
	\[1\] Bulat, Adrian, and Georgios Tzimiropoulos. "[How far are we from solving the 2d & 3d face alignment problem?(and a dataset of 230,000 3d facial landmarks).](http://openaccess.thecvf.com/content_ICCV_2017/papers/Bulat_How_Far_Are_ICCV_2017_paper.pdf)" In _Proceedings of the IEEE International Conference on Computer Vision_, pp. 1021-1030. 2017.