Update README.md
README.md
CHANGED
@@ -8,13 +8,13 @@ Gaze following and social gaze prediction are fundamental tasks providing insigh
 
 ## Overview
 
-* **Training**: resnet18-gaze360 was trained on [
+* **Training**: resnet18-gaze360 was trained on [Gaze360](https://github.com/erkil1452/gaze360/)
 * **Backbone**: resnet18-gaze360 is adapted from [ResNet-18](https://huggingface.co/microsoft/resnet-18)
 * License: [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0.html)
 * **Parameters**: 11M
 * **Task**: MTGS performs multi-person gaze following and social gaze prediction in images and videos. Given an image or a video frame with multiple people, the model predicts where each person is looking in the scene (gaze following) and infers pair-wise social gaze interactions among individuals (social gaze prediction). Specifically, we tackle three social gaze prediction tasks:
 1. Looking at Heads (LAH): whether a person is looking at another person's head.
-2. Looking at Each Other (
+2. Looking at Each Other (LAEO): whether two people are looking at each other.
 3. Shared Attention (SA): whether two people are looking at the same target in the scene.
 * **Framework**: PyTorch Lightning
 