File size: 1,126 Bytes
789eef1 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 | # Sapiens: Image Encoder
## Model Zoo
Our 1024 x 1024 resolution vision transformers.
| Model | Checkpoint Path
|---------------|--------------------------------------------------------------------------------------------------
| Sapiens-0.3B | `$SAPIENS_CHECKPOINT_ROOT/pretrain/checkpoints/sapiens_0.3b/sapiens_0.3b_epoch_1600.pth`
| Sapiens-0.6B | `$SAPIENS_CHECKPOINT_ROOT/pretrain/checkpoints/sapiens_0.6b/sapiens_0.6b_epoch_1600.pth`
| Sapiens-1B | `$SAPIENS_CHECKPOINT_ROOT/pretrain/checkpoints/sapiens_1b/sapiens_1b_epoch_173.pth`
| Sapiens-2B | `$SAPIENS_CHECKPOINT_ROOT/pretrain/checkpoints/sapiens_2b/sapiens_2b_epoch_660.pth`
## Inference Guide
- Navigate to your script directory:
```bash
cd $SAPIENS_ROOT/pretrain/scripts/demo/local
```
- For image feature extraction (uncomment your model config line):
```bash
./extract_feature.sh
```
Define `INPUT` for your image directory and `OUTPUT` for results. The features are ```C x H x W``` dimensional and saved as .npy files to the `OUTPUT` folder.
Adjust `JOBS_PER_GPU`, `TOTAL_GPUS` and `VALID_GPU_IDS` for multi-GPU configurations.
|