| # Sapiens: Image Encoder | |
| ## Model Zoo | |
| Our 1024 x 1024 resolution vision transformers. | |
| | Model | Checkpoint Path | |
| |---------------|-------------------------------------------------------------------------------------------------- | |
| | Sapiens-0.3B | `$SAPIENS_CHECKPOINT_ROOT/pretrain/checkpoints/sapiens_0.3b/sapiens_0.3b_epoch_1600.pth` | |
| | Sapiens-0.6B | `$SAPIENS_CHECKPOINT_ROOT/pretrain/checkpoints/sapiens_0.6b/sapiens_0.6b_epoch_1600.pth` | |
| | Sapiens-1B | `$SAPIENS_CHECKPOINT_ROOT/pretrain/checkpoints/sapiens_1b/sapiens_1b_epoch_173.pth` | |
| | Sapiens-2B | `$SAPIENS_CHECKPOINT_ROOT/pretrain/checkpoints/sapiens_2b/sapiens_2b_epoch_660.pth` | |
| ## Inference Guide | |
| - Navigate to your script directory: | |
| ```bash | |
| cd $SAPIENS_ROOT/pretrain/scripts/demo/local | |
| ``` | |
| - For image feature extraction (uncomment your model config line): | |
| ```bash | |
| ./extract_feature.sh | |
| ``` | |
| Define `INPUT` for your image directory and `OUTPUT` for results. The features are ```C x H x W``` dimensional and saved as .npy files to the `OUTPUT` folder. | |
| Adjust `JOBS_PER_GPU`, `TOTAL_GPUS` and `VALID_GPU_IDS` for multi-GPU configurations. | |