| # MeFEm: Medical Face Embedding Models | |
| Vision Transformers pre-trained on face data for potential medical applications. Available in Small (MeFEm-S) and Base (MeFEm-B) sizes. | |
| ## Quick Start | |
| ```python | |
| import torch | |
| import timm | |
| # Load model (MeFEm-S example) | |
| model = timm.create_model( | |
| 'vit_small_patch16_224', | |
| pretrained=False, | |
| num_classes=0, # No classification head | |
| global_pool='token' # Use CLS token (default) | |
| ) | |
| model.load_state_dict(torch.load('mefem-s.pt')) | |
| model.eval() | |
| # Forward pass | |
| x = torch.randn(1, 3, 224, 224) # Your face image | |
| embeddings = model(x) # [1, 384] CLS token embeddings | |
| ``` | |
| ## Model Details | |
| - **Architecture**: ViT-Small/16 (384-dim) or ViT-Base/16 (768-dim) with CLS token | |
| - **Training**: Modified I-JEPA on ~6.5M face images | |
| - **Input**: Face crops with 2× expanded bounding boxes, 224×224 resolution | |
| - **Output**: CLS token embeddings (`global_pool='token'`) or all tokens (`global_pool=''`) | |
| ## Usage Tips | |
| ```python | |
| # For all tokens (CLS + patches): | |
| model = timm.create_model('vit_small_patch16_224', num_classes=0, global_pool='') | |
| tokens = model(x) # [1, 197, 384] | |
| # For patch embeddings only: | |
| tokens = model.forward_features(x) | |
| patch_embeddings = tokens[:, 1:] # [1, 196, 384] | |
| ``` | |
| ## Training Data | |
| Face images from FaceCaption-15M, AVSpeech, and SHFQ datasets (~6.5M total). Images were cropped with expanded (2×) face bounding boxes. | |
| ## Notes | |
| - Optimized for face images with loose cropping | |
| - Intended for representation learning and transfer to medical tasks | |
| - Results may vary for non-face or tightly-cropped images | |
| - More info on training and metrics [here](https://arxiv.org/pdf/2602.14672) | |
| ## License | |
| CC BY 4.0. Reference paper if used: | |
| ``` | |
| @misc{borets2026mefemmedicalfaceembedding, | |
| title={MeFEm: Medical Face Embedding model}, | |
| author={Yury Borets and Stepan Botman}, | |
| year={2026}, | |
| eprint={2602.14672}, | |
| archivePrefix={arXiv}, | |
| primaryClass={cs.CV}, | |
| url={https://arxiv.org/abs/2602.14672}, | |
| } | |
| ``` | |