MiVOLO weights mirror

Convenience HuggingFace mirror of the two MiVOLO checkpoints needed by the upstream WildChlamydia/MiVOLO inference pipeline. Upstream distributes these via Google Drive only; this mirror exists so automated tooling can huggingface_hub.snapshot_download them instead of relying on gdown.

This is a re-distribution. All credit and attribution goes to the original MiVOLO authors (Maksim Kuprashevich and Irina Tolstykh, SberDevices). Please cite their paper and use the upstream repo as the source of truth:

Files

File Size What it is
yolov8x_person_face.pt ~131 MB YOLOv8x detector for person + face bounding boxes. Standard torch.load-compatible PyTorch zip checkpoint.
mivolo_face_body.pth.tar ~105 MB MiVOLO age + gender model (face+body variant; the SOTA option in the upstream repo with reported MAE โ‰ˆ 4.24 yr face+body / 6.87 yr body-only). Despite the suffix, this is not a tar archive โ€” it's a single torch.load-compatible Python pickle. The dict contains state_dict (290 tensor params) plus metadata fields (min_age, max_age, avg_age, no_gender, with_persons_model). The .pth.tar extension is an old PyTorch-examples convention that stuck.

Both files are identical to the originals downloaded from the upstream Google Drive links cited in the MiVOLO README.

Original Drive links (for reference / diff verification):

License note

The upstream MiVOLO repo includes a top-level LICENSE file that is Apache License 2.0, but their setup.py classifier declares CC BY-SA 4.0. The two are inconsistent. The plain-text LICENSE file is typically considered the legally controlling artifact, so this mirror is tagged apache-2.0 to match. If the upstream authors clarify the discrepancy in the future, this metadata may be updated.

If you re-distribute these weights elsewhere, preserve the credit, link back to the upstream repo, and reproduce both license artifacts in your copy until the upstream authors resolve the inconsistency.

Usage from the parent project

These checkpoints are consumed by MicroMaxField's training-data safety pass (see its SAFETY.md for the design). The scripts/setup_safety.sh setup script in that repo can fetch these files via huggingface_hub instead of Google Drive.

Standalone usage (with the upstream MiVOLO repo)

import argparse
from huggingface_hub import snapshot_download
from mivolo.predictor import Predictor

ckpt_dir = snapshot_download("panopstor/MiVOLO-mirror")

config = argparse.Namespace(
    detector_weights=f"{ckpt_dir}/yolov8x_person_face.pt",
    checkpoint=f"{ckpt_dir}/mivolo_face_body.pth.tar",
    device="cuda",
    with_persons=True,
    disable_faces=False,
    draw=False,
)
predictor = Predictor(config, verbose=False)

(Note: at the time of this mirror, MiVOLO pinned timm==0.8.13.dev0 and ultralytics==8.1.0 in their requirements.txt. Newer versions of either break model instantiation โ€” see the MicroMaxField SAFETY.md for a sidecar pattern that isolates the old pins from a modern training stack.)

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Paper for panopstor/MiVOLO-mirror