videomae-base-finetuned-dogBehavior

This model is based on the VideoMae base model and was finetuned on videos of sniffer dogs detecting Parkinson's disease.

The dogs are trained to lie down when they detect Parkinson's disease in a sample. The training is done with a mix of negative and positive samples.

It achieves the following results on the evaluation set:

  • Loss: 0.0611
  • Accuracy: 0.9910

Model description

The dataset consists of video clips of 2 behavior types : lying down and exploratory sniffing the sample. The goal of the model is to detect other behavior.

Intended uses & limitations

Weights are uploaded but video clips or images are protected and cannot be shared.

Training and evaluation data

The model was trained with 25 epochs. If the dataset changes, the model is retrained for 10 epochs. The accuracy to detect the behavior of lying and sniffing is 99%. Detections that are under this confidence are considered as "other" behavior that needs further analysis in respect ot its relation to the type of sample (negative or positive).

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_ratio: 0.1
  • training_steps: 370

Training results

Training Loss Epoch Step Validation Loss Accuracy
0.0379 0.1027 38 0.0912 0.9820
0.2053 1.1027 76 0.3690 0.8559
0.2644 2.1027 114 0.0611 0.9910
0.1297 3.1027 152 0.0697 0.9820
0.1778 4.1027 190 0.1969 0.9640
0.1767 5.1027 228 0.4106 0.8919
0.1548 6.1027 266 0.2711 0.9369
0.127 7.1027 304 0.2564 0.9459
0.1019 8.1027 342 0.2746 0.9459
0.348 9.0757 370 0.2201 0.9550

Framework versions

  • Transformers 4.57.3
  • Pytorch 2.9.1+cu130
  • Datasets 4.4.2
  • Tokenizers 0.22.2

Sources

basemodel: MCG-NJU/videomae-base

misc{https://doi.org/10.48550/arxiv.2203.12602, doi = {10.48550/ARXIV.2203.12602}, url = {https://arxiv.org/abs/2203.12602}, author = {Tong, Zhan and Song, Yibing and Wang, Jue and Wang, Limin}, keywords = {Computer Vision and Pattern Recognition (cs.CV), FOS: Computer and information sciences, FOS: Computer and information sciences}, title = {VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training}, publisher = {arXiv}, year = {2022}, copyright = {Creative Commons Attribution 4.0 International} }

Many thanks to

Chang-Qing Gao and Ni Wang and Meimei Wang et al., "Sensitivity of Sniffer Dogs for a Diagnosis of Parkinson's Disease: A Diagnostic Accuracy Study" Aug. 2022. Lab: Chang-Qing Gao's Lab [Online].Available: {https://www.researchgate.net/publication/362936321_Sensitivity_of_Sniffer_Dogs_for_a_Diagnosis_of_Parkinson's_Disease_A_Diagnostic_Accuracy_Study}

Howest University for Applied Sciences, Creative Technologies and Artificial Intelligence

Downloads last month
43
Safetensors
Model size
86.2M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for IDS75912/videomae-base-finetuned-dogBehavior

Finetuned
(684)
this model

Paper for IDS75912/videomae-base-finetuned-dogBehavior