videomae-base-finetuned-dogBehavior

This model is based on the VideoMae base model and was finetuned on videos of sniffer dogs detecting Parkinson's disease.

The dogs are trained to lie down when they detect Parkinson's disease in a sample. The training is done with a mix of negative and positive samples.

It achieves the following results on the evaluation set:

Loss: 0.0611
Accuracy: 0.9910

Model description

The dataset consists of video clips of 2 behavior types : lying down and exploratory sniffing the sample. The goal of the model is to detect other behavior.

Intended uses & limitations

Weights are uploaded but video clips or images are protected and cannot be shared.

Training and evaluation data

The model was trained with 25 epochs. If the dataset changes, the model is retrained for 10 epochs. The accuracy to detect the behavior of lying and sniffing is 99%. Detections that are under this confidence are considered as "other" behavior that needs further analysis in respect ot its relation to the type of sample (negative or positive).

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 8
eval_batch_size: 8
seed: 42
optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: linear
lr_scheduler_warmup_ratio: 0.1
training_steps: 370

Training results

Training Loss	Epoch	Step	Validation Loss	Accuracy
0.0379	0.1027	38	0.0912	0.9820
0.2053	1.1027	76	0.3690	0.8559
0.2644	2.1027	114	0.0611	0.9910
0.1297	3.1027	152	0.0697	0.9820
0.1778	4.1027	190	0.1969	0.9640
0.1767	5.1027	228	0.4106	0.8919
0.1548	6.1027	266	0.2711	0.9369
0.127	7.1027	304	0.2564	0.9459
0.1019	8.1027	342	0.2746	0.9459
0.348	9.0757	370	0.2201	0.9550

Framework versions

Transformers 4.57.3
Pytorch 2.9.1+cu130
Datasets 4.4.2
Tokenizers 0.22.2

Sources

basemodel: MCG-NJU/videomae-base

misc{https://doi.org/10.48550/arxiv.2203.12602, doi = {10.48550/ARXIV.2203.12602}, url = {https://arxiv.org/abs/2203.12602}, author = {Tong, Zhan and Song, Yibing and Wang, Jue and Wang, Limin}, keywords = {Computer Vision and Pattern Recognition (cs.CV), FOS: Computer and information sciences, FOS: Computer and information sciences}, title = {VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training}, publisher = {arXiv}, year = {2022}, copyright = {Creative Commons Attribution 4.0 International} }

Many thanks to

Chang-Qing Gao and Ni Wang and Meimei Wang et al., "Sensitivity of Sniffer Dogs for a Diagnosis of Parkinson's Disease: A Diagnostic Accuracy Study" Aug. 2022. Lab: Chang-Qing Gao's Lab [Online].Available: {https://www.researchgate.net/publication/362936321_Sensitivity_of_Sniffer_Dogs_for_a_Diagnosis_of_Parkinson's_Disease_A_Diagnostic_Accuracy_Study}

Howest University for Applied Sciences, Creative Technologies and Artificial Intelligence

Downloads last month: 8

Safetensors

Model size

86.2M params

Tensor type

F32

Inference Providers NEW

Video Classification

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for IDS75912/videomae-base-finetuned-dogBehavior

Base model

MCG-NJU/videomae-base

Finetuned

(689)

this model

Paper for IDS75912/videomae-base-finetuned-dogBehavior

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Paper • 2203.12602 • Published Mar 23, 2022 • 3