End of training

Browse files

Files changed (4) hide show

README.md +63 -0
config.json +96 -0
model.safetensors +3 -0
training_args.bin +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,63 @@

+---
+library_name: transformers
+license: apache-2.0
+base_model: microsoft/swin-base-patch4-window7-224
+tags:
+- generated_from_trainer
+datasets:
+- pascal_voc
+model-index:
+- name: multilabel_classification
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# multilabel_classification
+This model is a fine-tuned version of [microsoft/swin-base-patch4-window7-224](https://huggingface.co/microsoft/swin-base-patch4-window7-224) on the pascal_voc dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.2309
+- Roc Auc: 0.7662
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 16
+- eval_batch_size: 16
+- seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- num_epochs: 3.0
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Roc Auc |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|
+| 0.4603        | 1.0   | 157  | 0.3015          | 0.6697  |
+| 0.2922        | 2.0   | 314  | 0.2428          | 0.7435  |
+| 0.2561        | 3.0   | 471  | 0.2309          | 0.7662  |
+### Framework versions
+- Transformers 4.55.3
+- Pytorch 2.8.0+cu128
+- Datasets 3.6.0
+- Tokenizers 0.21.2

config.json ADDED Viewed

	@@ -0,0 +1,96 @@

+{
+  "architectures": [
+    "SwinForImageClassification"
+  ],
+  "attention_probs_dropout_prob": 0.0,
+  "depths": [
+    2,
+    2,
+    18,
+    2
+  ],
+  "drop_path_rate": 0.1,
+  "embed_dim": 128,
+  "encoder_stride": 32,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.0,
+  "hidden_size": 1024,
+  "id2label": {
+    "0": "Aeroplane",
+    "1": "Bicycle",
+    "2": "Bird",
+    "3": "Boat",
+    "4": "Bottle",
+    "5": "Bus",
+    "6": "Car",
+    "7": "Cat",
+    "8": "Chair",
+    "9": "Cow",
+    "10": "Diningtable",
+    "11": "Dog",
+    "12": "Horse",
+    "13": "Motorbike",
+    "14": "Person",
+    "15": "Potted plant",
+    "16": "Sheep",
+    "17": "Sofa",
+    "18": "Train",
+    "19": "Tv/monitor"
+  },
+  "image_size": 224,
+  "initializer_range": 0.02,
+  "label2id": {
+    "Aeroplane": 0,
+    "Bicycle": 1,
+    "Bird": 2,
+    "Boat": 3,
+    "Bottle": 4,
+    "Bus": 5,
+    "Car": 6,
+    "Cat": 7,
+    "Chair": 8,
+    "Cow": 9,
+    "Diningtable": 10,
+    "Dog": 11,
+    "Horse": 12,
+    "Motorbike": 13,
+    "Person": 14,
+    "Potted plant": 15,
+    "Sheep": 16,
+    "Sofa": 17,
+    "Train": 18,
+    "Tv/monitor": 19
+  },
+  "layer_norm_eps": 1e-05,
+  "mlp_ratio": 4.0,
+  "model_type": "swin",
+  "num_channels": 3,
+  "num_heads": [
+    4,
+    8,
+    16,
+    32
+  ],
+  "num_layers": 4,
+  "out_features": [
+    "stage4"
+  ],
+  "out_indices": [
+    4
+  ],
+  "patch_size": 4,
+  "path_norm": true,
+  "problem_type": "multi_label_classification",
+  "qkv_bias": true,
+  "stage_names": [
+    "stem",
+    "stage1",
+    "stage2",
+    "stage3",
+    "stage4"
+  ],
+  "torch_dtype": "float32",
+  "transformers_version": "4.55.3",
+  "use_absolute_embeddings": false,
+  "window_size": 7
+}

model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:73e56907922f512ba972b090a2d9e8e31f5d5f4fd76911440c0020184259fd4c
+size 347572616

training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:92ef616127bdcbf20e859946968ea26378f11d9cdb56f13927158423d78a830f
+size 5777