Upload DINO pre-trained ViT-Small model
- README.md +43 -43
- config.json +0 -0
- model.safetensors +1 -1
- training_curves.png +0 -0
README.md
CHANGED
# DINO ViT-Small Custom Dataset

This model is a Vision Transformer (ViT) Small trained with DINO (self-DIstillation with NO labels) on a custom dataset.

## Model Details

- **Architecture**: ViT-Small (patch size 16)
- **Pre-training Method**: DINO
- **Training Epochs**: 10
- **Output Dimension**: 384 (see the sanity check below)
- **Dataset Size**: ~3000 images
- **Base Model**: WinKawaks/vit-small-patch16-224

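The 384-dimensional output matches ViT-Small's hidden size. As a quick sanity check, assuming the repository's config.json follows the standard `ViTConfig` layout:

```python
from transformers import ViTConfig

# Load only the configuration (no weights) to confirm the architecture details
config = ViTConfig.from_pretrained("odinson/dino-vit-small-custom")
print(config.hidden_size)  # expected: 384
print(config.patch_size)   # expected: 16
```
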
## Training Configuration

- Batch Size: 32
- Learning Rate: 0.0003
- Teacher Temperature: 0.07
- Local Crops: 4
- Weight Decay: 0.04 → 0.4 (scheduled; see the sketch after this list)
- Optimizer: AdamW
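
The weight-decay entry above gives only the endpoints, 0.04 rising to 0.4 over training. The reference DINO recipe ramps weight decay with a cosine schedule; a minimal sketch under that assumption (the schedule shape is not stated in this card):

```python
import math

def weight_decay_at(step: int, total_steps: int,
                    wd_start: float = 0.04, wd_end: float = 0.4) -> float:
    """Cosine ramp from wd_start to wd_end, as in the reference DINO recipe (assumed)."""
    progress = step / max(1, total_steps)
    return wd_end + 0.5 * (wd_start - wd_end) * (1 + math.cos(math.pi * progress))

print(weight_decay_at(0, 100))    # 0.04 at the start
print(weight_decay_at(50, 100))   # ~0.22 at the midpoint
print(weight_decay_at(100, 100))  # 0.40 at the end
```
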
## Training Results

- Final Loss: 5.8926
- Training Time: 0:06:19

## Usage

```python
from transformers import ViTModel
import torch

# Load the model
model = ViTModel.from_pretrained("odinson/dino-vit-small-custom")

# Use for feature extraction; `images` is a preprocessed batch of pixel values
model.eval()
with torch.no_grad():
    features = model(images).last_hidden_state
```
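
The snippet above leaves `images` undefined; it is expected to be a batch of preprocessed pixel values. A minimal sketch of producing that batch with `ViTImageProcessor`, assuming the base model's 224×224 / patch-16 preprocessing (WinKawaks/vit-small-patch16-224) applies to this checkpoint:

```python
import torch
from PIL import Image
from transformers import ViTImageProcessor, ViTModel

# Assumption: the base model's preprocessing matches this checkpoint
processor = ViTImageProcessor.from_pretrained("WinKawaks/vit-small-patch16-224")
model = ViTModel.from_pretrained("odinson/dino-vit-small-custom")
model.eval()

image = Image.open("example.jpg").convert("RGB")  # any RGB image
inputs = processor(images=image, return_tensors="pt")

with torch.no_grad():
    outputs = model(pixel_values=inputs["pixel_values"])

# [CLS] embedding: one 384-dimensional vector per image
cls_embedding = outputs.last_hidden_state[:, 0]
```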

## Training Curves

See the training plots in the repository for loss, learning rate, and weight decay curves.

config.json
CHANGED
The diff for this file is too large to render; see the raw diff.
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:3a0a9ecef85397f127ee22742d036a12db213b50d64a8511a7692af6eca24260
 size 87276144
training_curves.png
CHANGED