koreashin/Driver_monitoring

@@ -22,14 +22,14 @@ model-index:
       name: Video Classification
     metrics:
     - type: accuracy
-      value: 0.9735
       name: Accuracy
     - type: f1
-      value: 0.9666
       name: Macro F1
 ---
-# Driver Behavior Detection Model (Epoch 5)
 A Video Swin Transformer-based model for detecting abnormal driver behavior.
@@ -44,18 +44,18 @@ model-index:
 | Label | Class | F1-Score |
 |:-----:|-------|:--------:|
-| 0 | 정상 (Normal) | 0.95 |
 | 1 | 졸음운전 (Drowsy Driving) | 0.99 |
-| 2 | 물건찾기 (Reaching/Searching) | 0.95 |
-| 3 | 휴대폰 사용 (Phone Usage) | 0.94 |
 | 4 | 운전자 폭행 (Driver Assault) | 1.00 |
-## Performance (Epoch 5)
 | Metric | Value |
 |--------|-------|
-| **Accuracy** | 97.35% |
-| **Macro F1** | 0.9666 |
 | **Validation Samples** | 1,371,062 |
 ## Training Configuration
@@ -73,42 +73,6 @@ model-index:
 | Loss | CrossEntropy + Label Smoothing (0.1) |
 | Regularization | Mixup (a=0.4), Dropout (0.3) |
-## Usage
-```python
-import torch
-from model import DriverBehaviorModel
-# Load model
-model = DriverBehaviorModel(num_classes=5, pretrained=False)
-checkpoint = torch.load("pytorch_model.bin", map_location="cpu")
-model.load_state_dict(checkpoint["model"])
-model.eval()
-# Inference
-# input: [1, 3, 30, 224, 224] - 30 frames, 224x224, RGB normalized
-with torch.no_grad():
-    output = model(video_tensor)
-    prediction = output.argmax(dim=1)
-```
-## Dataset
-- **Total Videos**: 243,979
-- **Total Samples (windows)**: 1,371,062
-- **Window Size**: 30 frames
-- **Stride**: 15 frames
-- **Resolution**: 224x224
-## Training Progress
-| Epoch | Accuracy | Macro F1 |
-|:-----:|:--------:|:--------:|
-| 2 | 95.15% | 0.9392 |
-| 3 | 96.56% | 0.9568 |
-| 4 | 96.83% | 0.9600 |
-| **5** | **97.35%** | **0.9666** |
 ## Files
 | File | Size | Description |
@@ -167,6 +131,22 @@ Resize: 224x224 (BILINEAR)
 Frames: 30 frames uniformly sampled
 ```
 ## License
 This model is for research purposes only.

       name: Video Classification
     metrics:
     - type: accuracy
+      value: 0.9805
       name: Accuracy
     - type: f1
+      value: 0.9757
       name: Macro F1
 ---
+# Driver Behavior Detection Model (Epoch 7)
 A Video Swin Transformer-based model for detecting abnormal driver behavior.
 | Label | Class | F1-Score |
 |:-----:|-------|:--------:|
+| 0 | 정상 (Normal) | 0.97 |
 | 1 | 졸음운전 (Drowsy Driving) | 0.99 |
+| 2 | 물건찾기 (Reaching/Searching) | 0.96 |
+| 3 | 휴대폰 사용 (Phone Usage) | 0.96 |
 | 4 | 운전자 폭행 (Driver Assault) | 1.00 |
+## Performance (Epoch 7)
 | Metric | Value |
 |--------|-------|
+| **Accuracy** | 98.05% |
+| **Macro F1** | 0.9757 |
 | **Validation Samples** | 1,371,062 |
 ## Training Configuration
 | Loss | CrossEntropy + Label Smoothing (0.1) |
 | Regularization | Mixup (a=0.4), Dropout (0.3) |
 ## Files
 | File | Size | Description |
 Frames: 30 frames uniformly sampled
 ```
+## Dataset
+- **Total Videos**: 243,979
+- **Total Samples (windows)**: 1,371,062
+- **Window Size**: 30 frames
+- **Stride**: 15 frames
+- **Resolution**: 224x224
+## Training Progress
+| Epoch | Accuracy | Macro F1 |
+|:-----:|:--------:|:--------:|
+| 5 | 97.35% | 0.9666 |
+| 6 | 97.74% | 0.9720 |
+| **7** | **98.05%** | **0.9757** |
 ## License
 This model is for research purposes only.
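
The Dataset section of the README lists a 30-frame window and a 15-frame stride, so consecutive windows overlap by 15 frames. A minimal sketch of that windowing follows. The function name `window_starts` is hypothetical, and dropping a trailing partial window is an assumption; the README does not say how short videos or leftover frames are handled.

```python
from typing import List


def window_starts(num_frames: int, window: int = 30, stride: int = 15) -> List[int]:
    """Return the start index of every full window in a video.

    Assumption: videos shorter than one window yield no samples, and a
    trailing partial window is dropped. The README does not confirm this.
    """
    if num_frames < window:
        return []
    # The last valid start is num_frames - window, hence the +1 on the stop.
    return list(range(0, num_frames - window + 1, stride))


# A 100-frame clip yields five overlapping 30-frame windows.
starts = window_starts(100)
print(starts)  # [0, 15, 30, 45, 60]
```

Under these assumptions a video with `n >= 30` frames contributes `(n - 30) // 15 + 1` samples.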

config.json CHANGED

@@ -11,9 +11,9 @@
     "layers": ["LayerNorm(768)", "Dropout(0.3)", "Linear(768, 5)"]
   },
   "training": {
-    "epoch": 5,
-    "accuracy": 0.9735,
-    "macro_f1": 0.9666,
     "batch_size": 32,
     "optimizer": "AdamW",
     "learning_rate": 1e-3,
@@ -23,10 +23,10 @@
     "augmentation": ["Mixup(0.4)", "RandomResizedCrop", "HorizontalFlip", "ColorJitter", "TemporalAugmentation"]
   },
   "performance": {
-    "정상": {"precision": 0.95, "recall": 0.96, "f1": 0.95},
-    "졸음운전": {"precision": 0.99, "recall": 0.99, "f1": 0.99},
-    "물건찾기": {"precision": 0.94, "recall": 0.96, "f1": 0.95},
-    "휴대폰 사용": {"precision": 0.95, "recall": 0.93, "f1": 0.94},
     "운전자 폭행": {"precision": 1.00, "recall": 1.00, "f1": 1.00}
   }
 }

     "layers": ["LayerNorm(768)", "Dropout(0.3)", "Linear(768, 5)"]
   },
   "training": {
+    "epoch": 7,
+    "accuracy": 0.9805,
+    "macro_f1": 0.9757,
     "batch_size": 32,
     "optimizer": "AdamW",
     "learning_rate": 1e-3,
     "augmentation": ["Mixup(0.4)", "RandomResizedCrop", "HorizontalFlip", "ColorJitter", "TemporalAugmentation"]
   },
   "performance": {
+    "정상": {"precision": 0.97, "recall": 0.97, "f1": 0.97},
+    "졸음운전": {"precision": 1.00, "recall": 0.99, "f1": 0.99},
+    "물건찾기": {"precision": 0.95, "recall": 0.97, "f1": 0.96},
+    "휴대폰 사용": {"precision": 0.96, "recall": 0.96, "f1": 0.96},
     "운전자 폭행": {"precision": 1.00, "recall": 1.00, "f1": 1.00}
   }
 }
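
As a quick sanity check on the updated `config.json`, macro F1 is the unweighted mean of the per-class F1 scores. The sketch below recomputes it from the two-decimal values in the new `performance` block; the helper name `macro_f1` is hypothetical. Because the per-class values are rounded, the result can only be expected to agree with the reported `0.9757` to about two decimal places.

```python
from typing import Dict


def macro_f1(per_class_f1: Dict[str, float]) -> float:
    """Unweighted mean of per-class F1 scores."""
    return sum(per_class_f1.values()) / len(per_class_f1)


# Per-class F1 values copied from the epoch 7 "performance" block.
epoch7_f1 = {
    "정상": 0.97,
    "졸음운전": 0.99,
    "물건찾기": 0.96,
    "휴대폰 사용": 0.96,
    "운전자 폭행": 1.00,
}

reported = 0.9757  # "macro_f1" in the "training" block
recomputed = macro_f1(epoch7_f1)
print(f"recomputed={recomputed:.3f} reported={reported}")
```

The recomputed mean is about 0.976, which is consistent with the reported value given the rounding of the inputs.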

model.onnx CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:036ac05c4b782d2ea1e24eb61366f197e0692ba24abaddca6798d56c1a337cec
-size 171169182

 version https://git-lfs.github.com/spec/v1
+oid sha256:5b16e8969d749bb7754b2a42daa98a4f64a6e4c42082d028111457c3abed9759
+size 171169172

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e6442f71cdbc76335fe751ada9eb7e9c4c6461d7beb82e52088bafa7e15107a5
 size 126244047

 version https://git-lfs.github.com/spec/v1
+oid sha256:db2e18ab37ceb942118a6390fce0e95220774048ec44eaca90ad5713fa1dce9c
 size 126244047
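
The Git LFS pointers above record each file's SHA-256 digest (`oid`) and byte size. Since `pytorch_model.bin` keeps the same size across this commit, the `oid` is the only way to tell the two checkpoints apart. A minimal sketch of checking a downloaded file against a pointer follows; the helper name `matches_lfs_pointer` is hypothetical and is not part of any Git LFS tooling.

```python
import hashlib
import os


def matches_lfs_pointer(path: str, expected_oid: str, expected_size: int,
                        chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the given size and SHA-256 digest."""
    # Cheap check first: a size mismatch rules the file out without hashing.
    if os.path.getsize(path) != expected_size:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so large checkpoints are not loaded into memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_oid
```

For the updated checkpoint, the call would be `matches_lfs_pointer("pytorch_model.bin", "db2e18ab37ceb942118a6390fce0e95220774048ec44eaca90ad5713fa1dce9c", 126244047)`, assuming the file has been downloaded to the working directory.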