ilessio-aiflowlab commited on
Commit
1f81ff2
·
verified ·
1 Parent(s): ffb6dc3

Upload folder using huggingface_hub

Browse files
Files changed (4) hide show
  1. README.md +24 -0
  2. hdino_t_best.pt +3 -0
  3. hdino_t_best.pth +3 -0
  4. hdino_t_best.safetensors +3 -0
README.md ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # LOKI — HDINO-T Open-Vocabulary Detector
2
+
3
+ **Model**: HDINO-T (Swin-T backbone + deformable transformer)
4
+ **Paper**: [HDINO: A Concise and Efficient Open-Vocabulary Detector](https://hf.co/papers/2603.02924)
5
+ **Training**: Fine-tuned on COCO 2017 (118K images, 80 classes)
6
+ **Best val_loss**: 2.36 (epoch 2)
7
+
8
+ ## Files
9
+ - `hdino_t_best.pth` — Full checkpoint (model + optimizer + scheduler)
10
+ - `hdino_t_best.safetensors` — Model weights only (safetensors format)
11
+ - `hdino_t_best.pt` — Model bundle (weights + config + text embeddings)
12
+
13
+ ## Training Config
14
+ - LR: 0.0001 (cosine decay)
15
+ - Batch size: 6
16
+ - Precision: bf16
17
+ - Optimizer: AdamW
18
+ - CLIP text encoder: frozen (openai/clip-vit-base-patch32)
19
+
20
+ ## Usage
21
+ Requires HDINO repo: https://github.com/HaoZ416/HDINO
22
+
23
+ ## Part of ANIMA Wave-6
24
+ Module: project_loki | Tier 2 Perception
hdino_t_best.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f7808478b6c8b11ecced55054518569df28eb7ca65462dc08599ff44e83b2c53
3
+ size 452461847
hdino_t_best.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9bc726c969c9c039da21c516c547ceaa7397507c31c0b8968099b022bd8e7423
3
+ size 830141040
hdino_t_best.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7275922af385f8e5b535bec18a91f2b6d562c727645389804514e66970cb5342
3
+ size 446387272