euhidaman committed (verified) · Commit aed5fac · Parent(s): c7a0fe2

Update model - STAGE2 Epoch 1 | Loss: 5.2714

Files changed (3)
  1. README.md +8 -9
  2. pytorch_model.bin +1 -1
  3. training_info.json +4 -5
README.md CHANGED
@@ -10,7 +10,7 @@ tags:
 - tiny-vlm
 - repvit
 - tinyllm
-- stage1
+- stage2
 base_model:
 - tinyllm
 library_name: transformers
@@ -21,18 +21,17 @@ pipeline_tag: image-text-to-text
 
 **πŸ”₯ Efficient Vision-Language Model for Edge Deployment & Robotic Applications**
 
-This model is currently in training - **STAGE1 (Epoch 1)**.
+This model is currently in training - **STAGE2 (Epoch 1)**.
 
 ## πŸ“Š Current Training Status
 
-- **Stage**: Visual-Language Alignment - Learning to ground vision and language
+- **Stage**: Multimodal Instruction Tuning - Following complex instructions
 - **Epoch**: 1
-- **Last Updated**: 2026-02-01 16:00:11 UTC
+- **Last Updated**: 2026-02-01 16:01:18 UTC
 
 ### Latest Metrics
-- **captioning_loss**: 8.5561
-- **contrastive_loss**: 2.7994
-- **loss**: 5.6777
+- **instruction_loss**: 0.0000
+- **loss**: 5.2714
 
 ## πŸ—οΈ Model Architecture
 
@@ -51,7 +50,7 @@ EmberVLM follows a 4-stage training curriculum:
 3. βœ… **Stage 3: Robot Fleet Selection** - Task-robot matching
 4. ⏳ **Stage 4: Chain-of-Thought Reasoning** - Reasoning generation
 
-**Current Stage**: STAGE1
+**Current Stage**: STAGE2
 
 ## πŸ’» Usage
 
@@ -126,5 +125,5 @@ Apache 2.0
 
 ---
 
-**Note**: This is a checkpoint from stage1 training (epoch 1).
+**Note**: This is a checkpoint from stage2 training (epoch 1).
 The model will be updated after each epoch with improved performance.
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c6be11d39bd7c475a6e51883249a0a9ba175c11618e424678674cb2ef649fe66
+oid sha256:471ac551d9b840a27745efe6b15526884be81d959b0082c65c1f5e2ba9f5bd97
 size 100663623
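The LFS pointer above records the blob's SHA-256 as its `oid`; a minimal sketch for verifying a locally downloaded `pytorch_model.bin` against that digest (the file path and the idea of checking it are assumptions, not part of this commit):

```python
import hashlib

def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash the file in 1 MiB chunks so large checkpoints never load into memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

# oid from the LFS pointer in this commit
EXPECTED = "471ac551d9b840a27745efe6b15526884be81d959b0082c65c1f5e2ba9f5bd97"

# usage: sha256_of("pytorch_model.bin") == EXPECTED
```

Streaming keeps memory flat regardless of checkpoint size; a mismatch usually means a truncated download or an un-smudged LFS pointer file.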
training_info.json CHANGED
@@ -1,13 +1,12 @@
 {
-  "stage": "stage1",
+  "stage": "stage2",
   "epoch": 1,
   "metrics": {
-    "loss": 5.6777140368586005,
-    "contrastive_loss": 2.7993588654891304,
-    "captioning_loss": 8.556068959443465
+    "loss": 5.271356953514947,
+    "instruction_loss": 0.0
   },
   "carbon_emissions_kg": 0.0,
-  "timestamp": "2026-02-01T16:00:11.852746",
+  "timestamp": "2026-02-01T16:01:18.866166",
   "vision_backbone": "repvit",
   "language_backbone": "tinyllm",
   "total_parameters": 40196257,