DriveFusion
/

DriveFusion-V0.2

text-generation

AutonomousDriving

Model card Files Files and versions

OmarSamir commited on 19 days ago

Commit

f56f884

·

verified ·

1 Parent(s): fd64a39

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -37,6 +37,11 @@ Built on the **Qwen2.5-VL** foundation, DriveFusion-V0.2 adds specialized MLP he
 DriveFusion-V0.2 extends the Qwen2.5-VL architecture with a modular "Driving Intelligence" layer.
 ### Technical Specifications
 - **Text Encoder**: Qwen2.5-VL (36 Transformer layers).
 - **Vision Encoder**: 32-layer ViT with configurable patch sizes.

 DriveFusion-V0.2 extends the Qwen2.5-VL architecture with a modular "Driving Intelligence" layer.
+<div align="left">
+  <img src="drivefusion_architectures.png" alt="DriveFusion Architecture" width="700"/>
+  <p><i>The DriveFusion-V0.2 Architecture: Integrating visual tokens with telemetry-encoded tokens for dual-head output.</i></p>
+</div>
 ### Technical Specifications
 - **Text Encoder**: Qwen2.5-VL (36 Transformer layers).
 - **Vision Encoder**: 32-layer ViT with configurable patch sizes.