Wakals
/

CoVT-LLaVA-13B-depth

Model card Files Files and versions

CoVT-LLaVA-13B-depth / README.md

Wakals's picture

Update README.md

912c910 verified 3 months ago

|

history blame contribute delete

351 Bytes

	---
	license: apache-2.0
	---
	# CoVT Checkpoint (Segmentation, Depth, and DINO Aligned)

	Checkpoint of https://huggingface.co/papers/2511.19418.

	## Model Description
	This CoVT checkpoint is aligned with 4 Depth tokens, based on LLaVA-v1.5-13B.
	These task-specific tokens are integrated into the model’s embedding space to enhance 3D-awareness.