Depth Estimation
Transformers
Safetensors
qwen3_vl
image-text-to-text
vision-language-model
3d-vision
multimodal
qwen3-vl
Instructions to use JonnyYu828/DepthVLM-4B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use JonnyYu828/DepthVLM-4B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("depth-estimation", model="JonnyYu828/DepthVLM-4B")# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("JonnyYu828/DepthVLM-4B") model = AutoModelForImageTextToText.from_pretrained("JonnyYu828/DepthVLM-4B") - Notebooks
- Google Colab
- Kaggle
Improve model card metadata and content
#1
by nielsr HF Staff - opened
This PR improves the model card for DepthVLM-4B. Key changes include:
- Moving the ArXiv ID from the YAML metadata to the Markdown content section.
- Adding
library_name: transformersto the metadata as the model is compatible with the Transformers library. - Adding a link to the project page.
- Organizing the model description and highlights for better readability.
JonnyYu828 changed pull request status to merged