Run 3D human pose estimation on images
Generate 3D models from text or images
Generate a depth map from an image
Engage in multimedia chat with LLMs and ML models