Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
zhangtao's picture

zhangtao

zhangtao-whu
2 19 5
Agatha7k's profile picture godx7's profile picture bitersun's profile picture
·
https://github.com/zhang-tao-whu
  • zhang-tao-whu

AI & ML interests

segmentation

Recent Activity

upvoted a paper 11 days ago
PerceptionDLM: Parallel Region Perception with Multimodal Diffusion Language Models
upvoted a paper 20 days ago
OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data
upvoted a paper 21 days ago
Skill-3D: Evolving Scene-Aware Skills for Agentic 3D Spatial Reasoning
View all activity

Organizations

Wuhan Univeristy's profile picture Dense World's profile picture Path to Multimodal Generalist's profile picture

zhangtao-whu 's models 13

zhangtao-whu/vectorllm-hf

Updated Mar 27

zhangtao-whu/DW

Updated Jul 8, 2025

zhangtao-whu/ocr_vqa

Updated Jun 9, 2025 • 3

zhangtao-whu/ocr_vqa_text

Updated Jun 9, 2025

zhangtao-whu/internvl3_merge_model

8B • Updated May 9, 2025 • 2

zhangtao-whu/r1

Updated Apr 26, 2025

zhangtao-whu/st_pth

Updated Mar 7, 2025

zhangtao-whu/gpcv_pth

Updated Mar 3, 2025

zhangtao-whu/capcls1.0_1024M_imgfull_withpt_lr5e-4-0_rp0.1_iter62500_hf

Updated Feb 9, 2025

zhangtao-whu/P2PFormer

Updated Dec 3, 2024 • 2

zhangtao-whu/OMG-LLaVA

Updated Jul 3, 2024 • 7

zhangtao-whu/PCM

Updated Mar 25, 2024

zhangtao-whu/DVIS_Plus

Updated Jan 31, 2024 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs