
YM Qin

Wakals
https://wakals.github.io/

AI & ML interests

Computer Vision, Vision-language Model, Generative Model

Recent Activity

upvoted a paper 8 days ago
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning
upvoted a paper 26 days ago
Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning
upvoted a collection about 1 month ago
Qwen3.5

Organizations

None yet

Wakals's collections (1)

CoVT: Chain-of-Visual-Thought
Enrich VLMs’ vision-centric reasoning capabilities via Chain-of-Visual-Thought!
  • Wakals/CoVT-7B-seg_depth_dino

    8B • Updated Dec 5, 2025 • 1.74k • 2
  • Wakals/CoVT-7B-seg_depth_dino_edge

    8B • Updated Dec 5, 2025 • 138 • 2
  • Wakals/CoVT-7B-depth

    8B • Updated Dec 5, 2025 • 5 • 2
  • Wakals/CoVT-7B-seg

    8B • Updated Dec 5, 2025 • 25 • 1