Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shaojie Zhang's picture
3 6

Shaojie Zhang

zhshj0110
  • https://github.com/Eezekiel

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 months ago

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Paper • 2511.13026 • Published Nov 17, 2025 • 26
upvoted 2 papers 3 months ago

Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs

Paper • 2506.22139 • Published Jun 27, 2025 • 2

HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration

Paper • 2510.27266 • Published Oct 31, 2025 • 21
upvoted 2 papers 4 months ago

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

Paper • 2510.00406 • Published Oct 1, 2025 • 66

BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent

Paper • 2509.15566 • Published Sep 19, 2025 • 14
upvoted an article 5 months ago
view article
Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9, 2025
•
78
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs