
Yu Zhao

yuzhaouoe
https://yuzhaouoe.github.io/
  • yuzhaouoe
  • yuzhaouoe
  • yu-zhao-b303482b3

AI & ML interests

NLP/ML

Recent Activity

upvoted a paper 1 day ago
From Pixels to Words -- Towards Native One-Vision Models at Scale
upvoted a paper 9 days ago
Pythagoras-Prover: Advancing Efficient Formal Proving via Augmented Lean Formalisation
upvoted a paper 22 days ago
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Organizations

  • EdinburghNLP - Natural Language Processing Group at the University of Edinburgh
  • hallucinations-leaderboard
  • Edinburgh Dataset Analytics Working Group
  • OpenBox
  • Mini Reasoning
  • LEMUR Decoding
  • ZWProj

commented on 3 papers over 1 year ago

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21, 2024 • 20 • 3

Analysing the Residual Stream of Language Models Under Knowledge Conflicts

Paper • 2410.16090 • Published Oct 21, 2024 • 8 • 2

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21, 2024 • 20 • 3
commented on a paper about 2 years ago

A Simple and Effective $L_2$ Norm-Based Strategy for KV Cache Compression

Paper • 2406.11430 • Published Jun 17, 2024 • 25 • 3