Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shu Yao's picture
3 6

Shu Yao

ZCODE0
·
https://yao.notion.site

AI & ML interests

None yet

Organizations

AFLeague's profile picture

upvoted 2 papers 2 months ago

Self-Reflective Generation at Test Time

Paper • 2510.02919 • Published Oct 3 • 9

Test-Time Policy Adaptation for Enhanced Multi-Turn Interactions with LLMs

Paper • 2509.23166 • Published Sep 27 • 6
upvoted a paper 6 months ago

ReDit: Reward Dithering for Improved LLM Policy Optimization

Paper • 2506.18631 • Published Jun 23 • 7
upvoted a paper 10 months ago

PAFT: Prompt-Agnostic Fine-Tuning

Paper • 2502.12859 • Published Feb 18 • 15
upvoted 2 papers about 1 year ago

Flexora: Flexible Low Rank Adaptation for Large Language Models

Paper • 2408.10774 • Published Aug 20, 2024 • 3

Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models

Paper • 2409.06277 • Published Sep 10, 2024 • 15
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs