Yutao Zeng

Taoer

AI & ML interests

None yet

Organizations

Open-LLM-Researches, ICT-Golaxy, ICT-GoKnow, Open-Foundation-Models

commented a paper 8 months ago

Stepsize anything: A unified learning rate schedule for budgeted-iteration training

Paper • 2505.24452 • Published May 30, 2025 • 5 • 2
commented 4 papers 11 months ago

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6, 2025 • 21 • 8

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6, 2025 • 21 • 8

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6, 2025 • 21 • 8

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Paper • 2502.15499 • Published Feb 21, 2025 • 15 • 2
commented a paper about 1 year ago

Self-Consistency Preference Optimization

Paper • 2411.04109 • Published Nov 6, 2024 • 19 • 1
commented a paper over 1 year ago

Hyper-Connections

Paper • 2409.19606 • Published Sep 29, 2024 • 26 • 8