Hugging Face
xiangyuzhang
AI & ML interests
None yet
Organizations
None yet
Collections
1
CoT
s1: Simple test-time scaling
Paper • 2501.19393 • Published Jan 31, 2025 • 126
models
3
xiangyuzhang/ppo-Huggy
Reinforcement Learning • Updated Feb 20
xiangyuzhang/ppo-LunarLander-v3
Reinforcement Learning • Updated Feb 18 • 1
xiangyuzhang/SmolLM2-FT-MyDataset
Text Generation • 0.1B • Updated Jan 2, 2025 • 2
datasets
0
None public yet