Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
3
1
SeungWon Kook
Aiant56
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 11 hours ago
KL for a KL: On-Policy Distillation with Control Variate Baseline
upvoted
a
paper
about 11 hours ago
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
upvoted
a
paper
19 days ago
ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding
View all activity
Organizations
None yet
Aiant56
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
20 days ago
seongsubae/KorMedMCQA-V
Viewer
•
Updated
Feb 17
•
1.84k
•
477
•
8