Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
3
1
sungjiblim
sungzip
Follow
0 followers
·
2 following
sungzip
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 11 hours ago
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
upvoted
a
paper
about 11 hours ago
KL for a KL: On-Policy Distillation with Control Variate Baseline
upvoted
a
paper
17 days ago
ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding
View all activity
Organizations
sungzip
's models
1
Sort: Recently updated
sungzip/code-llama-7b-text-to-sql
Updated
Jul 28, 2024