Hugging Face
sungjiblim
sungzip
0 followers · 2 following
AI & ML interests
None yet
Recent Activity
Upvoted a paper about 10 hours ago:
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
Upvoted a paper about 10 hours ago:
KL for a KL: On-Policy Distillation with Control Variate Baseline
Upvoted a paper 17 days ago:
ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding
Organizations
Models: 1
sungzip/code-llama-7b-text-to-sql
Updated Jul 28, 2024
Datasets: 0
None public yet