Hugging Face
Hacastle12
hacastle12
AI & ML interests
None yet
Recent Activity
Upvoted a paper about 11 hours ago: KL for a KL: On-Policy Distillation with Control Variate Baseline
Upvoted a paper about 11 hours ago: Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
Commented on a paper 19 days ago: ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding
Organizations
None yet
hacastle12's models (1)
hacastle12/Meta-Llama-3-70B-Instruct-BitsAndBytes
Text Generation • 73B • Updated Jul 13, 2024 • 4