causal reward modeling
university
https://docs.google.com/document/u/0/?tgif=d
AI & ML interests
None defined yet.
Recent Activity
harman submitted a paper 8 days ago: V_1: Unifying Generation and Self-Verification for Parallel Reasoners
pragsri8 published a dataset 6 months ago: causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new
pragsri8 updated a dataset 6 months ago: causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new
Team members: 2
causal-rewards's models: 2
causal-rewards/llama-3.1-8b-sft_ultrachat_200k • Text Generation • 8B • Updated Sep 16, 2025 • 1
causal-rewards/gemma2-9b_rm • 9B • Updated Apr 21, 2025