Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

causal reward modeling

university
https://docs.google.com/document/u/0/?tgif=d
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

harman  submitted a paper 8 days ago
V_1: Unifying Generation and Self-Verification for Parallel Reasoners
pragsri8  published a dataset 6 months ago
causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new
pragsri8  updated a dataset 6 months ago
causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new
View all activity

Pragya Srivastava's profile picture Harman Singh's profile picture

causal-rewards 's models 2

causal-rewards/llama-3.1-8b-sft_ultrachat_200k

Text Generation • 8B • Updated Sep 16, 2025 • 1

causal-rewards/gemma2-9b_rm

9B • Updated Apr 21, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs