Oxford Human Information Processing Lab

university
https://humaninformationprocessing.com


Members: Brian Christian, Elle, Jessica Thompson

Oxford-HIPlab's collections (1)

Reward Models Inherit Value Biases from Pretraining ICLR2026
Reward models and logprobs for the paper Christian et al., "Reward Models Inherit Value Biases from Pretraining" (ICLR 2026)
  • Oxford-HIPlab/BT_LoRA_skywork80k_on_gemma-2-2b-it_seed1

    Updated Sep 12, 2025
  • Oxford-HIPlab/BT_LoRA_skywork80k_on_gemma-2-2b-it_seed1-every_1

    Updated Sep 20, 2025
  • Oxford-HIPlab/BT_LoRA_skywork80k_on_gemma-2-2b-it_seed1-every_10

    Updated Sep 20, 2025
  • Oxford-HIPlab/BT_LoRA_skywork80k_on_Llama_3.2_3B_Instruct_seed1

    Updated Sep 12, 2025