LLMnegotiation/tas_rps_vanilla_ad_align

Safetensors
tas_rps_vanilla_ad_align/src_code_for_reproducibility/training
281 kB
  • 1 contributor
History: 3 commits
Muqeeth
Add files using upload-large-folder tool
80daf44 verified 2 months ago
Last commit for every entry: Add files using upload-large-folder tool, 2 months ago
  • __pycache__ (directory)
  • README.md (772 Bytes)
  • __init__.py (0 Bytes)
  • annealing_methods.py (138 Bytes)
  • credit_methods.py (10.6 kB)
  • tally_metrics.py (1.65 kB)
  • tally_rollout.py (4.97 kB)
  • tally_tokenwise.py (9.36 kB)
  • tokenize_chats.py (5.25 kB)
  • trainer_ad_align.py (21.4 kB)
  • trainer_common.py (45.5 kB)
  • trainer_independent.py (5.63 kB)
  • trainer_sum_rewards.py (5.18 kB)
  • training_data_utils.py (15.6 kB)