LLMnegotiation/TAS_gpt4.1_self_play

TAS_gpt4.1_self_play/src_code_for_reproducibility/training
280 kB
  • 1 contributor
History: 2 commits
dereckpichemila
Add files using upload-large-folder tool
580868a verified 5 months ago
Files (every entry was last changed by the commit "Add files using upload-large-folder tool", 5 months ago):
  • __pycache__
  • README.md (772 Bytes)
  • __init__.py (0 Bytes)
  • annealing_methods.py (138 Bytes)
  • credit_methods.py (12.5 kB)
  • produce_training_stats.py (9.25 kB)
  • tally_basic.py (5.94 kB)
  • tally_tokenwise.py (9.27 kB)
  • tokenize_chats.py (7.71 kB)
  • trainer_ad_align.py (19.7 kB)
  • trainer_common.py (36.5 kB)
  • trainer_independent.py (5.05 kB)
  • trainer_sum_rewards.py (4.05 kB)
  • training_data_utils.py (14.7 kB)