Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
LLMnegotiation
/
tas_rps_vanilla_ad_align
like
0
Follow
LLM Negotiation
3
Safetensors
Model card
Files
Files and versions
xet
Community
main
tas_rps_vanilla_ad_align
/
src_code_for_reproducibility
/
training
281 kB
1 contributor
History:
3 commits
Muqeeth
Add files using upload-large-folder tool
80daf44
verified
2 months ago
__pycache__
Add files using upload-large-folder tool
2 months ago
README.md
772 Bytes
Add files using upload-large-folder tool
2 months ago
__init__.py
0 Bytes
Add files using upload-large-folder tool
2 months ago
annealing_methods.py
138 Bytes
Add files using upload-large-folder tool
2 months ago
credit_methods.py
10.6 kB
Add files using upload-large-folder tool
2 months ago
tally_metrics.py
1.65 kB
Add files using upload-large-folder tool
2 months ago
tally_rollout.py
4.97 kB
Add files using upload-large-folder tool
2 months ago
tally_tokenwise.py
9.36 kB
Add files using upload-large-folder tool
2 months ago
tokenize_chats.py
5.25 kB
Add files using upload-large-folder tool
2 months ago
trainer_ad_align.py
21.4 kB
Add files using upload-large-folder tool
2 months ago
trainer_common.py
45.5 kB
Add files using upload-large-folder tool
2 months ago
trainer_independent.py
5.63 kB
Add files using upload-large-folder tool
2 months ago
trainer_sum_rewards.py
5.18 kB
Add files using upload-large-folder tool
2 months ago
training_data_utils.py
15.6 kB
Add files using upload-large-folder tool
2 months ago