Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Student0809
/
interactSpeech
like
0
arxiv:
2408.05517
arxiv:
2309.00986
Model card
Files
Files and versions
xet
Community
main
interactSpeech
/
swift
/
trainers
/
rlhf_trainer
234 kB
1 contributor
History:
1 commit
Student0809
Add files using upload-large-folder tool
7feac49
verified
5 months ago
.ipynb_checkpoints
Add files using upload-large-folder tool
5 months ago
__pycache__
Add files using upload-large-folder tool
5 months ago
__init__.py
Safe
1.25 kB
Add files using upload-large-folder tool
5 months ago
cpo_trainer.py
Safe
1.29 kB
Add files using upload-large-folder tool
5 months ago
dpo_trainer.py
Safe
5.27 kB
Add files using upload-large-folder tool
5 months ago
grpo_trainer.py
70.1 kB
Add files using upload-large-folder tool
5 months ago
kto_trainer.py
Safe
2.41 kB
Add files using upload-large-folder tool
5 months ago
orpo_trainer.py
Safe
633 Bytes
Add files using upload-large-folder tool
5 months ago
ppo_trainer.py
Safe
2.44 kB
Add files using upload-large-folder tool
5 months ago
reward_trainer.py
Safe
3.58 kB
Add files using upload-large-folder tool
5 months ago
rlhf_mixin.py
Safe
5.01 kB
Add files using upload-large-folder tool
5 months ago
utils.py
Safe
5.52 kB
Add files using upload-large-folder tool
5 months ago
vllm_client.py
Safe
8.98 kB
Add files using upload-large-folder tool
5 months ago