Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Chuanming
/
Mixtral-QLoRA-test
like
0
arxiv:
1909.08593
Model card
Files
Files and versions
xet
Community
main
Mixtral-QLoRA-test
/
trl
/
trainer
230 kB
1 contributor
History:
1 commit
Chuanming
Upload folder using huggingface_hub
fa4458a
about 2 years ago
__init__.py
1.4 kB
Upload folder using huggingface_hub
about 2 years ago
base.py
1.77 kB
Upload folder using huggingface_hub
about 2 years ago
ddpo_config.py
4.89 kB
Upload folder using huggingface_hub
about 2 years ago
ddpo_trainer.py
24.9 kB
Upload folder using huggingface_hub
about 2 years ago
dpo_trainer.py
37.9 kB
Upload folder using huggingface_hub
about 2 years ago
iterative_sft_trainer.py
16.7 kB
Upload folder using huggingface_hub
about 2 years ago
ppo_config.py
8.16 kB
Upload folder using huggingface_hub
about 2 years ago
ppo_trainer.py
62.2 kB
Upload folder using huggingface_hub
about 2 years ago
reward_trainer.py
13.7 kB
Upload folder using huggingface_hub
about 2 years ago
sft_trainer.py
21.4 kB
Upload folder using huggingface_hub
about 2 years ago
training_configs.py
1.94 kB
Upload folder using huggingface_hub
about 2 years ago
utils.py
35.2 kB
Upload folder using huggingface_hub
about 2 years ago