Chuanming/Mixtral-QLoRA-test
arxiv: 1909.08593
Branch: main
Path: Mixtral-QLoRA-test/trl/trainer (230 kB)
1 contributor
History: 1 commit
Latest commit: fa4458a, "Upload folder using huggingface_hub", by Chuanming, over 2 years ago
File                       Size      Last commit message                   Updated
__init__.py                1.4 kB    Upload folder using huggingface_hub   over 2 years ago
base.py                    1.77 kB   Upload folder using huggingface_hub   over 2 years ago
ddpo_config.py             4.89 kB   Upload folder using huggingface_hub   over 2 years ago
ddpo_trainer.py            24.9 kB   Upload folder using huggingface_hub   over 2 years ago
dpo_trainer.py             37.9 kB   Upload folder using huggingface_hub   over 2 years ago
iterative_sft_trainer.py   16.7 kB   Upload folder using huggingface_hub   over 2 years ago
ppo_config.py              8.16 kB   Upload folder using huggingface_hub   over 2 years ago
ppo_trainer.py             62.2 kB   Upload folder using huggingface_hub   over 2 years ago
reward_trainer.py          13.7 kB   Upload folder using huggingface_hub   over 2 years ago
sft_trainer.py             21.4 kB   Upload folder using huggingface_hub   over 2 years ago
training_configs.py        1.94 kB   Upload folder using huggingface_hub   over 2 years ago
utils.py                   35.2 kB   Upload folder using huggingface_hub   over 2 years ago