percyraskova/llm-training
Tags: Text Generation, Transformers, prolewiki/qa-corpus, English, grpo, rlhf, fine-tuning, marxism, political-theory, lora, deepseek, qwen
License: agpl-3.0
main: llm-training/tests/unit
169 kB, 1 contributor, History: 1 commit
percyraskova: Upload folder using huggingface_hub (81b3473, verified, 23 days ago)
__init__.py             13 Bytes  Upload folder using huggingface_hub  23 days ago
test_grpo_rewards.py    143 kB    Upload folder using huggingface_hub  23 days ago
test_train_headless.py  9.27 kB   Upload folder using huggingface_hub  23 days ago
test_wandb_logging.py   16.3 kB   Upload folder using huggingface_hub  23 days ago