Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
percyraskova
/
llm-training
like
0
Text Generation
Transformers
prolewiki/qa-corpus
English
grpo
rlhf
fine-tuning
marxism
political-theory
lora
deepseek
qwen
License:
agpl-3.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
llm-training
/
src
/
prolewiki_llm
126 kB
1 contributor
History:
1 commit
percyraskova
Upload folder using huggingface_hub
81b3473
verified
about 1 month ago
__init__.py
2.76 kB
Upload folder using huggingface_hub
about 1 month ago
convert_to_qwen.py
1.44 kB
Upload folder using huggingface_hub
about 1 month ago
export_grpo_dataset.py
6.86 kB
Upload folder using huggingface_hub
about 1 month ago
grpo_rewards.py
62.8 kB
Upload folder using huggingface_hub
about 1 month ago
train_grpo_marxist.py
10.7 kB
Upload folder using huggingface_hub
about 1 month ago
train_headless.py
16.8 kB
Upload folder using huggingface_hub
about 1 month ago
train_marxist.py
6.04 kB
Upload folder using huggingface_hub
about 1 month ago
transform_to_grpo.py
1.91 kB
Upload folder using huggingface_hub
about 1 month ago
wandb_logging.py
16.4 kB
Upload folder using huggingface_hub
about 1 month ago