Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
percyraskova
/
llm-training
like
0
Text Generation
Transformers
prolewiki/qa-corpus
English
grpo
rlhf
fine-tuning
marxism
political-theory
lora
deepseek
qwen
License:
agpl-3.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
llm-training
/
training_data
11.3 MB
1 contributor
History:
1 commit
percyraskova
Upload folder using huggingface_hub
81b3473
verified
22 days ago
formatted
Upload folder using huggingface_hub
22 days ago
entity_whitelist.json
2.6 MB
Upload folder using huggingface_hub
22 days ago
entity_whitelist_clean.json
1.75 MB
Upload folder using huggingface_hub
22 days ago
grpo_dataset.jsonl
4.84 MB
Upload folder using huggingface_hub
22 days ago
training_logs_20251218_075909.csv
76.1 kB
Upload folder using huggingface_hub
22 days ago
training_logs_20251218_075909.json
330 kB
Upload folder using huggingface_hub
22 days ago