Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
percyraskova
/
llm-training
like
0
Text Generation
Transformers
prolewiki/qa-corpus
English
grpo
rlhf
fine-tuning
marxism
political-theory
lora
deepseek
qwen
License:
agpl-3.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
llm-training
/
ai-docs
87.6 kB
1 contributor
History:
1 commit
percyraskova
Upload folder using huggingface_hub
81b3473
verified
23 days ago
chatbot-ideology.yaml
13 kB
Upload folder using huggingface_hub
23 days ago
finetune.yaml
11.4 kB
Upload folder using huggingface_hub
23 days ago
reward-modeling.yaml
35 kB
Upload folder using huggingface_hub
23 days ago
runpod.yaml
11.2 kB
Upload folder using huggingface_hub
23 days ago
training-schema.yaml
17 kB
Upload folder using huggingface_hub
23 days ago