Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

percyraskova
/
llm-training

Text Generation
Transformers
English
grpo
rlhf
fine-tuning
marxism
political-theory
lora
deepseek
qwen
Model card Files Files and versions
xet
Community
llm-training / tests /unit
169 kB
  • 1 contributor
History: 1 commit
percyraskova's picture
percyraskova
Upload folder using huggingface_hub
81b3473 verified 23 days ago
  • __init__.py
    13 Bytes
    Upload folder using huggingface_hub 23 days ago
  • test_grpo_rewards.py
    143 kB
    Upload folder using huggingface_hub 23 days ago
  • test_train_headless.py
    9.27 kB
    Upload folder using huggingface_hub 23 days ago
  • test_wandb_logging.py
    16.3 kB
    Upload folder using huggingface_hub 23 days ago