Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

juspay
/
task2file-llm

Safetensors
Model card Files Files and versions
xet
Community
task2file-llm / trainer-kit /DPO
39.9 MB
  • 1 contributor
History: 1 commit
SirajRLX's picture
SirajRLX
Upload folder using huggingface_hub
4eae728 verified about 1 month ago
  • __pycache__
    Upload folder using huggingface_hub about 1 month ago
  • QUICK_START.md
    7.29 kB
    Upload folder using huggingface_hub about 1 month ago
  • README.md
    5.22 kB
    Upload folder using huggingface_hub about 1 month ago
  • apply_critical_fixes.py
    6.78 kB
    Upload folder using huggingface_hub about 1 month ago
  • config_dpo.yaml
    3.65 kB
    Upload folder using huggingface_hub about 1 month ago
  • create_synthetic_pairs.py
    5.1 kB
    Upload folder using huggingface_hub about 1 month ago
  • dpo_dataset.jsonl
    5.67 kB
    Upload folder using huggingface_hub about 1 month ago
  • dpo_pairs_generated.jsonl
    39.8 MB
    xet
    Upload folder using huggingface_hub about 1 month ago
  • f1_score_utils.py
    9.36 kB
    Upload folder using huggingface_hub about 1 month ago
  • prepare_data.py
    12 kB
    Upload folder using huggingface_hub about 1 month ago
  • requirements.txt
    397 Bytes
    Upload folder using huggingface_hub about 1 month ago
  • run_dpo.py
    33.6 kB
    Upload folder using huggingface_hub about 1 month ago
  • run_dpo.py.backup
    31.4 kB
    Upload folder using huggingface_hub about 1 month ago
  • run_dpo_enhanced.py
    9.54 kB
    Upload folder using huggingface_hub about 1 month ago
  • test_fixes.py
    3.7 kB
    Upload folder using huggingface_hub about 1 month ago